最終更新:2010-06-22 (火) 22:04:13 (5416d)
ChaSen
Top / ChaSen
形態素解析ツール。
メモ
- Due to historical reasons, the default encoding of ChaSen is set to EUC-JP.If you'd like to handle text files written in UTF-8 or Shift_JIS, you may use -r and -i options.
UTF-8) chasen -r /opt/local/etc/chasenrc-UTF-8 -i w <input> Shift_JIS) chasen -r /opt/local/etc/chasenrc-Shift_JIS -i s <input>