最終更新:2010-06-22 (火) 22:04:13 (5050d)  

ChaSen
Top / ChaSen

形態素解析ツール。

メモ

  • Due to historical reasons, the default encoding of ChaSen is set to EUC-JP.If you'd like to handle text files written in UTF-8 or Shift_JIS, you may use -r and -i options.
UTF-8)     chasen -r /opt/local/etc/chasenrc-UTF-8 -i w <input>
Shift_JIS) chasen -r /opt/local/etc/chasenrc-Shift_JIS -i s <input>

参考