CasualMecab is a simple utility program to parse Japanese text with MeCab. You need to install MeCab and MeCab-Ruby to use this program. Check this page for how to install them. Currently this program is beta. By the way CasualMecab is also free. System Requirement: Any Mac with Mac OS X Leopard or Snow Leopard + MeCab + MeCab-Ruby
The current version of CasualMecab is 0.6.
How to useSingle This mode allows you to open a text file or copy and paste text to parse. Single mode has two text boxes. Input (left) and Output (right).  - Click Open button to open a text file in the text box on the left. You need to select the encoding of the text file.
Supported file formats (experimental) are:
Plain Text (.txt) Rich Text Format (.rft, .rtfd) MS Word (.doc, .docx) HTML (.html, htm) Web Archive (.webarchive) from Safari [WebKit] OpenOffice (.odt, .sxw)
The following text encodings are supported for plain text files:
UTF-8 - unicode (Mac Standard) SJIS - shift_jis (Windows Standard) EUC - euc-jp (Unix Standard) JIS - iso-2022-jp
- Alternatively, you can first click
the small text box next to the encoding menu and drag & drop a text
file on it. Then, click Get File button. You can edit the imported text.
- If you check Delete Aozora Rubi, text in 《》 will be deleted when the file opens. You can delete them after you open the file by clicking Delete Aozora Rubi button.
- Select how to process the text. Your choices are
MeCab - MeCab standard output Chasen - Chasen-like output Wakachi - wakachi-gaki (words separated by 1-byte space) Yomi - yomi in katakana
Then click Parse button. You can edit the parsed text.
- Click Save button and select a folder and give a name to the text file. The saved text file will be encoded in UTF-8.
Batch Batch
mode allows you to batch process text file parsing. Text file will be
created for each file with
the _mecab added (.txt -> _mecab.txt) when saved in the same folder
or saved with the same name in a different folder. Due to forced line
breaks, PDF files cannnot be batch-processed (technically, this
function can be added).  - Click Add File(s) button to add text files to the table. You can add multiple files/folders.
- Alternately, you can drag & drop text files onto the table.
- If you do not want to process some files, simply select the files on the table and click Delete Selected File(s) button. You can clear the table by clicking Clear Table button.
- If you check Delete Aozora Rubi, text in 《》 will be deleted when the files are processed.
- You can check the content of the files by checking Preview and selecting a file on the table.
- Select text encoding of the original files and the process type. You can change text encodings on the table.
- If you are sure you want to process the files on the table, click Process
button. The program prompts you to select a folder to save files. Unfortunately, with
current version, you cannot create a folder when you select a folder.
- If
you check Save to Original Folder, the text files will be created in
the same folder as the original text files when you click Process button.
|