CasualConc

© 2008-2009 Yasu Imao

CasualMecab


CasualMecab is a simple utility program to parse Japanese text with MeCab. You need to install MeCab and MeCab-Ruby to use this program.  Check this page for how to install them. Currently this program is beta. By the way CasualMecab is also free.

System Requirement: Any Mac with Mac OS X Leopard or Snow Leopard + MeCab + MeCab-Ruby


The current version of CasualMecab is 0.6.


How to use

Single

This mode allows you to open a text file or copy and paste text to parse.

Single mode has two text boxes.  Input (left) and Output (right).

  1. Click Open button to open a text file in the text box on the left. You need to select the encoding of the text file.

    Supported file formats (experimental) are:

    Plain Text (.txt)
    Rich Text Format (.rft, .rtfd)
    MS Word (.doc, .docx)
    HTML (.html, htm)
    Web Archive (.webarchive) from Safari [WebKit]
    OpenOffice (.odt, .sxw)

    The following text encodings are supported for plain text files:

    UTF-8 - unicode (Mac Standard)
    SJIS - shift_jis (Windows Standard)
    EUC - euc-jp (Unix Standard)
    JIS - iso-2022-jp


  2. Alternatively, you can first click the small text box next to the encoding menu and drag & drop a text file on it.  Then, click Get File button.  You can edit the imported text.

  3. If you check Delete Aozora Rubi, text in 《》 will be deleted when the file opens.  You can delete them after you open the file by clicking Delete Aozora Rubi button.

  4. Select how to process the text. Your choices are

    MeCab - MeCab standard output
    Chasen - Chasen-like output
    Wakachi - wakachi-gaki (words separated by 1-byte space)
    Yomi - yomi in katakana

    Then click Parse button. You can edit the parsed text.

  5. Click Save button and select a folder and give a name to the text file. The saved text file will be encoded in UTF-8.


Batch

Batch mode allows you to batch process text file parsing.  Text file will be created for each file with the _mecab added (.txt -> _mecab.txt) when saved in the same folder or saved with the same name in a different folder.  Due to forced line breaks, PDF files cannnot be batch-processed (technically, this function can be added).

  1. Click Add File(s) button to add text files to the table.  You can add multiple files/folders.

  2. Alternately, you can drag & drop text files onto the table.

  3. If you do not want to process some files, simply select the files on the table and click Delete Selected File(s) button.  You can clear the table by clicking Clear Table button.

  4. If you check Delete Aozora Rubi, text in 《》 will be deleted when the files are processed.

  5. You can check the content of the files by checking Preview and selecting a file on the table.

  6. Select text encoding of the original files and the process type.  You can change text encodings on the table.

  7. If you are sure you want to process the files on the table, click Process button.  The program prompts you to select a folder to save files.  Unfortunately, with current version, you cannot create a folder when you select a folder.

  8. If you check Save to Original Folder, the text files will be created in the same folder as the original text files when you click Process button.