Software‎ > ‎

MLCT



MLCT
(Multilingual Corpus Toolkit) is a JAVA software package with a GUI (Graphical User Interfce). It provides various useful functionalities for building and processing corpora, including frequency list extraction, concordancing list extraction, statistical collocation extraction, words/tags pattern sequence frequency extraction, encoding conversion etc.

This software is not a fully-fledged automatic system, but with some knowledge and skills of regular expression, it can become a highly flexible and useful tool for building and analyzing your own corpus data.

To run the program, user needs to install the Java Runtime Environment (JRE), which is freely available from Oracle website

After installing the JRE, download the zipped file mlct_public.zip and unzip it. Open the folder and click on the file "mlct_public.jar" to start the graphical desktop application.

For a user's guide, please read the Readme file, or contact me at email: s.piao - at - lancaster.ac.uk

Reference paper:
Piao, Scott, Andrew Wilson and Tony McEnery (2002). A Multilingual Corpus Toolkit, AAACL-2002, Indianapolis, Indiana, USA.

COPYRIGHT STATEMENT: PERMISSION IS GRANTED TO USE THIS SOFTWARE, FREE OF CHARGE, FOR NON-PROFITABLE RESEARCH PURPOSES. PERMISSION MUST BE OBTAINED FROM THE COPYRIGHT HOLDER (DR SCOTT SONGLIN PIAO) FOR ANY OTHER PURPOSES.