Posted by: manchucka | February 18, 2008

Update: Cross-platform concordance software

Multi-lingual Corpus Toolkit

In a previous post we looked at the concordance software AntConc, which, despite its many qualities, runs with great difficulty limps on OS X when trying to use it for Chinese. Now we present an alternative program, MLCT (Multi-lingual Corpus Toolkit). This tiny program runs on the Java Runtime Environment (JRE), which makes it cross-platform. There is no problem typing Chinese characters with your native input method. I can report that I have successfully built a concordance with a Chinese text in OS X, Windows XP, and Ubuntu Linux.

Direct link to .zip file here.

See below for more information:


Make sure to choose the correct formatting, which in most cases should be UTF-8. When in doubt, open your .txt file in a text editor or word processor and resave it with UTF-8 formatting.

formatting

To install in Ubunbtu, copy and paste this command into Terminal:

sudo apt-get install sun-java6-jre sun-java6-plugin sun-java6-fonts

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Categories

%d bloggers like this: