
Mobile phone implementation of the English Stemmer: msg#00005

search.snowball

Subject: Mobile phone implementation of the English Stemmer

Hi
I wrote a couple of emails to this mailing list back in July. I am an MSc student studying Computer Science at Imperial College, London. I have just about completed my thesis project, which involved writing a mobile phone translator (English to French and French to English). Please excuse the length of this email, but I thought you might be interested in the work I have done using the Porter algorithm.

Very briefly, there is a small dictionary of words stored as part of the application on the mobile phone. A user inputs a word to be translated and the application returns the translation if the word is found in the phone dictionary. If the word is not in the dictionary, the application queries a remote dictionary and returns the translation.
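
The lookup order described above (phone dictionary first, remote dictionary as a fallback) can be sketched roughly as follows. This is only an illustration in Python, not the Java code from the project; the names `translate`, `LOCAL_DICTIONARY` and `fake_remote_lookup`, and the sample entries, are all made up for the example, and the remote query is replaced by a local stand-in function.

```python
from typing import Callable, Dict, Optional

# Hypothetical sample of the small dictionary stored on the phone.
LOCAL_DICTIONARY: Dict[str, str] = {"cat": "chat", "dog": "chien"}


def fake_remote_lookup(word: str) -> Optional[str]:
    """Stand-in for the remote dictionary query (assumption: a real
    application would make a network request here)."""
    remote_entries = {"house": "maison"}
    return remote_entries.get(word)


def translate(
    word: str,
    local_dictionary: Dict[str, str],
    remote_lookup: Callable[[str], Optional[str]],
) -> Optional[str]:
    """Return the translation from the phone dictionary if present,
    otherwise fall back to the remote dictionary."""
    key = word.lower()
    if key in local_dictionary:
        return local_dictionary[key]
    # Not found on the phone: ask the remote dictionary instead.
    return remote_lookup(key)
```

Passing the remote lookup in as a function keeps the fallback logic separate from however the network request is actually made.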

Given the constrained system resources of mobile phones, I have had to work at compressing the words to be stored on the phone. For this I used the Porter algorithm and the Java implementation from the website.
The words that make up the dictionary are stemmed and stored on the phone. When the user inputs a word, that word is then stemmed (using the Java implementation modified slightly for the mobile phone) and then matched against the stemmed words in the dictionary.
By doing this, I was able to get about 25% compression on the English words I had.
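
To illustrate the stem-then-match idea, here is a rough Python sketch. Note that `toy_stem` is NOT the Porter algorithm: it is a deliberately crude suffix stripper used as a placeholder so the example stays short, and the function names and sample entries are invented for the illustration. It also assumes that all inflected forms of a word share one stored translation, which may not match how the project handles them.

```python
from typing import Dict, Optional


def toy_stem(word: str) -> str:
    """Crude placeholder for a real stemmer (NOT the Porter algorithm):
    strips one common suffix if at least three letters remain."""
    word = word.lower()
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_stemmed_dictionary(entries: Dict[str, str]) -> Dict[str, str]:
    """Store each translation under the stem of its English word.
    Words that share a stem collapse into a single entry."""
    stemmed: Dict[str, str] = {}
    for english, french in entries.items():
        stemmed.setdefault(toy_stem(english), french)
    return stemmed


def lookup(word: str, stemmed_dictionary: Dict[str, str]) -> Optional[str]:
    """Stem the user's input, then match it against the stored stems."""
    return stemmed_dictionary.get(toy_stem(word))


def key_characters(dictionary: Dict[str, str]) -> int:
    """Total number of characters used by the dictionary keys."""
    return sum(len(key) for key in dictionary)


# Hypothetical sample entries, for illustration only.
ENTRIES = {
    "walk": "marcher",
    "walks": "marcher",
    "walked": "marcher",
    "walking": "marcher",
    "jump": "sauter",
    "jumps": "sauter",
}
STEMMED = build_stemmed_dictionary(ENTRIES)
```

Comparing `key_characters(ENTRIES)` with `key_characters(STEMMED)` shows where the saving comes from: several inflected forms are stored once, under a shorter key. The saving on this tiny sample says nothing about the figure achieved on a real word list.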

I only implemented the stemming for the English words, and therefore only the English words are compressed. I did try to implement the French stemmer, but I found it was too large for the mobile phone and more complicated.

I would like to say thank you for the excellent and informative website - it has been of great use to me in the past 3 months.

I was also wondering if you know of anyone who has implemented the stemmer on a mobile phone. If not, this would lend my project a bit of extra kudos, I have to say!

I will be finalising the code and writing the actual thesis in the next 2 weeks. If anyone is interested in the work that I have done on it, please let me know as I would be more than happy to supply the code and/or the report.

Thank you once again
Alex Duncan

