|
romaji word list: msg#00072science.linguistics.corpora
Hi all, I've been searching around the web for a word frequency list for Japanese, in romaji (Latin letters). I haven't had any luck but I did find a nifty web based converter from kanji, hiragana, katakana to romaji, and thought I would pass on the address. http://kanjidict.stc.cx/kakasifilt.php Since I don't know Japanese, I found that if I searched generic English terms like hiv, gene, u.s.a., cancer, etc in google's Japanese only search I got some reasonable pages to convert. Cnn jp was also a good soucre: http://www.cnn.co.jp/ It's a bit time consuming and balanced this corpus might not be. But if you are looking for stop words, like I am, this method seems to work ok - better than no frequency list at all. And if there is a romaji frequency list out there, could you let me know :) If any one has suggestions for Korean, that would be great, too. BTW, njstar's Chinese word processing software, which I think is still free, will convert a text of characters to pinyin (among other cool things). www.njstar.com/ Best, Jerry _________________________________________________________________ Express yourself instantly with MSN Messenger! Download today it's FREE! http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/ |
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | Re: RE: Numpties and bennies: 00072, Nicholas Sanders |
|---|---|
| Next by Date: | Job announcement reminder: Postdoctoral Research Fellow/Senior Research Fellow in Question Answering: 00072, Constantin Orasan |
| Previous by Thread: | [Cluj-Napoca, Romania] KEPT 2007: The First International Conference on Knowledge Engineering: Principles and Techniquesi: 00072, Rada Mihalcea |
| Next by Thread: | Job announcement reminder: Postdoctoral Research Fellow/Senior Research Fellow in Question Answering: 00072, Constantin Orasan |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
| News | FAQ | advertise |