Please take our Survey
logo       

Choosing A Webhost:
A web hosting service is a type of Internet hosting service that allows individuals and organizations to provide their own website accessible via the World Wide Web. Web hosts are companies that provide space on a server they own for use by their clients as well as providing Internet connectivity, typically in a data center. Web hosts can also provide data center space and connectivity to the Internet for servers they do not own to be located in their data center, called colocation. more...

Re: Slovene stemmer: msg#00011

search.snowball

Subject: Re: Slovene stemmer

Hello Martin,

No problem. I was just wondering what's going on.
Thanks for the answer,

Boštjan

Martin Porter wrote:
Bostjan,

Yes, I have always kept in mind the possibility of putting your Slovene
stemmer among the various snowball stemmers at snowball.tartarus.org.

Various things arose:

I recall that when I asked you for a sample vocab, you sent back a small
text in Slovene (the beginning of a translation of Orwell's 1984, if I
remember correctly), and what I wanted to do was to put together a larger
wordlist, in alphabetical order, derived from a more substantial set of
texts, and then try your stemmer out.

I also wanted to rework your program to use 'among' statements. As I said at
the time, this would make it run really fast.

One thing that struck me about your stemmer is that (again, if I remember
right) the rules were not based on any measure of syllable length. For the
Snowball stemmers syllable length has proved quite useful -- although for
the Russian one it was less important. I wanted to see how far that mattered.

I also wanted to look at the Willett Popovic paper again.

Unfortunately, I have not had too much time to devote to Snowball over the
past six months, so none of this was done. But I would still like to tackle
it. Could you perhaps give me a little more time? Just another month ...

Martin



_______________________________________________
Snowball-discuss mailing list
Snowball-discuss@xxxxxxxxxxxxxxxxxx
http://lists.tartarus.org/mailman/listinfo/snowball-discuss


<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

Recently Viewed:
qnx.openqnx.dev...    gcc.libstdc++.c...    solaris.opensol...    information-ret...    misc.misterhous...    web.catalyst.ge...    apache.webservi...    redhat.release....    hardware.lirc/2...    kernel.autofs/2...    technology.sust...    linux.vdr/2003-...    editors.lyx.gen...    org.user-groups...    netbsd.devel.pk...    xdg.devel/2004-...    version-control...    jakarta.slide.d...    debian.packages...    creativecommons...    ports.ppc.embed...    bug-tracking.bu...   
Home | blog view | USPTO Patent Archive | advertise | OSDir is an inevitable website. super tiny logo

Free Magazines

Cisco News
Receive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business.
subscribe

Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field.
subscribe

The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business.
subscribe

Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company.
subscribe

Total Telecom Total Telecom is "The Economist of the communications industry".
subscribe