Re: Stemming 'communing' and 'communed' (search.snowball, msg#00009)
On 3/29/07, Martin Porter <martin.porter@xxxxxxxxxxxxxxx> wrote:
> ... my algorithm stems it to "commun". I have run through the spec

Thanks for the reply! I'm definitely planning to contribute the PHP version to the community once I am confident it performs well in a production setting.

I currently have 'gener', 'commun', and 'arsen' as the exceptions you reference. If I understand correctly, you are saying that I should always treat these exceptional prefixes as short syllables? It is not clear to me from reading the spec's definitions of short syllables and short words that I should be doing this. Rather, it reads as though the only difference is in the setting of R1, which is not intrinsically linked to the definition of short syllables or short words in the spec.

So I am just looking for a little more clarification, so that I can try to future-proof my code against additional exceptional prefixes that may be added down the road.

Best regards,
Michael
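To make the question concrete, here is a minimal sketch of the reading described above, in which the exceptional prefixes only change where R1 starts and the short-word test then follows from R1. It is written in Python for brevity rather than PHP. The names `r1_start`, `ends_in_short_syllable`, `is_short_word`, and `R1_PREFIX_EXCEPTIONS` are hypothetical, and the sketch assumes the word is already lowercased with consonant-y marked as `Y`. It illustrates one interpretation of the spec, not a confirmed one.

```python
VOWELS = set("aeiouy")

# Assumption: the exceptional prefixes live in one table, so that prefixes
# added to the spec later only require a change here.
R1_PREFIX_EXCEPTIONS = ("gener", "commun", "arsen")


def r1_start(word, exceptions=R1_PREFIX_EXCEPTIONS):
    """Return the index at which R1 begins (len(word) means R1 is null)."""
    # Exceptional prefixes: R1 is the remainder of the word after the prefix.
    for prefix in exceptions:
        if word.startswith(prefix):
            return len(prefix)
    # Standard rule: R1 starts after the first non-vowel that follows a vowel.
    for i in range(1, len(word)):
        if word[i] not in VOWELS and word[i - 1] in VOWELS:
            return i + 1
    return len(word)


def ends_in_short_syllable(word):
    """Short-syllable test on the end of the word, forms (a) and (b)."""
    n = len(word)
    if n == 2:
        # (b) a vowel at the beginning of the word followed by a non-vowel
        return word[0] in VOWELS and word[1] not in VOWELS
    if n >= 3:
        # (a) non-vowel, vowel, non-vowel other than w, x or Y
        before, vowel, after = word[-3], word[-2], word[-1]
        return (before not in VOWELS and vowel in VOWELS
                and after not in VOWELS and after not in "wxY")
    return False


def is_short_word(word, r1):
    """A word is short if it ends in a short syllable and R1 is null."""
    return ends_in_short_syllable(word) and r1 >= len(word)


r1 = r1_start("communing")                           # exception applies
r1_plain = r1_start("communing", exceptions=())      # standard rule only
print(r1, r1_plain)
print(is_short_word("commun", r1), is_short_word("commun", r1_plain))
```

Under this reading, "commun" (what remains of "communing" once "ing" is removed) counts as a short word only because the exception leaves R1 null at that point. The short-syllable test itself never looks at the prefix table.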