|
|
Choosing A Webhost: |
Questions about english stemmer & the apostrophe: msg#00022search.snowball
Question: I'm sure this has been discussed before... I tried a google search on the snowball-discuss archive with no luck. Is there a rationale for behavior below on words with the apostrophe? bagpipe -> bagpip bagpipe's -> bagpipe' bagpipes -> bagpip bakeries -> bakeri bakeries' -> bakeries' bakery -> bakeri bakery's -> bakery' bakerys -> bakeri //This isn't a word - but the form is OK sometimes. I looked at several older versions of various (porter derived) english stemmers, all have this behavior. One could argue that when the apostrophe is used an IR application would want to preserve the original noun. Apostrophes are used to denote possession by an entity, and the generalization of stemming 'bakery's -> bakeri' would be inappropriate. Since stemming is used to generalize word forms... you could also argue that the possessive form should be generalized as well. Eh??? Thanks! Neal Richter Knowledgebase Developer RightNow Technologies, Inc. Customer Service for Every Web Site Office: 406-522-1485
|
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | Re: Problems with -Wall flag, Martin Porter |
|---|---|
| Next by Date: | Re: Questions about english stemmer & the apostrophe, Martin Porter |
| Previous by Thread: | Problems with -Wall flag, Neal Richter |
| Next by Thread: | Re: Questions about english stemmer & the apostrophe, Martin Porter |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
Free MagazinesCisco NewsReceive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business. subscribe Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field. subscribe The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business. subscribe Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company. subscribe Total Telecom Total Telecom is "The Economist of the communications industry". subscribe |