|
|
Choosing A Webhost: |
Re: Snowball API versioning: msg#00002search.snowball
Oleg Bartunov wrote: Hi there, Some kind of versioning would indeed be a good idea, but it's not clear to me what the API changes you're referring to are: as far as I can see, these are the ways in which the code accessible from snowball changes: 1. Internal changes to the compiler, resulting in the generated stemmer code being different, but behaving the same. 2. New features being added to the snowball language, but old .sbl files will still produce equivalent output. 3. Changes to the definition of the snowball language, resulting in .sbl files no-longer producing equivalent output. 4. Changes to a snowball script, such that it produces different output. 5. Changes to the libstemmer interface (ie, the libstemmer.h file, for C). IIRC, there have been several changes of type 1, but none of 2 or 3 in recent months/years. There have been no changes to libstemmer.h since August 2005. Therefore, I suspect you're talking about changes of type 4. I would like to add versioning to the stemming algorithms at some point, such that each change to an algorithm increments the version number, but haven't had time to do this yet. Also, I would like to modify the libstemmer interface such that the current version of a stemming algorithm can be obtained, and also such that a particular version of a stemming algorithm can be requested. It would also be possible to compile a version of libstemmer such that several old versions of a particular stemmer were available. This would allow a database to store the stemmer version number which was used to index with, so that searches can use the same stemmer version. However, a newly created database would simply use the latest stemmer version. Again, I simply haven't had time to do this yet. For now, I recommend that you simply take a copy of libstemmer into your distribution, and update that static version of libstemmer as appropriate when you make new releases of your distribution. I don't think that a major.minor versioning scheme would be appropriate here, but maybe you are thinking of something different to me (in which case, please enlighten me). -- Richard
|
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | Snowball API versioning, Oleg Bartunov |
|---|---|
| Next by Date: | Spanish stemmer with accents stripped before stemming, Andrew Green |
| Previous by Thread: | Snowball API versioning, Oleg Bartunov |
| Next by Thread: | Spanish stemmer with accents stripped before stemming, Andrew Green |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
Free MagazinesCisco NewsReceive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business. subscribe Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field. subscribe The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business. subscribe Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company. subscribe Total Telecom Total Telecom is "The Economist of the communications industry". subscribe |