|
|
Choosing A Webhost: |
Re: searching and sorting by date: msg#00124search.xapian.general
Querying is more complex than it needs to be. We could easily add I think that would be very useful.
It's a sample of how one uses Xapwrap. Sorry I wasn't clear about that. I'll take a guess and say you're adding "foo bar bang" (term generated, stemmed), Yep Looks to me like you don't want to use the omega term style Xapwrap manages those mappings for you. That's one of the really nice things it does out of the box. When a document is indexed, the keys of "Keywords" are remembered in a dictionary and the query parser is automatically configured with the appropriate prefixes. You can either save/restore the mapping from a dictionary (which I use), or Xapwrap has support for storing its metadata in document==1 in the xapian database. As to term generation, the procedure you explained in your previous message (very thoroughly, thank you!) to generate terms according to the same capitalization convention is done by Xapwrap already, at least as far as I can tell (there are probably some differences). It handles and prefixes text and keywords and has classes for terms and values as well. In this case, your Index.search() method won't be able By default in Xapwrap it just prints the score and document id. You can access values of the document form the result, so if title was an existing value: print result['values']['title'] would print the title of the matching document. Note that Xapian doesn't currently include term generators for Right. While the term generation explanation makes sense once it's explained, it's a tough concept for a new user to jump over right away. Xapwrap is, as I understand, intended to give a simple interface to Well that would be great, Xapwrap currently indexes terms along the lines of the capitalization scheme you described, so I don't think it would be difficult to move my concepts from one to the other. Again, maybe that's me - I like to know what's going into my database :-) Xapwrap does not occlude the database from you, the index classes they provide are just wrappers, and the database objects themselves are easily accessed via the 'db' attribute. it gives you quite a bit of flexibility on what terms get generated, and when that fails, hey, it's Python. ;) But I know how you feel, libraries of a very raw nature have their own set of risks that are unacceptable for many applications.
every day! -Michel
|
|
| <Prev in Thread] | Current Thread | [Next in Thread> |
|---|---|---|
| Previous by Date: | Re: (no subject), James Aylett |
|---|---|
| Next by Date: | php bindings error, Ryan Mahoney |
| Previous by Thread: | Re: Re: searching and sorting by date, James Aylett |
| Next by Thread: | Re: Re: searching and sorting by date, James Aylett |
| Indexes: | [Date] [Thread] [Top] [All Lists] |
Free MagazinesCisco NewsReceive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business. subscribe Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field. subscribe The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business. subscribe Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company. subscribe Total Telecom Total Telecom is "The Economist of the communications industry". subscribe |