Omgili, forum search, forums search, search forums, discussion search,discussions search, search discussions, board search, boards search, search boards
  Advanced Search

Join between indexes

On Wed, 1 Feb 2012 06:05:11 -0800 (PST), Arnon Mazza <...@yahoo.com

Assume we have a Lucene index over which several types of analyses are performed.
 
Assume that the conclusions of some analysis require that new tokens be added to existing documents in the index.
For example, a repeating pattern p (sequence of words) that appears in a large part of the documents should be tagged in every document in its exact position.
 
Now it is required to execute proximity queries involving standard terms and also the pattern p (e.g. find all documents in which the word "hello" is adjacent to the pattern p).
 
Is there a way of achieving this without re-indexing all the documents where the pattern p was found ?
In other words, is it possible to maintain a separate index that would keep only patterns- 
If not, is there a plan to support this in the future ?
 
Thanks,
Arnon.



On Wed, 1 Feb 2012 17:09:39 +0100, Simon Willnauer <...@googlemail.com

maybe this link will help: http://bit.ly/AhwIw6

simon

On Wed, Feb 1, 2012 at 3:05 PM, Arnon Mazza <...@yahoo.com
---------------------------------------------------------------------
To unsubscribe, e-mail: java...@lucene.apache.org
For additional commands, e-mail: java...@lucene.apache.org

On Wed, 1 Feb 2012 18:56:18 +0100, "Francisco A. Lozano" <...@gmail.com

Wow, thanks for pointing this out, didn't know such a feature was in progress.

I see a mention that there are some chances this will be released in
3.6... crossing my fingers :)

Francisco A. Lozano

On Wed, Feb 1, 2012 at 17:09, Simon Willnauer
<...@googlemail.com
---------------------------------------------------------------------
To unsubscribe, e-mail: java...@lucene.apache.org
For additional commands, e-mail: java...@lucene.apache.org

On Thu, 2 Feb 2012 12:56:44 -0800 (PST), Arnon Mazza <...@yahoo.com

Thanks, that's a very nice feature.
 
Would it also enable joining on the docId level, meaning that part of a document is kept in some index and another part of the same document is kept in another index ?
 
In the example that was given in the articles & comments link, that could be for instance:
articles index:
- docId=1: "(1) this (2) paper (3) is (4) about (5) lucene". (numbers are positions in the doc).
comments index:
- docId=1: "(3) very (4) recommended".
 
So that one would be able to know that the comment "very recommended" was written next to the word "paper".
(Conceptually the query could be: articles.paper NEAR comments."very recommended").
 
Is this also part of the feature ?
 
Thanks,
Arnon.

From: Francisco A. Lozano <...@gmail.comTo: java...@lucene.apache.org
Sent: Wednesday, February 1, 2012 7:56 PM
Subject: Re: Join between indexes

Wow, thanks for pointing this out, didn't know such a feature was in progress.

I see a mention that there are some chances this will be released in
3.6... crossing my fingers :)

Francisco A. Lozano

On Wed, Feb 1, 2012 at 17:09, Simon Willnauer
<...@googlemail.com
---------------------------------------------------------------------
To unsubscribe, e-mail: java...@lucene.apache.org
For additional commands, e-mail: java...@lucene.apache.org

Discussion Title: Join between indexes
Title Keywords: Join  between  indexes