-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Currently the precedence/near operator has either no named argument (and thus is defined on the token precedence) or has the specific name of the segmentation chain. In cases where you search e.g. for "the" . "house"
and there are segmentations in the corpus also the segmentations will be search for the annotation values "the" and "house". Unfortunately there is no "any segmentation" counter-part for the operator itself. My suggestion is to use an character that is not allowed as ID to mark this. In SQL there would be only a check that both segmentation names are equal.
My suggestions for the character are:
"the" .~ "house"
"the" .? "house"
"the" .+ "house"
"the" .@ "house"
"the" .= "house"
All of them have advantages and disadvantes, like some have semantically similar meaning in regular expressions (like "+"), some are used in AQL already and some would be completely new and therefore possible confusing. My current favourite is ".=" since it would express that both segmentations need to be the same (as a kind of binding).
@amir-zeldes, @CarolinOdebrecht Do you have any ideas what syntax would be the best?