noise-words

sys.dm_fts_parser sql full text

我怕爱的太早我们不能终老 提交于 2020-01-02 10:02:22
问题 We having a really hard time to figure out two similar strings given to sys.dm_fts_parser gives different results select * from sys.dm_fts_parser('"0 CAD"', 0, null, 0) seems to think that "0 CAD" is one token (returns 2 token) select * from sys.dm_fts_parser('"0 cad"', 0, null, 0) returns 3 tokens - correctly more importantly and even more confusing is why select * from Table where contains(*,"point 5 CAD") works and select * from Table where contains(*,"point 5 cad") fails where the column

sys.dm_fts_parser sql full text

戏子无情 提交于 2020-01-02 10:00:11
问题 We having a really hard time to figure out two similar strings given to sys.dm_fts_parser gives different results select * from sys.dm_fts_parser('"0 CAD"', 0, null, 0) seems to think that "0 CAD" is one token (returns 2 token) select * from sys.dm_fts_parser('"0 cad"', 0, null, 0) returns 3 tokens - correctly more importantly and even more confusing is why select * from Table where contains(*,"point 5 CAD") works and select * from Table where contains(*,"point 5 cad") fails where the column

ANTLR on a noisy data stream

对着背影说爱祢 提交于 2019-12-20 03:35:13
问题 I'm very new in the ANTLR world and I'm trying to figure out how can I use this parsing tool to interpret a set of "noisy" string. What I would like to achieve is the following. let's take for example this phrase : It's 10PM and the Lazy CAT is currently SLEEPING heavily on the SOFA in front of the TV What I would like to extract is CAT , SLEEPING and SOFA and have a grammar that match easily the following pattern : SUBJECT - VERB - INDIRECT OBJECT... where I could define VERB : 'SLEEPING' |

ANTLR on a noisy data stream Part 2

别说谁变了你拦得住时间么 提交于 2019-12-11 10:18:10
问题 Following a very interesing discussion with Bart Kiers on parsing a noisy datastream with ANTLR, I'm ending up with another problem... The aim is still the same : only extracting useful information with the following grammar, VERB : 'SLEEPING' | 'WALKING'; SUBJECT : 'CAT'|'DOG'|'BIRD'; INDIRECT_OBJECT : 'CAR'| 'SOFA'; ANY : . {skip();}; parse : sentenceParts+ EOF ; sentenceParts : SUBJECT VERB INDIRECT_OBJECT ; a sentence like it's 10PM and the Lazy CAT is currently SLEEPING heavily on the

sys.dm_fts_parser sql full text

时光怂恿深爱的人放手 提交于 2019-12-06 05:11:43
We having a really hard time to figure out two similar strings given to sys.dm_fts_parser gives different results select * from sys.dm_fts_parser('"0 CAD"', 0, null, 0) seems to think that "0 CAD" is one token (returns 2 token) select * from sys.dm_fts_parser('"0 cad"', 0, null, 0) returns 3 tokens - correctly more importantly and even more confusing is why select * from Table where contains(*,"point 5 CAD") works and select * from Table where contains(*,"point 5 cad") fails where the column searched contains "point 5 CAD" - Shouldn't the full text index builder either ignore noise words (e.g.