
The default stopword list excluded from MySQL full-text indexing (ft_stopword_file => ft_precompiled_stopwords)


    MySQL supports full-text indexing and full-text search. (For a detailed introduction to MySQL full-text search, see the official manual.)

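    For full-text indexes, MySQL reads the stopword list (the words that are not full-text indexed) from the file named by the ft_stopword_file variable, one word per line. Setting the variable to the empty string ('') disables the stopword list.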
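    To make the "one word per line" layout concrete, here is a minimal C sketch that splits such a list into words. It assumes the file contents have already been read into memory as a string; the names split_stopword_lines and MAX_WORD_LEN are hypothetical, and this illustrates the format as described above rather than the parser MySQL itself uses.

```c
#include <stddef.h>
#include <string.h>

#define MAX_WORD_LEN 64   /* hypothetical per-word limit for this sketch */

/* Split `text` into words, one per line, storing at most `max_words`.
   Blank lines and lines of MAX_WORD_LEN characters or more are skipped.
   Returns the number of words stored. */
size_t split_stopword_lines(const char *text,
                            char words[][MAX_WORD_LEN],
                            size_t max_words)
{
    size_t n = 0;

    while (*text != '\0' && n < max_words) {
        size_t len = strcspn(text, "\r\n");   /* length of the current line */

        if (len > 0 && len < MAX_WORD_LEN) {
            memcpy(words[n], text, len);
            words[n][len] = '\0';
            n++;
        }
        text += len;
        text += strspn(text, "\r\n");         /* skip the line terminator(s) */
    }
    return n;
}
```

    An empty input yields zero words, which loosely mirrors the effect of an empty setting: nothing is filtered.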

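    This variable defaults to built-in, meaning MySQL uses its own compiled-in default stopword list. Judging from the source code of MySQL 4 through MySQL 6, the default list is the same in every version, 545 entries in total, listed below: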

Code snippet:
const char *ft_precompiled_stopwords[] = {
    /* This particular stopword list was taken from SMART distribution
     ftp://ftp.cs.cornell.edu/pub/smart/smart.11.0.tar.Z
     it was slightly modified to my taste, though
     */
     "a's",
     "able",
     "about",
     "above",
     "according",
     "accordingly",
     "across",
     "actually",
     "after",
     "afterwards",
     "again",
     "against",
     "ain't",
     "all",
     "allow",
     "allows",
     "almost",
     "alone",
     "along",
     "already",
     "also",
     "although",
     "always",
     "am",
     "among",
     "amongst",
     "an",
     "and",
     "another",
     "any",
     "anybody",
     "anyhow",
     "anyone",
     "anything",
     "anyway",
     "anyways",
     "anywhere",
     "apart",
     "appear",
     "appreciate",
     "appropriate",
     "are",
     "aren't",
     "around",
     "as",
     "aside",
     "ask",
     "asking",
     "associated",
     "at",
     "available",
     "away",
     "awfully",
     "be",
     "became",
     "because",
     "become",
     "becomes",
     "becoming",
     "been",
     "before",
     "beforehand",
     "behind",
     "being",
     "believe",
     "below",
     "beside",
     "besides",
     "best",
     "better",
     "between",
     "beyond",
     "both",
     "brief",
     "but",
     "by",
     "c'mon",
     "c's",
     "came",
     "can",
     "can't",
     "cannot",
     "cant",
     "cause",
     "causes",
     "certain",
     "certainly",
     "changes",
     "clearly",
     "co",
     "com",
     "come",
     "comes",
     "concerning",
     "consequently",
     "consider",
     "considering",
     "contain",
     "containing",
     "contains",
     "corresponding",
     "could",
     "couldn't",
     "course",
     "currently",
     "definitely",
     "described",
     "despite",
     "did",
     "didn't",
     "different",
     "do",
     "does",
     "doesn't",
     "doing",
     "don't",
     "done",
     "down",
     "downwards",
     "during",
     "each",
     "edu",
     "eg",
     "eight",
     "either",
     "else",
     "elsewhere",
     "enough",
     "entirely",
     "especially",
     "et",
     "etc",
     "even",
     "ever",
     "every",
     "everybody",
     "everyone",
     "everything",
     "everywhere",
     "ex",
     "exactly",
     "example",
     "except",
     "far",
     "few",
     "fifth",
     "first",
     "five",
     "followed",
     "following",
     "follows",
     "for",
     "former",
     "formerly",
     "forth",
     "four",
     "from",
     "further",
     "furthermore",
     "get",
     "gets",
     "getting",
     "given",
     "gives",
     "go",
     "goes",
     "going",
     "gone",
     "got",
     "gotten",
     "greetings",
     "had",
     "hadn't",
     "happens",
     "hardly",
     "has",
     "hasn't",
     "have",
     "haven't",
     "having",
     "he",
     "he's",
     "hello",
     "help",
     "hence",
     "her",
     "here",
     "here's",
     "hereafter",
     "hereby",
     "herein",
     "hereupon",
     "hers",
     "herself",
     "hi",
     "him",
     "himself",
     "his",
     "hither",
     "hopefully",
     "how",
     "howbeit",
     "however",
     "i'd",
     "i'll",
     "i'm",
     "i've",
     "ie",
     "if",
     "ignored",
     "immediate",
     "in",
     "inasmuch",
     "inc",
     "indeed",
     "indicate",
     "indicated",
     "indicates",
     "inner",
     "insofar",
     "instead",
     "into",
     "inward",
     "is",
     "isn't",
     "it",
     "it'd",
     "it'll",
     "it's",
     "its",
     "itself",
     "just",
     "keep",
     "keeps",
     "kept",
     "know",
     "knows",
     "known",
     "last",
     "lately",
     "later",
     "latter",
     "latterly",
     "least",
     "less",
     "lest",
     "let",
     "let's",
     "like",
     "liked",
     "likely",
     "little",
     "look",
     "looking",
     "looks",
     "ltd",
     "mainly",
     "many",
     "may",
     "maybe",
     "me",
     "mean",
     "meanwhile",
     "merely",
     "might",
     "more",
     "moreover",
     "most",
     "mostly",
     "much",
     "must",
     "my",
     "myself",
     "name",
     "namely",
     "nd",
     "near",
     "nearly",
     "necessary",
     "need",
     "needs",
     "neither",
     "never",
     "nevertheless",
     "new",
     "next",
     "nine",
     "no",
     "nobody",
     "non",
     "none",
     "noone",
     "nor",
     "normally",
     "not",
     "nothing",
     "novel",
     "now",
     "nowhere",
     "obviously",
     "of",
     "off",
     "often",
     "oh",
     "ok",
     "okay",
     "old",
     "on",
     "once",
     "one",
     "ones",
     "only",
     "onto",
     "or",
     "other",
     "others",
     "otherwise",
     "ought",
     "our",
     "ours",
     "ourselves",
     "out",
     "outside",
     "over",
     "overall",
     "own",
     "particular",
     "particularly",
     "per",
     "perhaps",
     "placed",
     "please",
     "plus",
     "possible",
     "presumably",
     "probably",
     "provides",
     "que",
     "quite",
     "qv",
     "rather",
     "rd",
     "re",
     "really",
     "reasonably",
     "regarding",
     "regardless",
     "regards",
     "relatively",
     "respectively",
     "right",
     "said",
     "same",
     "saw",
     "say",
     "saying",
     "says",
     "second",
     "secondly",
     "see",
     "seeing",
     "seem",
     "seemed",
     "seeming",
     "seems",
     "seen",
     "self",
     "selves",
     "sensible",
     "sent",
     "serious",
     "seriously",
     "seven",
     "several",
     "shall",
     "she",
     "should",
     "shouldn't",
     "since",
     "six",
     "so",
     "some",
     "somebody",
     "somehow",
     "someone",
     "something",
     "sometime",
     "sometimes",
     "somewhat",
     "somewhere",
     "soon",
     "sorry",
     "specified",
     "specify",
     "specifying",
     "still",
     "sub",
     "such",
     "sup",
     "sure",
     "t's",
     "take",
     "taken",
     "tell",
     "tends",
     "th",
     "than",
     "thank",
     "thanks",
     "thanx",
     "that",
     "that's",
     "thats",
     "the",
     "their",
     "theirs",
     "them",
     "themselves",
     "then",
     "thence",
     "there",
     "there's",
     "thereafter",
     "thereby",
     "therefore",
     "therein",
     "theres",
     "thereupon",
     "these",
     "they",
     "they'd",
     "they'll",
     "they're",
     "they've",
     "think",
     "third",
     "this",
     "thorough",
     "thoroughly",
     "those",
     "though",
     "three",
     "through",
     "throughout",
     "thru",
     "thus",
     "to",
     "together",
     "too",
     "took",
     "toward",
     "towards",
     "tried",
     "tries",
     "truly",
     "try",
     "trying",
     "twice",
     "two",
     "un",
     "under",
     "unfortunately",
     "unless",
     "unlikely",
     "until",
     "unto",
     "up",
     "upon",
     "us",
     "use",
     "used",
     "useful",
     "uses",
     "using",
     "usually",
     "value",
     "various",
     "very",
     "via",
     "viz",
     "vs",
     "want",
     "wants",
     "was",
     "wasn't",
     "way",
     "we",
     "we'd",
     "we'll",
     "we're",
     "we've",
     "welcome",
     "well",
     "went",
     "were",
     "weren't",
     "what",
     "what's",
     "whatever",
     "when",
     "whence",
     "whenever",
     "where",
     "where's",
     "whereafter",
     "whereas",
     "whereby",
     "wherein",
     "whereupon",
     "wherever",
     "whether",
     "which",
     "while",
     "whither",
     "who",
     "who's",
     "whoever",
     "whole",
     "whom",
     "whose",
     "why",
     "will",
     "willing",
     "wish",
     "with",
     "within",
     "without",
     "won't",
     "wonder",
     "would",
     "would",
     "wouldn't",
     "yes",
     "yet",
     "you",
     "you'd",
     "you'll",
     "you're",
     "you've",
     "your",
     "yours",
     "yourself",
     "yourselves",
     "zero",
     NULL };
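
The list above ends with a NULL sentinel rather than carrying an explicit length. As a rough illustration of what that implies for code reading it, the sketch below walks a NULL-terminated array to count its entries and test membership. The array sample_stopwords is a short hypothetical excerpt, and count_stopwords and is_stopword are made-up helper names; the linear scan is for illustration only and is not a claim about how the server performs lookups internally.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical excerpt; the real list is far longer. */
const char *sample_stopwords[] = {
    "a's", "able", "about", "the", "zero",
    NULL   /* sentinel marking the end of the list */
};

/* Count the entries that precede the NULL sentinel. */
size_t count_stopwords(const char *const *list)
{
    size_t n = 0;

    while (list[n] != NULL)
        n++;
    return n;
}

/* Return 1 if `word` exactly matches an entry, 0 otherwise.
   The comparison is case-sensitive and the scan is linear. */
int is_stopword(const char *const *list, const char *word)
{
    for (size_t i = 0; list[i] != NULL; i++) {
        if (strcmp(list[i], word) == 0)
            return 1;
    }
    return 0;
}
```

Calling count_stopwords(sample_stopwords) gives the number of words without counting the sentinel, so a count taken this way is one less than the number of array elements.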
