我想根据所显示的文章将一些关键字元标记放入页面中。
假设您加载页面 blabla.com/article.aspx?id=2 id 等于 2 的文章标题为“商业管理中故意错误的智慧”
所以我想包括这样的元标记:
<META name="keywords" content="wisdom, deliberate, mistakes, business, management" />
所以我需要一种方法来排除嘈杂的词(就像 SQL Server FullText 一样)。你会怎么做?
1)在webconfig中保存干扰词列表?2)将噪声词保存在数据库中?3) 将干扰词保存在文本文件中?4)硬编码代码中的噪声词(NOT =P)
那么,您将如何加载这些干扰词以最小化页面负载?最后,您将如何解析去除干扰词的字符串?
谢谢!
编辑:噪音(或停止)词将与 SQL Server 2005 FTS 使用的相同(检查 MSSQL\FTDATA 中的 noiseENU.txt)这是该文件的内容:
about
1
after
2
all
also
3
an
4
and
5
another
6
any
7
are
8
as
9
at
0
be
$
because
been
before
being
between
both
but
by
came
can
come
could
did
do
does
each
else
for
from
get
got
has
had
he
have
her
here
him
himself
his
how
if
in
into
is
it
its
just
like
make
many
me
might
more
most
much
must
my
never
no
now
of
on
only
or
other
our
out
over
re
said
same
see
should
since
so
some
still
such
take
than
that
the
their
them
then
there
these
they
this
those
through
to
too
under
up
use
very
want
was
way
we
well
were
what
when
where
which
while
who
will
with
would
you
your
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z