171 lines
1.8 KiB
Text
171 lines
1.8 KiB
Text
# Licensed to the Apache Software Foundation (ASF) under one or more
|
|
# contributor license agreements. See the NOTICE file distributed with
|
|
# this work for additional information regarding copyright ownership.
|
|
# The ASF licenses this file to You under the Apache License, Version 2.0
|
|
# (the "License"); you may not use this file except in compliance with
|
|
# the License. You may obtain a copy of the License at
|
|
#
|
|
# http://www.apache.org/licenses/LICENSE-2.0
|
|
#
|
|
# Unless required by applicable law or agreed to in writing, software
|
|
# distributed under the License is distributed on an "AS IS" BASIS,
|
|
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
|
# See the License for the specific language governing permissions and
|
|
# limitations under the License.
|
|
|
|
#-----------------------------------------------------------------------
|
|
# a couple of test stopwords to test that the words are really being
|
|
# configured from this file:
|
|
stopworda
|
|
stopwordb
|
|
|
|
#Standard english stop words taken from Lucene's StopAnalyzer
|
|
a
|
|
an
|
|
and
|
|
are
|
|
as
|
|
at
|
|
be
|
|
but
|
|
by
|
|
for
|
|
if
|
|
in
|
|
into
|
|
is
|
|
it
|
|
no
|
|
not
|
|
of
|
|
on
|
|
or
|
|
s
|
|
such
|
|
t
|
|
that
|
|
the
|
|
their
|
|
then
|
|
there
|
|
these
|
|
they
|
|
this
|
|
to
|
|
was
|
|
will
|
|
with
|
|
|
|
# these stopwords are taken
|
|
# from http://www.onjava.com/pub/a/onjava/2003/01/15/lucene.html?page=2
|
|
|
|
about
|
|
after
|
|
all
|
|
also
|
|
an
|
|
and
|
|
another
|
|
any
|
|
are
|
|
as
|
|
at
|
|
be
|
|
because
|
|
been
|
|
before
|
|
being
|
|
between
|
|
both
|
|
but
|
|
by
|
|
came
|
|
can
|
|
come
|
|
could
|
|
did
|
|
do
|
|
does
|
|
each
|
|
else
|
|
for
|
|
from
|
|
get
|
|
got
|
|
has
|
|
had
|
|
he
|
|
have
|
|
her
|
|
here
|
|
him
|
|
himself
|
|
his
|
|
how
|
|
if
|
|
in
|
|
into
|
|
is
|
|
it
|
|
its
|
|
just
|
|
like
|
|
make
|
|
many
|
|
me
|
|
might
|
|
more
|
|
most
|
|
much
|
|
must
|
|
my
|
|
never
|
|
now
|
|
of
|
|
on
|
|
only
|
|
or
|
|
other
|
|
our
|
|
out
|
|
over
|
|
re
|
|
should
|
|
since
|
|
so
|
|
some
|
|
still
|
|
such
|
|
take
|
|
than
|
|
that
|
|
the
|
|
their
|
|
them
|
|
then
|
|
there
|
|
these
|
|
they
|
|
this
|
|
those
|
|
through
|
|
to
|
|
too
|
|
under
|
|
up
|
|
very
|
|
want
|
|
was
|
|
we
|
|
were
|
|
what
|
|
when
|
|
where
|
|
which
|
|
while
|
|
who
|
|
will
|
|
with
|
|
would
|
|
you
|
|
your
|