StandardAnalyzer

I wrote a StandardAnalyzer for PHP Lucene to be used by those who need a google-like search which performs:

  • Word Stemming
  • Stop word filtering
  • Lowercase filtering

A brief background of the StandardAnalyzer is available in my post which discusses it. There is also a Readme.txt in the project which provides some more information.

Provided below is the sample project mentioned in that post, which includes the StandardAnalyzer code. This is the first release of this code, hence the beta designation. If you happen to find an issue, let me know: katzgrau@gmail.com.

Download

Zip file:
standardanalyzer-1.0.0b.zip

Tarball:
standardanalyzer-1.0.0b.tar.gz

One Comment

  1. Vitor ALmeida
    Posted February 22, 2011 at 7:24 pm | Permalink

    Hey mannn nice work !
    I looking for the same funcionality to portuguese language.. can u give me some ideas ??

    How can i implement a stemming for portuguese? wath is the process ?

    thank u !

2 Trackbacks

  1. By A Lucene-based Search Plugin For Wordpress on June 30, 2008 at 8:01 pm

    [...] wpSearch uses the PHP port of the library by Zend. It also spawned a sub-project, the PHP StandardAnalyzer. You can read more about that here. [...]

  2. [...] implementation of a PHP standard analyzer can be downloaded from the StandardAnalyzer project page. This analyzer is for the English language and it performs the following [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre lang="" line="" escaped="">