I wrote a StandardAnalyzer for PHP Lucene, intended for those who need a Google-like search. It performs:
- Word Stemming
- Stop word filtering
- Lowercase filtering
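
To make the three filters concrete, here is a minimal sketch of the idea in Python. It is not the StandardAnalyzer code from the download: the stop word list, the suffix rules, the filter order (lowercase, then stop words, then stemming), and the names `STOP_WORDS`, `SUFFIXES`, `toy_stem`, and `analyze` are all assumptions made for illustration. In particular, the toy suffix stripper only stands in for a real stemming algorithm.

```python
import re

# Hypothetical, deliberately tiny stop word list; a real analyzer ships a longer one.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for"}

# Toy suffix rules standing in for a real stemming algorithm (assumption, not the
# rules used by the actual StandardAnalyzer).
SUFFIXES = ("ing", "ed", "es", "s")


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def analyze(text):
    """Lowercase the text, drop stop words, then stem each remaining token."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [toy_stem(token) for token in tokens if token not in STOP_WORDS]


print(analyze("Searching the indexed documents"))
```

Because the same filters would run on both the indexed text and the query, a search for "search index document" reduces to the same tokens as "Searching the indexed documents", which is what makes the search feel forgiving.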
A brief background of the StandardAnalyzer is available in my post which discusses it. There is also a Readme.txt in the project which provides some more information.
Provided below is the sample project mentioned in that post, which includes the StandardAnalyzer code. This is the first release of this code, hence the beta designation. If you happen to find an issue, let me know: katzgrau@gmail.com.
Download
Zip file:
standardanalyzer-1.0.0b.zip
Tarball:
standardanalyzer-1.0.0b.tar.gz

One Comment
Hey man, nice work!
I'm looking for the same functionality for the Portuguese language. Can you give me some ideas?
How can I implement stemming for Portuguese? What is the process?
Thank you!