09 March 2008

Comparision of desktop search software

Comparision of desktop search software

From Wikipedia, the free encyclopedia
This article is being considered for deletion in accordance with Wikipedia's deletion policy.
Please share your thoughts on the matter at this article's entry on the Articles for deletion page.
Feel free to edit the article, but the article must not be blanked, and this notice must not be removed, until the discussion is closed. For more information, particularly on merging or moving the article during the discussion, read the guide to deletion.

Steps to list an article for deletion: 1. {{subst:afd}} 2. {{subst:afd2|pg=Comparision of desktop search software|cat=|text=}} ~~~~ (categories) 3. {{subst:afd3|pg=Comparision of desktop search software}} (add to top of list) 4. Please consider notifying the author(s) by placing {{subst:adw|Comparision of desktop search software}} ~~~~ on their talk page(s).
This article does not cite any references or sources.
Please help improve this article by adding citations to reliable sources. Unverifiable material may be challenged and removed.
Contents
[hide]

* 1 Feature comparison
* 2 Operating systems supported
* 3 Archive file types supported
* 4 Databases supported for storing indexed data
* 5 See also
* 6 External links
* 7 Notes and references

[edit] 1 Feature comparison
Beagle[1] Tracker[2][3] Recoll[4] Strigi Jindex
Regular expressions Partial[5] Yes[6] ? ? ?
Boolean operators:
AND OR NOT Yes Yes[7] Yes ? ?
Multiple character encoding and languages Partial[8] Yes[9] Yes[10] ? ?
Keyword search Yes Yes ? ? ?
Full text search Yes No[11] Yes ? ?
Searching exact sentences supports:
line breaks Yes ? ? ? ?
Searching exact sentences supports:
de-hyphenation on line breaks[12] No ? ? ? ?
Searching exact sentences supports:
text in columns[13] ? ? ? ? ?
Searching exact sentences supports:
non-alphanumeric characters Partial[14] ? ? ? ?
Stemming Yes[15] Yes Yes ? ?
Allow user tagging No Yes ? ? ?
Restrict search to tags N/A Yes ? ? ?
Restrict search to directories Partial[16] Yes Yes[17] ? ?
Metadata-based image retrieval Yes ? ? ? ?
Content-based image retrieval No ? ? ? ?
Thumbnails for indexed images and videos Yes[18] Yes ? ? ?
Index archive files recursively Yes No[19] ? ? ?
Index removable media No[20] No[20] ? ? ?
Different database catalogs for indexing data Yes Yes ? ? ?
File checksum (allows finding duplicate files) No No[21] ? Yes ?
Back end used Lucene.Net ? Xapian ? ?

[edit] 2 Operating systems supported
Beagle Tracker Recoll Strigi Jindex
Linux Yes Yes Yes Yes Yes
Mac OS X Work In Progress Yes Yes Yes ?
Windows Work In Progress No No Yes ?

[edit] 3 Archive file types supported
Beagle Tracker Recoll Strigi Jindex
zip Yes No ? ? ?
rar No No ? ? ?
7-zip No No ? ? ?
tar Yes No ? ? ?
gzip Yes No ? ? ?
bzip2 Yes No ? ? ?


[edit] 4 Databases supported for storing indexed data
Beagle Tracker Recoll Strigi Jindex
SQLite ? Yes ? ? ?


[edit] 5 See also

* Desktop search
* List of desktop search engines

[edit] 6 External links

* Desktop search tools for GNU/Linux: the competition hots up - Tracker, Recoll Strigi and Deskbar
* Comparison of indexers: Beagle, JIndex, Tracker, Strigi (December 2006)

[edit] 7 Notes and references

1. ^ http://mail.gnome.org/archives/dashboard-hackers/2008-March/msg00012.html
2. ^ http://www.gnome.org/projects/tracker/features.html
3. ^ http://mail.gnome.org/archives/tracker-list/2008-March/msg00031.html
4. ^ http://www.lesbonscomptes.com/recoll/features.html
5. ^ Only wildcard query terms supported for full text searches.
6. ^ Through RDF query. Additionally, future Xesam implementation will do this too.
7. ^ An expression tree is planned in the near future to do other booleans.
8. ^ Planned. Currently, its utf8 by default if the encoding is not specified for the file (some files e.g. html files can specify the encoding in their metadata).
9. ^ Everything is converted to UTF-8. Non-UTF8 needs the user's locales set up appropriately so that data can be successfully converted to UTF-8.
10. ^ Support for multiple charsets. Internal processing and storage uses Unicode UTF-8.
11. ^ Exact and precise phrases is planned to be supported shortly. It will be case-insensitive but otherwise precise including non-alphanumerics.
12. ^ That would mean the text "wa-(line break here)ter" would be indexed as "water".
13. ^ It is common for scientific articles to be available in PDF format where each page has two columns. This feature means that lines are index as per-column (correct mode) instead of per-page.
14. ^ String "a+b" can be searched and will return matching files with "a+b" in them, but will also return files with "a-b", i.e. the non-alphanumeric character is not matched.
15. ^ Full text searches are always stemmed. Keyword seaches are never stemmed.
16. ^ Searching by specifying a directory will only search in that directory (or directories if the name matches multiple actual locations) but no recursively in its subdirectories.
17. ^ Also allows specific file name searches with wildcards.
18. ^ The search service does not generate thumbnails itself. The search GUIs use the thumbnailers of the respective desktop environments (e.g. beagle-search uses the GNOME thumbnailer, kerry uses KDE thumbnail API).
19. ^ Will probably do so soon.
20. ^ a b Planned feature.
21. ^ Hasn't been fully implemented since to the author's this has not been necessary.

Retrieved from "http://en.wikipedia.org/wiki/Comparision_of_desktop_search_software"

This page was last modified on 2008-03-08, at 19:57:23. All text is available under the terms of the GNU Free Documentation License. (See Copyrights for details.)
Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a U.S. registered 501(c)(3) tax-deductible nonprofit charity.

Edit this page | Watch this page | Discuss this page | Page history | What links here | Related changes
| Move this page

Main Page | About Wikipedia |
Find:

This page was last modified on 2008-03-08, at 19:57:23. All text is available under the terms of the GNU Free Documentation License. (See Copyrights for details.)
Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a U.S. registered 501(c)(3) tax-deductible nonprofit charity.

No comments: