links for 2009-11-24
-
It's not that often I need to mess around with CData using PHP SimpleXML but whenever I do I always seem to forget a couple of basics and spend 20 min looking everything up again. So here's that stuff on one page…
-
How to setup the secure shell daemon on a Windows 2003 server
links for 2009-11-23
-
Ajaxify is a jQuery plugin. it can convert all links in a web page into an ajax load and submit requests. with this ultra weight plugin, you can build a complex ajax website with one single line. welcome to web 2.0 world!!.
links for 2009-11-17
-
“Search engines, big and small, have certain standard elements and processes. The more you understand them, the easier to tune them to solve your real information needs. This practical overview provides a big picture view of how search fits within enterprise and websites, and a focused introduction to search technology and user experience. Elements of search covered include robot spiders, database connectors and other tools for locating content, indexing issues, query parsing, retrieval, relevance ranking, and designing usable search interfaces. The workshop addresses common search problems and solutions, security issues, languages, new interface elements, important (and unimportant) features as well as providing tools for choosing a search engine or evaluating an existing one.”
-
I needed to implement an autosuggest/autocomplete search box for use with Solr. After a little research, I found the new TermsComponent feature in Solr 1.4. To use TermsComponent for suggestions, you need to provide set the prefix and lower bound to the input term and make the lower bound exclusive. Use the terms.fl parameter to set the source field. This means:
links for 2009-11-09
-
This document describes the procedure for installing Perl modules, on Unix and NT. It covers CPAN.pm for Unix, the PPM for NT, and a variety of methods of installing 'manually'. Unless specifically stated, when I refer to NT, I really mean all Win32 platforms, including Win95 and Win98. I don't have Windows 2000 installed yet, but I have no particular reason to think that these procedures won't work there.
-
Named Entity Recognition is another trivial issue in Natural Language Processing. For details about named entities refer the wiki article on Named entity recognition.
Usually a named entity refers name of person, companies etc… It may be a group of word optionally joined with of the and like(In English). Automatic identification of this groups are a problematic one.
links for 2009-11-05
-
Solr 1.4 contains a little advertised feature that I think folks ought to know about, Solritas. What a silly name, you say! Yeah, ok, call it the boring ol’ VelocityResponseWriter if you like. Pronunciation accessibility aside, Solritas makes Solr responses look like a real user interface. Close enough for government work, as they say. We’ve effectively used Solritas on a number of our recent engagements. After bringing clients’ data into Solr for the first time, wow, they have a search box, facets, and even toggle-able inline hit score explanations and request/response dumps (when using &debugQuery=true). And we like to toss the clients’ logo in the upper left for good measure, because we can and it’s easy.
links for 2009-11-02
-
That's where the very useful Perl module called XML::Simple comes in. It takes away the drudgery of parsing XML data, making the process easier than you ever thought possible. When you're done with this article, you'll know everything from how to convert the XML data into a Perl variable to going in the other direction and creating an XML file from a Perl hash.
-
Songr does not index nor host any mp3 files, it's just a desktop interface to third party mp3 search engines. Songr has no control over any web sites or resources which are provided by companies or persons other than Songr Do not download copyrighted music.
-
Image search engines sift through the web searching by type of image, by license or by tags and keywords. There are of course, standalone photo reservoirs like Flickr, Stock.xchng, Photobucket, Morguefile et.al; but your kind of image may just be tucked away in some other obscure corner of the web. Google Image Search is powerful but it always pays to have a few more options up your browser.
-
An enterprise search engine has two components: a front end and a back end. Both work together with the search index. The index is built statically for search speed, and is updated periodically. This is unlike a database where the indexes are updated in real time when data is changed or added.
links for 2009-11-01
-
TNR Global's Enterprise Search Blog http://www.tnrglobal.com/enterprise-search-blog focuses on applications and implementation of enterprise search technologies, both commercial and open source. TNR Global specializes in implementing FAST ESP (Enterprise Search Platform) and the open source search system Lucene/Solr.