links for 2010-03-09
-
We've all seen countless articles, blog and forum posts explaining how to back up a server with rsync and other tools. While I've cringed when people talked about using non-scalable methods, there actually is a place for quick and dirty backup mechanisms. Small companies running just a few virtual machines in the cloud, or even enterprises with test instances, may wish for a quick and effective backup.
links for 2010-03-05
-
Graph visualization is a way of representing structural information as diagrams of abstract graphs and networks. Automatic graph drawing has many important applications in software engineering, database and web design, networking, and in visual interfaces for many other domains.
Graphviz is open source graph visualization software. It has several main graph layout programs. See the gallery for some sample layouts. It also has web and interactive graphical interfaces, and auxiliary tools, libraries, and language bindings.
-
How do you measure a successful Enterprise Search project?
-
In a move that might rewrite the entire search market, Google is rumored to be creating a system that will let allow web publishers to submit content to Google for search indexing in real-time.
-
AUTONOMY INTERWOVEN FIRST TO DELIVER INTEGRATED WEB CONTENT MANAGEMENT, SEARCH, OPTIMIZATION, AND RICH MEDIA ON A SINGLE PLATFORM
links for 2010-03-04
-
The CQL context set defines a set of indexes, relations and relation modifiers. The indexes supplied are 'utility' indexes which are generally useful across all applications of the language. These utility indexes are for instances when CQL is required to express a concept not directly related to the records, or for indexes applicable in practically every context.
links for 2010-03-02
-
Indexing Text and HTML Files with Solr
-
Solr focusses to get the most out of one index type: Lucene. Meresco supports a number of different index types, each specialized for a specific task.
links for 2010-03-01
-
Google has offered a general explanation of how it ranks its search results, one day after the European Commission said it was looking into antitrust complaints against the company.
links for 2010-02-26
-
# quickly try Carrot2 with your own data
# tune Carrot2 clustering settings in real time -
At the heart of any search solution is a good understanding of the business problem to be solved as well as knowledge of the available content and metadata. You have to work within the confines of the content you have (or can add with content enhancement). You have to analyze that content and be able to describe how you can identify some documents as being relevant for solving a business problem and why other documents are not relevant. That is just the beginning.
-
Make your site richer with free, high-quality data and smarter with powerful tools for crosslinking content.
links for 2010-02-25
-
We've all seen promises that semantic search will be the next big thing. However I'd love to know if it actually exists in a workable form for the enterprise, or whether it's still just a marketing department's dream. Comments? Examples? Thoughts?
links for 2010-02-24
-
In this whiteboard video, Attivio Architect Martin Serrano provides an overview of query-side JOIN in AIE.
links for 2010-02-23
-
The text-overflow declaration allows you to deal with clipped text: that is, text that does not fit into its box. The ellipsis value causes three periods to be appended to the text.
-
The World Clock – Time Zone Converter – results
-
The first two pair nicely. Each entrant — working independently — tried to explain IA in terms simple enough for a 5 year old. I know US newspapers are supposedly written at a 4th grade level, but given how chronically underfunded IA is, I think the profession has learned to go even younger as a survival skill.
links for 2010-02-22
-
The Enterprise Search Bus is becoming key in the information and IT infrastructure of organisations. It announces a decline in the strategic importance of the relational database, emphasizing its greatest weakness: its rigidity that seen from an intelligence perspective makes it a rather dumb tool. Here's why and here are some consequences to the software ecosystem.