dtSearch Corp is preparing to announce a new 64-bit version of its developer-focused text retrieval engine to add scalability to search.
A new beta build of dtSearch’s Text Retrieval Engine for Windows and .NET developers is being prepped for release later in July.
The software lets developers add dtSearch’s terabyte indexer search functionality and file format support to Web-based and other applications.
The revamped dtSearch Engine also provides over two dozen new search options, including hit-highlighted file display formats. It also comes with distributed and federated search capabilities across multiple data sources. The latter includes .NET 2.0 API access to dtSearch’s built-in web spidering capabilities.
All of dtSearch Engine features have now been ported to 64-bit in the beta version.
For example, the 64-bit engine can now index over a terabyte of text in a single index and search across multiple indices in parallel. dtSearch claims sub-second search times even across terabyte volumes of data.
The range of supported search file types have also been expanded to display HTML, XML and PDF files with highlighted hits and with embedded images, links and formatting intact. The software also uses a built-in proprietary HTML converter for browser-based displays of non-web like Office documents, OpenOffice, ZIP, CSV and Unicode files.
In addition to searching fields in all supported file types, dtSearch Engine also supports hit-highlighted precision search options for ODBC, XML and SQL-type databases, including BLOB data. On-the-fly classification capabilities for database fields during indexing are also included.
dtSeearch Engine for Windows and .NET is also the first release that will offer integration with Office 2007, Microsoft Corp’s next generation personal productivity tools suite for the desktop that is due early next year. Both the preliminary Word 2007 and Excel 2007, in 32- and 64-bit versions, will be supported.
dtSearch Engine for Win & .NET is part of an expanding line of search and retrieval technologies offered by the firm. These include desktop PC, network and web spidering tools, and searchable content publishing tools. A Linux version of dtSearch Engine, complete with C++ and Java APIs, is also provided.
dtSearch was founded in 1991 but started R&D in text retrieval in 1998; its first commercial product was released in the first quarter of that year. The software offers an array of indexed and unindexed search options including fuzzy, synonym, concept, thesaurus, boolean, natural language relevancy ranking, phonic and special forensics.
The company’s products are used by medical, recruiting and accounting firms for Internet and Intranet searching, access to technical documentation, policy manual publishing and email filtering applications. Its software is also listed on the US government’s GSA schedule.
dtSearch products are sold direct and via distributors to customers spanning across 70 countries (via Unicode support) and include US defense, space and law enforcement agencies.