In the first part of this article, I explained how you can use Lucene to query a document (Word, PDF, etc.) and find matches for specific keywords, which we needed in order to automatically identify a document's category based on its content.
We've chosen a simple approach to demonstrate the automatic classification extension: if a document contains the name of a category, then...
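As a rough illustration of the "document contains the name of a category" check, here is a minimal sketch in plain Java. It deliberately does not use Lucene or any Alfresco API: it does a case-insensitive substring match on text that has already been extracted. The excerpt above is cut off, so what happens after a match is an assumption here (the sketch simply returns the matching category names). The class `CategoryMatcher`, the method `matchCategories`, and the sample category names are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class CategoryMatcher {

    // Hypothetical helper (not part of Alfresco or Lucene): returns every
    // category whose name occurs somewhere in the document text,
    // ignoring case. The input order of the categories is preserved.
    static List<String> matchCategories(String documentText, List<String> categoryNames) {
        String haystack = documentText.toLowerCase(Locale.ROOT);
        List<String> matches = new ArrayList<>();
        for (String name : categoryNames) {
            if (haystack.contains(name.toLowerCase(Locale.ROOT))) {
                matches.add(name);
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Sample category names, assumed for illustration only.
        List<String> categories = Arrays.asList("Invoice", "Contract", "Report");
        String text = "This contract covers the annual report delivery.";
        System.out.println(matchCategories(text, categories)); // prints [Contract, Report]
    }
}
```

A plain substring match will also fire on partial words (for example, "Report" inside "reporter"), which is one reason an indexed, tokenized search such as Lucene's is a better fit for real documents.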
Saturday, December 4, 2010
Tuesday, November 30, 2010
Alfresco automatic document classification: Part 1
Alfresco can handle multiple classifications, or hierarchies of classification. It's a very useful feature that can make your life a lot easier when looking for documents, especially ones with no indexed content, such as pictures and scanned documents.
Classifying a document in Alfresco can be as easy as a few clicks in the browser; however, it can be a very time-consuming process if...
Friday, November 26, 2010
Getting started with Alfresco
During my internship at TGR, one of the project's requirements was indexing and managing documents. That was my first experience with Alfresco, an open source Enterprise Content Management (ECM) system that combines a collection of content-centric technologies such as Document Management (DM) and Records Management (RM), along with other technologies that should make your life, if your field of work...