The journey from now to the Semantic Web is a long one. What we currently have on our hands with the current version of the Web are billions of documents totaling terabytes of data. This data is usually found within HTML pages comprised mainly of non-validating markup and very little, if any, meta data.

While there are billions of documents on the Web that contain no meta data whatsoever there is one shining star of hope: Natural Language Processing. NLP can be used to sift through the "garbage" data to extract coherent statements about the information held within.

Continue reading Natural Language Processing and the Semantic Web

JAN 8th 2007

Let's face it, Web documents in their current state are just about the worst way we exchange data. It usually comes in the form of HTML or XHTML, and rarely validates to any degree. In order to begin our approach towards Tim Berners-Lee's dream of the Semantic Web we must begin writing valid, semantic markup.

Continue reading Semantic markup gives Web documents meaning

Page 11 of 11