SEP 13th 2007

This entry is a response to "I will never support the Semantic Web" by Brian of d'bug.

I'm getting tired of reading about how the Semantic Web is some kind of pipe dream that will never be realized. The Semantic Web is completely and entirely within our technological reach. People may have been given the impression that we cannot create the Semantic Web because of its complexity, the number of years it has been in development, or even the unanswered questions that still exist for certain problems we will face. These are valid reasons to doubt our progress, but progress is certainly what we are making.

Split philosophies

We want everybody to communicate freely across the barriers of language and culture. This is the commonly agreed-upon ultimate goal of the Semantic Web. How we are to realize the Semantic Web is, however, another story. Typically, there are two schools of thought on how to achieve this common goal. One is to build a web of data; the other is to build a web of agents. The two approaches pursue the same goal but represent different philosophies, and this philosophical difference may eventually determine the fate of each.

The Semantic Web

Today we reach an important milestone in this series. We are crossing a great divide, from familiar technologies such as XML, Unicode, URI, and RDF to the Web Ontology Language (OWL). This, my friends, is where things really start to get interesting, because this is the point where the Semantic Web vision really starts to take form. Today, we present a screencast exhibiting Protégé, a free, open source ontology editor and knowledge-base framework developed by the Stanford Center for Biomedical Informatics Research at the Stanford University School of Medicine. In this screencast, we show you how to develop a useful Semantic Web-ready application in just minutes. You will learn how to model a very simple ontology in OWL.

True Knowledge

True Knowledge is a natural language search engine and question answering site, but to leave it at that would not do the site justice. What makes it stand out from similar-sounding services like Powerset and Freebase? True Knowledge tackles natural language search and question answering (much like Powerset and Hakia), and it also maintains a knowledge base of facts about the world (similar to DBpedia and Freebase). What sets it apart is that it combines these features and encourages its user base to contribute facts and add new knowledge.

JAN 18th 2007

Last night I had an interesting conversation with an online acquaintance about the Semantic Web. I was surprised to find that the mere mention of the name "Semantic Web" sent him into a five-minute rant about how much he disliked everything to do with it. His biggest qualm was with what he considered to be the empty promises made by promoters and supporters of the Semantic Web. One such promise was that the Web would be transformed into an artificial intelligence that would think and act independently of humans.

JAN 26th 2007

A mashup is a hybrid Web application that combines complementary elements from two or more sources to create one integrated experience. Content used in mashups is generally sourced from a third party via an API or from Web feeds (e.g. RSS or Atom). Basically, the point is to take multiple data sources or Web services and turn them into something useful. The idea of combining Web services is not a new one, but it has gained immense traction in recent times and will likely continue to grow in popularity. In this entry I will discuss both the promising future mashups offer and their potential pitfalls.
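To make the idea concrete, here is a minimal sketch in Python of about the simplest mashup there is: two RSS feeds merged into one list, newest first. The two feeds are made-up samples inlined as strings so the sketch runs without network access, and the names `parse_items` and `mashup` are my own for illustration, not part of any library. A real mashup would fetch live feeds or call third-party APIs instead.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Hypothetical sample feeds (assumption: RSS 2.0 with title, link, pubDate).
FEED_A = """<rss version="2.0"><channel><title>Feed A</title>
<item><title>A1</title><link>http://example.org/a1</link>
<pubDate>Fri, 26 Jan 2007 10:00:00 GMT</pubDate></item>
<item><title>A2</title><link>http://example.org/a2</link>
<pubDate>Wed, 24 Jan 2007 08:00:00 GMT</pubDate></item>
</channel></rss>"""

FEED_B = """<rss version="2.0"><channel><title>Feed B</title>
<item><title>B1</title><link>http://example.org/b1</link>
<pubDate>Thu, 25 Jan 2007 12:00:00 GMT</pubDate></item>
</channel></rss>"""


def parse_items(rss_text, source):
    """Extract the items of one RSS document as plain dictionaries."""
    root = ET.fromstring(rss_text)
    items = []
    for item in root.iter("item"):
        items.append({
            "source": source,
            "title": item.findtext("title"),
            "link": item.findtext("link"),
            # RSS dates use the RFC 822 format, which email.utils can parse.
            "published": parsedate_to_datetime(item.findtext("pubDate")),
        })
    return items


def mashup(*feeds):
    """Merge (rss_text, source_name) pairs into one list, newest first."""
    merged = []
    for rss_text, source in feeds:
        merged.extend(parse_items(rss_text, source))
    return sorted(merged, key=lambda entry: entry["published"], reverse=True)


if __name__ == "__main__":
    for entry in mashup((FEED_A, "Feed A"), (FEED_B, "Feed B")):
        print(entry["published"].date(), entry["source"], entry["title"])
```

The interesting part of a mashup is rarely the merge itself. It is agreeing on a common shape for the data (here, a dictionary with four keys) so that sources which know nothing about each other can be combined.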

FEB 22nd 2007

The value of a dataset may be determined by any number of factors, but it is generally agreed that the data's accuracy, its source, and how difficult it is to re-create all affect that value. However, as technology evolves to allow easier access to the information we require, the value of a dataset may decrease over time.

MAR 11th 2007

For just about every area of research there exist documents online describing background information or techniques to accomplish a task in that domain. These documents are often referred to as white papers, provided their content is technical or research-oriented. The information held within white papers is essentially accessible to humans only, because machines are not able to read and comprehend text in the same way humans can. If machines were able to read white papers and extract information as humans do, we would be able to store each fact and piece of knowledge from the documents. This method of indexing would facilitate much more detailed searches, allowing users to search by topic, theory, conclusion, methods, citations, references, etc.

MAR 19th 2007

I have three interesting links that you need to check out. The first two are products for discovering and storing metadata, natural language processing, and many more things. The third link goes to a post on the Geospatial Semantic Web Blog, which gives us an update on Metalink's ability to map its descriptions into RDF.

APR 13th 2007

It seems as though nothing short of a new buzzword can stop the burst of activity in the vertical search market, and who are we to complain? Vertical search engines (VSEs) differ from their horizontal brethren (which attempt to index the Web as a whole) by indexing only information about a single topic or niche. Often, a VSE can deliver results with much greater relevancy and accuracy than major horizontal players like Google, Yahoo, and Microsoft.

