-
Website
http://www.scobleizer.com/ -
Original page
http://scobleizer.com/2007/04/05/i-finally-get-semantic-web/ -
Subscribe
All Comments -
Community
-
Top Commenters
-
danja
44 comments · 4 points
-
polizeros
52 comments · 1 points
-
AndyBeard
69 comments · 4 points
-
Zachary Adam Cohen
35 comments · 8 points
-
dbarefoot
40 comments · 3 points
-
-
Popular Threads
-
The best and worst thing Twitter did in 2009: RT
22 hours ago · 20 comments
-
World-brand-building mistakes France’s entrepreneurs make
1 week ago · 181 comments
-
2010: the year SEO isn’t important anymore
6 days ago · 67 comments
-
iPhone developers abandoning app model for HTML5?
6 days ago · 51 comments
-
Google eating Yelp?
5 days ago · 25 comments
-
The best and worst thing Twitter did in 2009: RT
Perhaps, this is a better description. I'm not into comic books..but it sounds like a layout with all the "word bubbles" that display all the different conversations in the order that they took place.
Am I right? Or am I totally off?
Well, people couldn't even get HTML right. They can't get RSS right (Mark Pilgrim had to make his feed parser "ultraliberal" doing things that make baby Jesus cry for a reason). How are they going to get much more complicated data formats right?
And if all this data is part of some kind of propositional logic-based forward-chaining inference framework, isn't the entire chain only as strong as the weakest link, and all it takes to muck everything up is one guy who didn't produce proper data?
It seems doomed to fail.
Correct me if I'm wrong because I'd love for this to work.
While the concept sounds really good, I just think that most (+90%) of people won't be able to grasp it.
If the process is broken up into steps, I think it's more feasible. For example, if everyone started off by adding REL tags (relationship) to their content to identify themselves, a large social grid would be build.
The problem with the current hCard movement is that it does nothing for verification. I can mark something as [choose name] and throw in a few relationship URL's, and that's it; there's no verification. If someone searches [insert name again], they come across my spoofed content Instead, I think we need to begin by associating communication with ID tags (i.e. e-mail to ID).
What really excites me is GRDDL, which basically allows one to provide a bridge between straight HTML and the Semantic Web.
It will become (is) a critical business requirement to make sure one's metadata is up to par, and (more) software will be built to help with the process.
Moreover, I foresee Web applications in the not-too-distant future evolving similarly to 3D modeling applications did in the last few years - from being crude sculpting tools to putting the operator in the director's seat. We won't be schlepping bits of markup text anymore, we will be arranging information as we see fit.
Not that there isn't a lot of promise to "semantic web" technologies. These technologies are likely to join a whole list of technologies that help manage information, but I remain sceptical of magic bullet (killer app) status. I distinctly remember when the same was said of XML.
Much of the magic of semantic web can be achieved by other means. There is certainly something to be said for making it easier to find common information (hell I would love all blogs were marked in a common to make it easy to process them) but there is a lot to be said for statistical analysis as well.
2. This won't be accepted by the general public as a standard.
3. Client side scripting is used sparingly by large sites because it's more or less annoying. The standard browser isn't built to navigate AJAX, and if a new browser comes out that will let you navigate DIVs and ajax stuff it will be a total pain in the butt to use.
4. This is grasping at straws.
5. This is more hype du jour from Silicon valley.
Why more interfaces to the same old data?
It's not easier, it's just annoying.
They should find some better way to get venture cap.
Like say, oh, working on something meaningful.
I'm so disappointed that TBL supports this nonsense, back in the day his markup was simple, and even non-techies could write it. He now seems determined to take markup out of the reach of the common man.
Even Google is in on the act. I would have thought Google were all about smart software, but the losing rel="nofollow" stuff showed even they aren't immune to stupid hacks.
Thanks for the notice. I am glad you got a chance to see what we've been doing. The "semantic web will fail" meme has been around a long time. As you saw, Radar is not trying the semanticize the entire web. It's a semantic app. It just so happens that like any good app, it tends to encourage 'good behavior' from the rest of the world that wants to interact with it, but the app stands on its own. That's the main point the Semweb naysayers don't get. Semantic web technologies work and are great for building useful apps - the world doesn't need to speak the same language for us to be able to work together. But speaking the same language definitely reduces translation friction.
I look forward to your being able to be a bit more revelatory in the next few months.
With the current web your individual blocks are things like posts, comments, bookmarks, etc. You have tools like RSS, XML and JSON to put the blocks together (ie: Yahoo Pipes)
These blocks are big and they only come in a few basic colours. They are the Legos of the 1970s.
With the semantic web it is like modern Legos. The blocks are smaller, and come in many more different sizes and shapes. It makes it easier to build more detailed and complex structures.
And, believe me, there are lots of fun engineering challenges one can have building the SemWeb - making massively scalable triple stores (something that Nova's company is apparently working on), efficient scutters, finding the optimum distribution of granularity vs. centrality and much more.
If Google can achieve this much with a set of source documents that are not particularly well marked up semantically, then imagine how cool the next evolution of Google could be...
A lot of information is already broken up into little pieces. The goal would be to easily get data in/out from services like eBay, Google, or even MySpace. Once relationships are defined (either algorithmically or with a little user input), it should be possible to conduct full social searches to return stuff like Mom's schedule, Dad's doctors appointment, John Doe's resume, for-sale listings from my 6th degree, or what my family thought of a certain restaurant.
The best part? All of that information resides on various services; the key is putting it all together. Schedules might reside on Google Calendar, Zoho, or some PDA application. Listings might reside on eBay or Craigslist. Resumes might reside on MySpace, Facebook, or Monster.
Most services hate aggregation of data since it takes eyeballs away from their service... On the other hand, those who are generating the content would NEED those services in order to continue to generate content.
It sounds great, but who knows how it will end up.
This is one of the reasons that e.g. Google doesn't put much stock in the semantic web.
Well, whatever your answer is - it's basically the same with the SemWeb. There's a whole group of cool and interesting technology - SSL, GPG, OpenID, FOAF - that can be used in combination with good old-fashioned human judgment and human rule-making. If someone advertises something on their website and it doesn't work, they may be breaking - say - a law or contract. If they advertise something in a Semantic document (say, a series of RDF triples) and it isn't true, then you can use the same sort of legal instruments.
The W3C/TimBL vision of the Semantic Web is not holy. Trust is a question higher up on the layer cake. Adding small amounts of semantics at this stage does a lot of good - which is why things like microformats, eRDF and GRDDL are important.
For folks without the opportunity to be shown a good demo, I'd recommend the recent short video from Tim Berners-Lee, The Semantic Web of Data.
Nope
"the industry has gotten boring"?
- hehe, that kinda proves my point.
I second the comments from Danny and would like to add the following:
1. Data is Data, and it the foundation of everyting on the Web (as it is everything in our daily existence)
2. Data has always wanted to be free
3. The Semantic Data Web is about Open Access to Data on the Web
4. There are no secrets in the Web Realm (any variant from 1.0 - X.X.X)
5. Decomposing Web Pages into granular items of referencable data is what the Semantic Web has always been about
6. Hyperlinks no longer point solely to (X)HTML documents, they can also point to granular bits of Data exposed by Data Sources (that include traditional Web Pages)
7. When a hyperlink points to a Document as blurb it is a URL. When it points to a Document or any other Data Source with granular Data Access in mind it is a URI
8. Everything you, I, or anyone else has posted on the Web will ultimately be accessible via single URI (because URIs can point to other URIs; meaning Data can point to other pieced of Data)
9. Computers have worked this way since the beginning of time; we are simply extending the concept of the Computer. To quote Sun: The Network is the Computer. The Web is a Network that connnect People to Data and Data to People
10. The Web is a subsystem of the Network Operating system called the Internet :-)
TBL
keynote at the May 2005 Bio-IT World event in Boston.
This turned out to be a serendipitous introduction as I had been
wrestling
with various metadata challenges.
GRDDL is useful because it allows relationships to be extracted from XML-based
representations, and high-quality metadata to be developed along the way. These
relationships, cast in RDF, are semantically richer and more expressive than
their XML counterparts.
In a further phase of semantic refinement, the RDF-based representation can be
transformed into one based on the Web Ontology Language (OWL). From this vantage
point of an
informal
ontology "... Web pages will no longer be just pages, or posts. They’ll all
be split up into little objects, stored in a database (a massive, scalable one
at that) and then your words can be displayed in different ways."
Because this remains a highly geeky domain, it's wonderful to see the dialog
that this blog post has generated.
If we think about this with a kind of logic that applies a historical innovation index of some kind, html and other languages might be compared to any innovation or language.
Take linear A compared to linear B. No one has yet gained a full understanding of Linear A, yet it served the most advanced civilization of the bronze age for 2000 or more years.
Linear B was translated when the Rosetta stone was deciphered and it bears some similarity to linear A. For those that don't know Linear B was the early language of the Greeks (those guys our culture and other stuff came from).
In a similar way html, XML and the others have served us well, but is it time to look deeper? Hell, mathematics cannot even give anyone an exact measurement of a circle or any derivative of pi. It does not mean it is useless, but it does suggest that there is something (quark-tile theory of inter-dimensional mathematics etc.) that could reveal greater discoveries.
So, given my introduction to AI, fuzzy logic and semantic search offered By Riza Berkan of hakia search, I am leaning toward semantics. Just another opinion of course, but the law of probability and history assures us of this. We no longer use punch cards to tabulate bank data on computers the size of whole floors (1975), Windows 95 is no longer our preferred OS, we watch LCD high def TV's rather than black and white(1965) and Google has been around (in computer years) as long as the transistor radio. :)
Read the rest at:
http://www.elainevigneault.com/2007/04/08/seman...
Glad you're feeling better. BTW, I don't think that Radar will be alone in opening up the Web 3.0 space. Recently we've seen a MUVE game that is entirely composed with semantic agents in a multi-user 3D immersive environment based on Croquet.
Also, there are a number of plays that are pushing well beyond the threshold of keyword searching. Some now fully combine semantics and linguistics.
fourth quarter 2007 should be interesting
Mills Davis.