2,153
edits
Line 11: | Line 11: | ||
The Internet age is a major part of a an explosion in computer use in our age. One of the most remarkable things about the Web is it is based on HTML, a text format that is highly accessible by people ''and'' computers. Every Web page uses the same syntax to indicate what should be displayed, they all use the same retrieval mechanisms. This was a remarkable and unexpected breakthrough in communications, but the way companies jumped in to make the Web more attractive (Flash-y) and commerce-friendly did little to enable background information exchange. Today's focus on the forthcoming HTML 5, with its built in facilities for multimedia and interaction, helps mitigate these problems. | The Internet age is a major part of a an explosion in computer use in our age. One of the most remarkable things about the Web is it is based on HTML, a text format that is highly accessible by people ''and'' computers. Every Web page uses the same syntax to indicate what should be displayed, they all use the same retrieval mechanisms. This was a remarkable and unexpected breakthrough in communications, but the way companies jumped in to make the Web more attractive (Flash-y) and commerce-friendly did little to enable background information exchange. Today's focus on the forthcoming HTML 5, with its built in facilities for multimedia and interaction, helps mitigate these problems. | ||
There are two main technical requirements to make digital information reusable and available - | There are two main technical requirements to make digital information reusable and available — well-known access mechanisms, and the descriptions of how data is organized and detailed for reliable re-use. Efforts over the years have struggled with complexity and standardization, with major initiatives interfering with each other for technical or market reasons. However, with the value seen through rich information re-use (for example, in banking applications and partnership programs), many practical ad hoc, de facto and standard methods exists. | ||
Another major concern is the incentive to share information. Today it's common for non profit organizations to hoard their information, to create "proprietary databases" they can use to pitch to granting agencies. Another factor is that ignoring standards allows efforts to move ahead on their own terms, without making their systems fit into larger systems which could slow them down. Another factor is insecurity | Another major concern is the incentive to share information. Today it's common for non profit organizations to hoard their information, to create "proprietary databases" they can use to pitch to granting agencies. Another factor is that ignoring standards allows efforts to move ahead on their own terms, without making their systems fit into larger systems which could slow them down. Another factor is insecurity — an organization may have a perfectly useful database, but in implementation it may not compare well to best technical efforts. | ||
Trust is another issue. Many people do not think it's appropriate to share "government data," "hospital data," and so on. Yet within these monolithic descriptions, there are vast swaths of data that do not endanger individuals. | Trust is another issue. Many people do not think it's appropriate to share "government data," "hospital data," and so on. Yet within these monolithic descriptions, there are vast swaths of data that do not endanger individuals. | ||
Another factor holding things back is how we use computers today | Another factor holding things back is how we use computers today — for the most part, like a typewriter. Not many people embed data from spreadsheets into their email, use automatic facilities for events and contacts, share to-do tasks, and so on. Documents and communications are one-offs, out of date the moment they're sent, and nothing is explicit in them. A semantic approach to computer data will help change this. Data will be more consistent, and when it comes to important statements we should be able to expect more. Increasing [http://en.wikipedia.org/wiki/Digital_literacy digital literacy] is an issue here. Services like Facebook and Twitter introduce participation and embedded data that leads the way, along with the popular idea of 'infotainment.' Mapquest pushed ahead with interactive maps — information about the public environment — and today these services are better designed and more available than facilities produced by the government. The forces of automated, worldwide spam and fraud on the Internet are creating defenses, cultural and technical, suitable for mass participation. | ||
The Internet has been mainstream for 15 years, nearly a generation of new and experienced users, programmers, researchers and so on using the most advanced systems available freely around the world. Innovation is amplified by international access and competition. We're starting to see real breakthroughs in Semantic Web type applications. With unlimited room for our improvement by building on rather than hoarding data, and the recognition of the value of a true participatory society, efforts to not share public data will have a stunting effect. | The Internet has been mainstream for 15 years, nearly a generation of new and experienced users, programmers, researchers and so on using the most advanced systems available freely around the world. Innovation is amplified by international access and competition. We're starting to see real breakthroughs in Semantic Web type applications. With unlimited room for our improvement by building on rather than hoarding data, and the recognition of the value of a true participatory society, efforts to not share public data will have a stunting effect. |