Venetica, IBM bridge unstructured and structured content
Grok Headline matches for Venetica, IBM bridge unstructured and structured content
The real difference between structured
and unstructured data (XML.org)
The real difference between structured
and unstructured data (XML.org)
06/11/2002 07:06 PMIBM to Buy Content Management Developer
Venetica
IBM to Buy Content Management Developer
Venetica
08/27/2004 01:39 PMIBM plans to integrate the content integration software maker's
technology into the DB2 Information Integrator family of products.
Transforming Unstructured Content into
Meaningful XML
Transforming Unstructured Content into
Meaningful XML
10/23/2002 03:00 AMSyberWorks Selected by the American
Contract Bridge League to Develop Online
Bridge Education Content through the
SyberWorks Hosted e-Learning Solution
SyberWorks Selected by the American
Contract Bridge League to Develop Online
Bridge Education Content through the
SyberWorks Hosted e-Learning Solution
06/05/2005 10:52 PMSyberWorks, a leader in custom e-Learning Solutions and the Learning
Management System industry, announced today it has been selected by
the American Contract Bridge League to develop online bridge education
content and host its online bridge training programs using the
SyberWorks Hosted e-Learning Solution. [PRWEB May 23, 2005]
EMC Archives Unstructured Data
EMC Archives Unstructured Data
07/26/2004 02:06 PMEMC Corp. is uniting configurable policy-based content management with
storage resource allocation tools to help customers cut costs by
automating archiving capabilities.
Unstructured Information Management
Unstructured Information Management
12/20/2003 07:31 AMUnstructured Information Management by Robert D. Kugel of
IntelligentBPM http://www.intelligentbpm.com/feature/2003/12/0312feat2_1.shtml This white paper, from
Ventana Research, offers a
lucid explanation of what "unstructured information" actually means,
and why it will consume a significant amount of IT resources in the
coming years. Structured data is the easily classified stuff --
names, addresses, zip codes, SKU numbers, etc. Unstructured data "does
not readily fit into structured databases except as binary large
objects (BLOBs)." Examples given include e-mails, multimedia files,
document files.... Although these objects may have some structure --
e.g., an e-mail address -- they are not easily classified for storage
in a structured format that makes a typical database happy. As the
amount of this unstructured data increases exponentially, solutions
are being sought; XMLis a big help because of its flexible tagging
system. If this data cannot be efficiently stored and retrieved, it
has little or no utility. The white paper identifies six potential
components of a viable storage system: document management, Web
content management, records management, digital rights management,
collaboration, and image capture. All of these elements are emerging
as critical, especially in light of today's more stringent regulatory
environment (i.e.,
Sarban
es-Oxley) which dictates compliance standards for information
retention. (
C
urrent Cities December 2003)
FAST Lassos Unstructured Data
FAST Lassos Unstructured Data
01/27/2004 05:19 PMInternet News Jan 27 2004 10:00PM GMT
Business Intelligence from Unstructured
Data
Business Intelligence from Unstructured
Data
12/20/2003 07:31 AMBusiness Intelligence from Unstructured Data by Sundar
Kadayamhttp://intelliseek.com/w
hitepapers.aspAn excellent white paper on Business
Intelligence from Unstructured Data by my good friend Sundar Kadayam.
Text Mining Begins Digging Through
Unstructured Data
Text Mining Begins Digging Through
Unstructured Data
07/03/2004 05:34 AMText Mining Begins Digging Through Unstructured
Datahttp://snipurl.com/7gppA good
85% of an organization's knowledge is in the form of unstructured
data. Easy to quantify, hard to find. "We are drowning in information
but are starving for knowledge," says an R&D technical leader at Dow
Chemical. But a new generation of text mining tools is allowing
companies to extract key elements from large unstructured data sets,
discover relationships and summarize the information. Dow, for
example, uses ClearResearch to extract data from chemical patent
abstracts, published research papers and the company's own files. And
the University of Louisville uses SAS's Text Miner on text files, such
as patient charts, and analyzes flat-file snapshots of billing and
pharmaceutical databases as text, rather than as database entries.
Researchers there have pinpointed certain medications that can prolong
hospital stays for patients. Because of some limitations in the new
software (such as understanding linguistics), text miners are still
niche products, generally restricted to specific parts of an
organization and requiring specialized analytical skills to implement
and deliver truly useful information. It'll be awhile before they're
commonly available. But some vendors are already incorporating text
mining tools as a background function to improve the effectiveness of
more familiar search or document management applications.
Kazeon's storage appliance handles
unstructured data
Kazeon's storage appliance handles
unstructured data
03/22/2005 03:35 PMKazeon Systems says it is set to introduce a storage appliance that
not only analyzes the unstructured data on file servers but also
decides where to store that data, and determines who can retrieve it
and when, based on rules that IT creates.
Information Bridge Framework Document:
Introducing the Microsoft Office
Information Bridge Framework
Information Bridge Framework Document:
Introducing the Microsoft Office
Information Bridge Framework
05/19/2004 11:41 PMLearn about the Microsoft Office Information Bridge Framework from an
organizational perspective, including the rationale for exposing
line-of-business data to desktop systems and the benefits to the
information workers who rely on this data to perform their daily work.
In addition, review the Information Bridge architecture and the
advantages that this architecture offers for both deployment and
maintenance of solutions.
Structured Blogging
Structured Blogging
04/04/2005 06:05 AMI haven't written yet about Bob Wyman's 'Structured
Blogging" efforts - partially cause I'm so blown away with it -
and how fucking right on fucking PERFECT target it
is.
Dude - you rock.
So it also didn't surprise me when Peter Caputa connected Structured Blogging with Events.
So now how far will be before Seb Paquet writes anotehr phd. on it, or Alf
Eaton connects
it to Flitter and Microcontent Musings'
guy connects it to user interfaces in general and Phil Pearson
hacks up the whole thing - in a weekend - with Leonard Lin (while
flying back from Taiwan.)
Just use the blogosphere to track the hackers and watch the memes
spread. Then watch the Web 2.0 entreprenuers weave the mesh into a
distributed archipelago - connected by open standards, causeways and
bridges.
It's like watching a strethed out, slowed down, choreographed
dance.
Towards structured bl0gging
Towards structured bl0gging
03/15/2003 12:18 PM
Sebastien Paquet:
I believe a critical element to get a
sustainable system is for people to get reasonably quick feedback
in return for the extra effort expended in creating metadata.
"Towards structured bl0gging"
"Towards structured bl0gging"
03/14/2003 03:47 PM"Structured Blogging"
"Structured Blogging"
03/26/2005 01:35 PMStructured Storage: Quo Vadis ?
Structured Storage: Quo Vadis ?
03/06/2004 01:59 AMThe design of
Microsoft structured storage,
which are used among others in Microsoft Office
document format (OLE Compound Document), shows its age. It might not
be suitable anymore to store documents.
Structured change detection
Structured change detection
03/06/2004 01:57 AM
Consider two versions of a Word document saved as XML. There are
"structured diff tools that can map the changes at an intermediate
level, in terms of XML elements. For example, IBM's AlphaWorks site
offers he XML Diff and
Merge Tool for Java, while Microsoft's GotDotNet site offers XML Diff and Patch
for .Net. Both of these free tools can track element-level change.
To get a sense of what's possible, check out Monsell EDM's online demo of its
Delta XML technology. The demo compares two subtly different
versions of a complex graphic -- the standard SVG (Scalable Vector
Graphics) "tiger" benchmark -- and animates the differences between
the two. It's stunningly cool.
As XML becomes the standard way to represent prose, graphics, and
other content, we should expect such change visualization to become
routine. What about code? It has sections, subsections, and
paragraphs, too. XML isn't -- and probably shouldn't be -- the primary
way we read and write code. But the underlying abstract syntax tree
has structure that can -- and arguably should -- help us see and
comprehend the code's evolution. [Full story at
InfoWorld.com]
Ordinarily readers call me on stuff like this, but for once I get a
chance to beat them to the punch. This column certainly should have
mentioned that
Subversion,
the open source project that aims to replace CVS, reached its 1.0
release last week. It looks really good, and I'm investing some time
in learning how to deploy and use it.
...MKDoc-Text-Structured-0.6
MKDoc-Text-Structured-0.6
07/12/2004 05:30 PMMKDoc-Text-Structured-0.81
MKDoc-Text-Structured-0.81
03/30/2005 11:25 AMStructured Document Validator 0.7.2
Structured Document Validator 0.7.2
01/28/2004 10:16 AMA tool for validating the structure and content of documents.
MKDoc-Text-Structured-0.7
MKDoc-Text-Structured-0.7
07/18/2004 05:18 PMMKDoc-Text-Structured-0.3
MKDoc-Text-Structured-0.3
02/17/2004 10:32 AMMKDoc-Text-Structured-0.8
MKDoc-Text-Structured-0.8
07/23/2004 09:37 AMMKDoc-Text-Structured-0.82
MKDoc-Text-Structured-0.82
03/31/2005 08:57 AMStructured Document Validator
Structured Document Validator
01/28/2004 10:15 AMSDValidator version 0.7.2 released
Structured Document Validator 0.7.6
Structured Document Validator 0.7.6
09/17/2004 10:09 AMA tool for validating the structure and content of documents.
Structured Document Validator 0.7.1
Structured Document Validator 0.7.1
11/12/2003 02:54 PMA tool for validating the structure and content of documents.
GNOME Structured File Library 1.9.0
GNOME Structured File Library 1.9.0
05/05/2004 05:41 AMRead and write OLE and Zip files.
DUNS 2d/3d structured multi-block CFD
DUNS 2d/3d structured multi-block CFD
03/15/2003 11:02 AMNew Pre/Post Processor
PubSub's structured bl0gging initiative
PubSub's structured bl0gging initiative
03/24/2005 02:27 PM
I had a fascinating conversation the other day with Bob Wyman, the CTO
and co-founder of
PubSub.com.
PubSub's game is "prospective search" -- watchlists, in other words.
You plug in search terms; PubSub compares them to the new content it
sees; it alerts you when things match, via RSS or XMPP (Jabber). The
obvious application is ego-surfing: watching reactions to your own
name, your company's name, your product's name. But PubSub differs
from, say, a Technorati watchlist, in its commitment to speedy
notification of a match. Time-sensitive matches can be reported in
just a few seconds, Bob says. In this case the delivery mechanism of
choice is XMPP, and PubSub offers sidebars for Firefox and IE to
receive these realtime alerts. I'm not usually in that much of a
hurry, but plenty of folks are: traders, for example, whose antennae
quiveringly anticipate the next scrap of market-moving news.
...Wez Furlong's Blog: Structured Errors in
PHP
Wez Furlong's Blog: Structured Errors in
PHP
05/25/2004 08:15 AMOn
PHP Magazine's site today,
there's a repost from Wez Furlong's weblog discussing
Structured Errors in PHP.
A short history of structured flowcharts
A short history of structured flowcharts
11/12/2003 01:13 PMGNOME Structured File Library 1.10.0
GNOME Structured File Library 1.10.0
07/05/2004 03:59 PMRead and write OLE and Zip files.
Structured Product Solutions Launched
Structured Product Solutions Launched
05/23/2004 01:25 PMTechnology For Finance May 23 2004 4:28PM GMT
Hot Banana Wins 2005 e-Content Award -
Best Content Management System - CMS
Hot Banana Wins 2005 e-Content Award -
Best Content Management System - CMS
04/08/2005 04:55 AMHot Banana Software Inc., a leading North American Web Content
Management Suite (CMS) company, announced today that it has won the
2005 e-Content award for the best Content Management System. The
Canadian e-Content Awards are sponsored by the e-Content Institute and
were created to recognize and honor e-content products and services
used by Canadian organizations and individuals. [PRWEB Apr 8, 2005]
Feedburner Media RSS Support - and soon
Structured Blogging
Feedburner Media RSS Support - and soon
Structured Blogging
06/06/2005 12:13 AMRichard
MacManus has a great write-up on Feedburner supporting Media RSS and
soon - Structured blogging.
What a coincidence - two projects we're getting involved in!
XML & DocBook: Structured Technical
Documentation Authoring
XML & DocBook: Structured Technical
Documentation Authoring
09/01/2004 05:40 AMAn introduction to XML and DocBook: what is it and why should I learn
yet another data format?
Structured Document Validator 0.7.8
(Default branch)
Structured Document Validator 0.7.8
(Default branch)
06/05/2005 11:22 PM
Structured Document Validator implements a generalized method for
validating both the structure and content of structured documents. Any
data format that can be deterministically divided into tags and data
is classed as a structured document. This definition applies to a wide
array of data formats, including XML, Java properties files, and
delimited value files. The application performs validations based on
user-defined Structured Document Definitions (SDDs). It provides an
environment for validation, SDD development, and document editing.
License: GNU Lesser General Public License (LGPL)
Changes:
This release adds new functionality to the Alias mechanism and
improved messages for SDD validation errors and warnings. The parsing
table generation technique has been upgraded from SLR to LR(1).

Scraping the Senate, turning US govt
into structured data
Scraping the Senate, turning US govt
into structured data
09/03/2004 06:29 AM
Cory Doctorow:
Paul Ford has written an article for XML.com about his plan to scrape
all the information he can about the Senate and convert it into
searchable, structured data (much like the UK's brilliant
They Work For You project,
which does the same for Parliament). He's planning to document his
process of converting the Senate's sloppy html into clean XML, and
turn the process into a tutorial on how to make the Semantic Web come
alive.
Of course screen-scraping is itself a dubious process. When the Senate
decides to change its page design, moves the page, or alters the
suffix, I'm out of luck. At the same time, it's hard to argue against
the fact that the Senate's own web site is a definitive source for
up-to-date, reliable information about the current composition of the
Senate. This is a situation that we're likely to encounter again: the
best, most reliable site to get some information is the worst place to
get useful data. Hopefully, as we go forward, we'll have multiple
sources of information on various members of the government, and can
use them all together.
Link
(
via Kottke)
Grok Description matches for Venetica, IBM bridge unstructured and structured content
GrokA matches for Venetica, IBM bridge unstructured and structured content
Venetica, IBM bridge unstructured and structured content