EMC Archives Unstructured Data
Grok Headline matches for EMC Archives Unstructured Data
FAST Lassos Unstructured Data
FAST Lassos Unstructured Data
01/27/2004 05:19 PMInternet News Jan 27 2004 10:00PM GMT
Business Intelligence from Unstructured
Data
Business Intelligence from Unstructured
Data
12/20/2003 07:31 AMBusiness Intelligence from Unstructured Data by Sundar
Kadayamhttp://intelliseek.com/w
hitepapers.aspAn excellent white paper on Business
Intelligence from Unstructured Data by my good friend Sundar Kadayam.
The real difference between structured
and unstructured data (XML.org)
The real difference between structured
and unstructured data (XML.org)
06/11/2002 07:06 PMKazeon's storage appliance handles
unstructured data
Kazeon's storage appliance handles
unstructured data
03/22/2005 03:35 PMKazeon Systems says it is set to introduce a storage appliance that
not only analyzes the unstructured data on file servers but also
decides where to store that data, and determines who can retrieve it
and when, based on rules that IT creates.
Text Mining Begins Digging Through
Unstructured Data
Text Mining Begins Digging Through
Unstructured Data
07/03/2004 05:34 AMText Mining Begins Digging Through Unstructured
Datahttp://snipurl.com/7gppA good
85% of an organization's knowledge is in the form of unstructured
data. Easy to quantify, hard to find. "We are drowning in information
but are starving for knowledge," says an R&D technical leader at Dow
Chemical. But a new generation of text mining tools is allowing
companies to extract key elements from large unstructured data sets,
discover relationships and summarize the information. Dow, for
example, uses ClearResearch to extract data from chemical patent
abstracts, published research papers and the company's own files. And
the University of Louisville uses SAS's Text Miner on text files, such
as patient charts, and analyzes flat-file snapshots of billing and
pharmaceutical databases as text, rather than as database entries.
Researchers there have pinpointed certain medications that can prolong
hospital stays for patients. Because of some limitations in the new
software (such as understanding linguistics), text miners are still
niche products, generally restricted to specific parts of an
organization and requiring specialized analytical skills to implement
and deliver truly useful information. It'll be awhile before they're
commonly available. But some vendors are already incorporating text
mining tools as a background function to improve the effectiveness of
more familiar search or document management applications.
National Archives launches digital data
repository
National Archives launches digital data
repository
02/13/2004 03:23 AMPublicTechnology.net Feb 13 2004 8:11AM GMT
Open Archives Initiative Data Providers
- Part I
Open Archives Initiative Data Providers
- Part I
04/16/2004 06:20 AMOpen Archives Initiative Data Providers - Part Ihttp://www
.public.iastate.edu/~gerrymck/OAI-DP-I.pdfGerry
McKiernan announced the availability of his latest eProfile column
from the Library Hi Tech News V21 N3 (April 2004) 11-19 titled "OPen
Archives Initiative Data Providers - Part I". Definitely an excellent
resource and bookmark. This will be added to
Academic Resources
2004 Internet MiniGuide.
Unstructured Information Management
Unstructured Information Management
12/20/2003 07:31 AMUnstructured Information Management by Robert D. Kugel of
IntelligentBPM http://www.intelligentbpm.com/feature/2003/12/0312feat2_1.shtml This white paper, from
Ventana Research, offers a
lucid explanation of what "unstructured information" actually means,
and why it will consume a significant amount of IT resources in the
coming years. Structured data is the easily classified stuff --
names, addresses, zip codes, SKU numbers, etc. Unstructured data "does
not readily fit into structured databases except as binary large
objects (BLOBs)." Examples given include e-mails, multimedia files,
document files.... Although these objects may have some structure --
e.g., an e-mail address -- they are not easily classified for storage
in a structured format that makes a typical database happy. As the
amount of this unstructured data increases exponentially, solutions
are being sought; XMLis a big help because of its flexible tagging
system. If this data cannot be efficiently stored and retrieved, it
has little or no utility. The white paper identifies six potential
components of a viable storage system: document management, Web
content management, records management, digital rights management,
collaboration, and image capture. All of these elements are emerging
as critical, especially in light of today's more stringent regulatory
environment (i.e.,
Sarban
es-Oxley) which dictates compliance standards for information
retention. (
C
urrent Cities December 2003)
Transforming Unstructured Content into
Meaningful XML
Transforming Unstructured Content into
Meaningful XML
10/23/2002 03:00 AMVenetica, IBM bridge unstructured and
structured content
Venetica, IBM bridge unstructured and
structured content
04/29/2004 07:43 AMIBM and Venetica this week announced a partnership designed to help
customers more easily access and work with both unstructured and
structured information through a single SQL-based query.
BBned selects Allied Data Technologies
as Supplier for combined Voice and Data
IAD
BBned selects Allied Data Technologies
as Supplier for combined Voice and Data
IAD
09/15/2004 02:24 AMAllied Data Technologies, specialist of Customer Premises Equipment
(CPE) for the Local Loop (PSTN, ISDN, xDSL), today announces their
agreement with BBned, largest provider for high-quality DSL services
in the Netherlands, for the supply of CPE equipment for their Voice
over DSL services in the Netherlands. The agreement involves the
delivery of Voice Integrated Access Devices (IAD), called the
CopperJet 816-2P, with the intention of a follow up order and delivery
next year. The initial shipment will take place this year. [PRWEB Sep
15, 2004]
Epic Data Introduces the MPT9500
Mini-Workstation for Data Collection
Epic Data Introduces the MPT9500
Mini-Workstation for Data Collection
04/08/2005 01:08 AMBC Technology Apr 8 2005 5:34AM GMT
OLAP and Data Warehousing (Data
Warehouse solution architecture)
OLAP and Data Warehousing (Data
Warehouse solution architecture)
07/12/2004 09:12 PMConvert data between XML and relational,
LDAP data (Advisor.com)
Convert data between XML and relational,
LDAP data (Advisor.com)
10/11/2002 07:56 AMConvert data between XML and relational,
LDAP data (Advisor.com)
Convert data between XML and relational,
LDAP data (Advisor.com)
10/09/2002 10:47 AMProsoft releases Data Backup 2.0, Data
Rescue X 10.4
Prosoft releases Data Backup 2.0, Data
Rescue X 10.4
01/06/2004 11:53 AMProsoft Engineering today released Data Backup 2.0 and Data Rescue X
10.4 at Macworld Expo (Booth #834)...
Ariadne Genomics Announces the Release
of Seqware Data Center, Self-Updating
Sequence Data Management and Personal
Blast System
Ariadne Genomics Announces the Release
of Seqware Data Center, Self-Updating
Sequence Data Management and Personal
Blast System
03/14/2005 05:08 PMAriadne Genomics, Inc. today announced the release of Seqware Data
Center, a sequence data management and personal BLAST software,
enabling scientists to easily maintain and search annotated sequence
collections. Seqware comes with GenBank pre-loaded. A free trial of
Seqware is available at www.ariadnegenomics.com/products/seqware.html.
[PRWEB Feb 7, 2005]
SecureSpeed, LLC, Leading Provider of
Data Storage, Data Backup, Security and
Disaster Recovery Solutions Names VP,
Sales and Marketing
SecureSpeed, LLC, Leading Provider of
Data Storage, Data Backup, Security and
Disaster Recovery Solutions Names VP,
Sales and Marketing
12/26/2004 04:50 AMSecureSpeed LLC named Ludwig Terán as Vice President of Sales and
Marketing. [PRWEB Dec 26, 2004]
Franklin Data Releases Break-Thru
Technology to Help Law Firms,
Corporations, and Government Agencies to
Capture Lotus Notes Rich Text Formatting
(RTF) for Use in Electronic Data
Discovery.
Franklin Data Releases Break-Thru
Technology to Help Law Firms,
Corporations, and Government Agencies to
Capture Lotus Notes Rich Text Formatting
(RTF) for Use in Electronic Data
Discovery.
07/28/2004 02:21 AMNew Lotus Notes offering helps demonstrate Franklin Data's leading
technologies by processing the full rich text format of Lotus Notes
E-mails, a continuing complexity of its competitors. [PRWEB Jul 28,
2004]
Data Guard Systems proudly announces
that CellularManager, its popular
Internet based POS for cellular
retailers, will now share data with
Intuit’s QuickBooks(r)
Pro/Premier/Canadian 2003 & 2004.
Data Guard Systems proudly announces
that CellularManager, its popular
Internet based POS for cellular
retailers, will now share data with
Intuit’s QuickBooks(r)
Pro/Premier/Canadian 2003 & 2004.
06/02/2004 02:19 AMData Guard Systems, Inc., proudly announces that CellularManager, its
popular Internet-based point-of-sale and enterprise management
software platform will now share data with Intuit’s QuickBooks(r)
software products – including QuickBooks Pro, Premier, and Canadian
Editions 2003 and 2004. This new financial module will all current
and future subscribers of CellularManager to share critical and
essential financial data with QuickBooks, which will allow cellular
retailers to better manage their businesses while saving valuable
time, effort, and money. [PRWEB Jun 2, 2004]
Data Guard Systems Announces New
CellularManager Carrier Interface to
Push Customer Data to Cellular Carrier
Websites
Data Guard Systems Announces New
CellularManager Carrier Interface to
Push Customer Data to Cellular Carrier
Websites
08/10/2004 03:20 AMNEW Features in CellularManager, Cellular POS Software for Cellular
Retailers, allow customers to push customer information data entered
at the point-of-sale into the carrier activation websites, thus
eliminating double-entry and making the point-of-sale more efficient.
[PRWEB Aug 10, 2004]
Help - I Lost My Digital Photos or Data
from my Memory Device - Visit
eProvided.com, Data Recovery & Photo
Recovery in 24 Hours or Less to Anywhere
Worldwide
Help - I Lost My Digital Photos or Data
from my Memory Device - Visit
eProvided.com, Data Recovery & Photo
Recovery in 24 Hours or Less to Anywhere
Worldwide
06/05/2005 11:14 PMFor those who have lost their digital photos or data files on a USB
media device, flash memory stick, DVD or CD eProvided will help
recover them within 24 hours. Whether images or files are deleted,
lost, reformatted or damaged, there’s now a simple, practical
solution. People now trust big name flash memory card manufacturers
with their personal files & photos; little do they know these devices
fail often enough to create serious issues for users. What will people
do if their data or photos become missing? [PRWEB Jun 2, 2005]
Data Recovery Services is Offering Data
Recovery for Florida Hospitals and
Emergency Service Organizations Affected
by Hurricane Charley
Data Recovery Services is Offering Data
Recovery for Florida Hospitals and
Emergency Service Organizations Affected
by Hurricane Charley
08/18/2004 03:32 AMDRS specializes in computer forensics with a proven record of
maintaining chain of custody for all damaged media that arrives at our
facilities. [PRWEB Aug 18, 2004]
Content Data Synchroniser (C.D.S.)
obtains Transora certification on the
Transora Data Synchronization Network
(TDSN)
Content Data Synchroniser (C.D.S.)
obtains Transora certification on the
Transora Data Synchronization Network
(TDSN)
03/27/2005 03:20 AMInflue, European software publisher specialized in electronic data
exchange solutions and collaborative technologies, has successfully
completed the Transora certification for C.D.S., Product Information
Management tool ( PIM ) at the end of November. Certification means
that C.D.S is able to synchonize data with Transora Home Data Pool
(TDSN) based on the latest Global Data Synchronization Standards
embodied in Transora ‘s TDSN. [PRWEB Mar 27, 2005]
Web Data Extractors: V2N37 September 13,
2004 Current Awareness happenings on the
Internet: Web Data Extractors
Web Data Extractors: V2N37 September 13,
2004 Current Awareness happenings on the
Internet: Web Data Extractors
09/13/2004 10:40 AMThis edition of Current Awareness Happenings on the Internet by
Marcus P. Zillman, M.S.,
A.M.H.A. September 13, 2004 V2N37 discusses one of my latest white
paper link compilations titled Web Data Extractors. Click on the below
audio posting to hear my audio describing this excellent resource.
Download this resource at:
Web Data
Extractorshttp://zillman.blogspot.com/2004_08_01_zillman_archive
.html#109250380875057586
CBL Data Recovery Technologies Launches
Global Data Recovery Partner Program
CBL Data Recovery Technologies Launches
Global Data Recovery Partner Program
03/14/2005 06:10 PMData Recovery Advantage Program Gives VARs, OEMs and IT providers
essential service to enhance offerings to customers [PRWEB Mar 5,
2005]
Archives
Archives
03/11/2003 09:44 AMI was recently going through the office closet and dealing with old
hard drives. I found some weird old graphics from like 1996.
I don’t know what InnerWeb was supposed to be—but
it’s a tracing of my head.

A friend of mine was working on starting an Internet café, and
this was part of some website graphics we were working on. (The
café didn’t happen.)

We were working on a product named Scramble at some point. It
didn’t ship. I don’t remember what it was going to be.

Here’s a metal thing. I think it’s a piece from a lighter.
I have no idea what the point is here.

Need a searchlight?

Here’s an old-fashioned Home button.

Here’s a six-shooter. To be used on only the nastiest of
bugs.

Here’s a weird 3D thing.

Here’s a colorful engine.

daypop archives
daypop archives
04/21/2004 07:24 AMpermalinking to the past
The ETEXT Archives
The ETEXT Archives
12/08/2003 06:59 AMThe ETEXT Archiveshttp://www.etext.org/Home
to electronic texts of all kinds, from the sacred to the profane, from
the political to the personal. Our mission is to provide electronic
versions of texts without judging their content.
plow through the archives
plow through the archives
11/02/2003 06:30 AMLGF Watch ..
,
lgfwatch.blogspot.com/2003_10_01_lgfwatch_archive.html#10662527866
3626224
track this
site | 3 links
The New Yorker: From the Archives
The New Yorker: From the Archives
12/22/2004 01:05 AM“A Visit from Saint Nicholas (In the Ernest Hemingway
Manner)” ..
FICTION
newyorker.com/archive/content/?031222fr_archive01
track this
site | 5 links
Top 40 - DayPop Archives
Top 40 - DayPop Archives
04/14/2004 06:26 AMTop 40 - DayPop Archiveshttp://www.daypop.com/archiv
e/top/What are the hot topics in the weblogging
community today or yesterday or last year? The Daypop Top 40 Archives
is a list of links that are currently and previously popular with
webloggers from around the world.
Moving Archives
Moving Archives
12/30/2003 07:22 AM The
Sorcerer's Scissors;
Air Raid
Practice, Knoll School Hove; and
An Eye to
the Future [wmv's all, I'm afraid]. These and
other examples nonpareil available at the University of Brighton's
Moving History: "A
guide to UK film and television archives in the public sector".
BLACKFIVE: Someone You Should Know
Archives
BLACKFIVE: Someone You Should Know
Archives
05/12/2004 05:26 AMA Marine you won't hear about on CNN, ABC, CBS or NBC .. Read about
some American heroes here .. people like Kerry and Kennedy, ..
Blackfive has documented .. people you should
know,
blackfive.net/main/someone_you_should_know/index.html
track this
site | 5 links
Photo Archives: OTC TIE Fighter
Photo Archives: OTC TIE Fighter
06/02/2004 01:26 PMOur
Original Trilogy
Collection Photo Archives gets its first official entry today with
the "new" version of the
TIE Fighter.
This vehicle is the same version as
last year's Saga
release, only this time there is no pilot included. Look for the
TIE Fighter Pilot to come separately later this year.
Jedi Archives Update
Jedi Archives Update
12/03/2003 04:53 PMThe
Jedi Archives is
updated today with Morrita and Murillo
cig
ar bands from the Netherlands. See eBay Today for more
information on this 290 item set and a few eBay auctions to browse.
National Archives are Going Digital?
National Archives are Going Digital?
08/09/2004 09:41 AMA PC World Story reports that the National Archives are going to offer
an Electronic Records Archives, scheduled to open by 2007. However, it
won't be complete; the archives are...
Photo Archives: OTC Greedo
Photo Archives: OTC Greedo
08/23/2004 06:54 AMOur
Photo Archives
is updated today with the
Original Trilogy
Collection version of the Rodian bounty hunter
Greedo. While the
debate continues as to who shot first in that Mos Eisley Cantina scene
from
A New Hope, the end result is the same...poor Greedo is
still dead.
Dutch National Archives
Dutch National Archives
04/14/2004 01:08 PM
500,000 pictures taken between 1880 and 1990 are now in a searchable
Dutch National Archive
Image Bank. If you speak enough Dutch to navigate the site,
there's quite a lot of history here. It looks like current Dutch copyright laws are similar to the US, lasting until a
creator's death + 70 years, so it's tough to tell how much of the
archive is free for reuse. Still, it's cool to see another country
take their archives online for everyone to see. [thanks prolific]
Grok Description matches for EMC Archives Unstructured Data
GrokA matches for EMC Archives Unstructured Data
Edmunds.com deploys text mining tool for
user forms
Edmunds.com deploys text mining tool for
user forms
08/06/2004 06:17 PMEdmunds.com, an online service for vehicle information, unveiled its
latest tool to mine the potentially invaluable data stored as
unstructured content in its user forums, consumer ratings, and reviews
archives.
A Roadmap to Text Mining and Web Mining
A Roadmap to Text Mining and Web Mining
12/30/2003 10:53 AMA Roadmap to Text Mining and Web Mininghttp://www
.cs.utexas.edu/users/pebronia/text-mining/A
comprehensive portal covering all protocols and sources related to
text mining and web mining. Text mining is about looking for
regularities, patterns or trends in natural language text, and usually
is about analyzing text for particular purposes. Inspired by data
mining, which discovers prominent patterns from highly structured
databases, text mining aims to extract useful knowledge from
unstructured or semi-structured text. Text Mining is a
cross-disciplinary field including, but not limited to:
*
Information Extraction(IE)
* Natural Language Processing(NLP) and
Computational Linguistics(CL)
* Machine Learning(ML)
*
Information Retrieval(IR)
* Data Mining(DM) or Knowledge
Discovery from Databases(KDD)
* Information Management and
Visualization
The Word on Text Mining
The Word on Text Mining
12/10/2003 10:21 AMThe Word on Text Mining by Seth Grimeshttp://www.intelligententerprise.com/031210/619decision1_1.shtml
Text analytics provide concept discovery, automated
classification, and innovative displays for volumes of unstructured
documents. This article written by
Seth Grimes for
Intelligent Enterprise
gives the latest and most current happenings on text mining on the
Internet.
theConcept text mining software debuts
theConcept text mining software debuts
07/22/2004 05:02 PMMesa Dynamics has released
theConcept, a text mining application for Mac OS X that analyzes
documents for keyword identification, content categorization and
contextual research. It can pull text from local files and Web sites,
and can process results from search engines using an XML plug-in
architecture. TheConcept is also designed to work with Beholder, an
image mining software application introduced by Mesa Dynamics in 2003.
TheConcept is available for trial download from the Web site; it
requires Mac OS X v10.2 or later, and costs US$39.95.
MedScan - Automated Scientific Text
Mining Tool
MedScan - Automated Scientific Text
Mining Tool
09/16/2004 11:14 AMMedScan - Automated Scientific Text Mining Toolhttp://www
.ariadnegenomics.com/products/medscan.htmlMedScan is
advanced scientific text mining software tool, automatically
extracting biological facts from scientific literature and MEDLINE
abstracts. MedScan extracts functional associations between proteins,
cell processes and small molecules, recognizes types of regulatory
mechanisms involved and the effects of regulation, and can be
customized to extract other information. Captured data is presented as
a datasheet, an XML file or a pathway diagram. You can use MedScan to
automatically extract information from:
*
MEDLINE abstracts and full text articles
* MS Office, PDF and
TXT files
* Catalogs and archives
* Web pages and
HTML documents.
This has been added to
Data Mining Resources
Subject Tracer™ Information Blog. This has also been added to
Biological
Informatics Subject Tracer™ Information Blog.
What if Intelliseek Bought Technorati?
What if Intelliseek Bought Technorati?
04/04/2005 04:28 PMThe more I think about Intelliseek, the more I think they'll either
try to muscle Technorati out of their market or they'll buy 'em.
Intelliseek's BlogPulse Conversation Tracker is the sort of tool that
I expected Technorati to build last year. In fact, I've joked about
trying to build it myself using their API. (Could I sell it to them if
I did?) And the BlogPulse Trend Search is kinda cool too. Just for
kicks, I did a Trend Search...
After "Dating Mining", Try a Little
"Reality Mining"
After "Dating Mining", Try a Little
"Reality Mining"
04/13/2004 06:11 AMAfter "Dating Mining", Try a Little "Reality
Mining"http://www.techreview.com/articles/wo_pentland033104.asp At the MIT Media Labs, the Human Design research group is
working on "reality mining" projects that use commonplace wearable
technology to identify a company's de facto organization chart (as
opposed to the theoretical and often-ignored one on the wall). The
group is using two approaches: The first provides an Expert and
Collaborator Locator, which uses speech recognition technology to
generate profiles of individuals based on the words they use in
conversations; the second offers Collaboration Tools, which makes it
possible to query a database of employee profiles of interests,
skills, or even recently used vocabulary, in order to find people who
might work well together. The researchers say: "We expect that by
aggregating this information, interpreting it in terms of work tasks,
and modeling the dynamics of the interactions, we will be better able
to understand and manage complex organizations." There are privacy
issues, of course, and here the key is to make the system transparent
and to allow employees to scrutinize their bosses' behavior. This
could be interesting.
Edmunds.com - Research and Price a New
Vehicle
Edmunds.com - Research and Price a New
Vehicle
09/23/2004 06:51 AMAd - www.edmunds.com Sep 23 2004 11:04AM GMT
Edmunds.com Identifies Car Dealers Best
Equipped to Serve Internet Users
Edmunds.com Identifies Car Dealers Best
Equipped to Serve Internet Users
04/07/2005 02:34 PMPR Newswire via Wards Apr 7 2005 6:39PM GMT
Edmunds.com Tests Paid Search to Drive
Traffic: Discover Their Results
Edmunds.com Tests Paid Search to Drive
Traffic: Discover Their Results
07/12/2004 04:11 PMSource: ContentBiz - Should a content site pay search engines for
traffic? If you're not selling directly online, how do you set a
search budget anyway?...
W3C Releases Public Working Draft for
Full-Text Searching of XML Text and
Documents
W3C Releases Public Working Draft for
Full-Text Searching of XML Text and
Documents
07/13/2004 06:43 PMXMLMania.com Jul 13 2004 10:01PM GMT
SWF Text Version 1.1: Feature-Rich Flash
Text Animation Tool for Dummies
SWF Text Version 1.1: Feature-Rich Flash
Text Animation Tool for Dummies
04/08/2005 05:09 AMAntsSoft today announced the release of SWF Text version 1.1, an
innovative text animation tool for producing professional-quality
Flash movies in five minutes [PRWEB Apr 8, 2005]
Building a Blog with Dreamweaver, PHP,
and MySQL - Part 6: Replacing Text Areas
with Rich Text Editors
Building a Blog with Dreamweaver, PHP,
and MySQL - Part 6: Replacing Text Areas
with Rich Text Editors
12/22/2004 01:47 AMIn this final installment, learn how to transform the familiar HTML
text area into a rich text editor with formatting and file-uploading
capabilities.
Ariadne Genomics Launches MedScan™
Text-to-Knowledge Suite 2.0, Unique Tool
that Converts Scientific Text into a
Database of Functional Relationships
Ariadne Genomics Launches MedScan™
Text-to-Knowledge Suite 2.0, Unique Tool
that Converts Scientific Text into a
Database of Functional Relationships
06/05/2005 11:58 PMAriadne Genomics, Inc. today announced the launch of MedScan™
Text-to-Knowledge Suite 2.0, a Natural Language Processing-based tool
for automated extraction of biological facts from scientific
literature, MEDLINE abstracts, and other text sources. A demo version
of MedScan is available at www.ariadnegenomics.com. [PRWEB May 18,
2005]
AppleScript FAQ: String, Text, Unicode
Text
AppleScript FAQ: String, Text, Unicode
Text
12/17/2004 06:30 PMA string is a bunch of characters of the ASCII table (including
extended ASCII, that is, all the 256 characters).
text is the same. styled text is a string containing information about
the standard styles: on styles (that is plain, bold, italic,
underline, outline, shadow, condensed, expanded), font id, size,
color, etc. Unicode text is supossed to contain any character in the
known languages (latin, japanese, etc.). The information is kept in
two bytes. If you are thinking in a multi-language solution in your
script, you may use this kind of text as your standard.
Mining the Value of Metrics
Mining the Value of Metrics
09/27/2004 07:02 AMConsumer Goods Technology Sep 27 2004 11:17AM GMT
Data Mining Goes 3D
Data Mining Goes 3D
07/11/2004 12:59 PMMining the Tagged Web
Mining the Tagged Web
03/06/2004 02:04 AMConsulting Google to track down "jaguar," for example, generates an
alarming list of more than 7 million documents—a mad muddle of entries
about cars, animals ...
Mining a Big Winner
Mining a Big Winner
09/03/2004 02:48 PMMine Safety's stock has risen almost 150% in the year since it was
recommended in Hidden Gems.
Mining the intranet
Mining the intranet
12/15/2003 10:31 AM
Of course sites such as Amazon and Google have reasons to create
formal APIs and gate access to them. But on an enterprise intranet the
threat is disuse, not overuse. You're publishing information that you
want people to find, exploit, and recombine. When it's appropriate to
use SOAP and WSDL -- for example, when queries require fancy
authorization or complex inputs -- then do so. But when a simpler
strategy will suffice, don't be ashamed to use it. Between the
primordial tag soup of HTML and the formal realm of Web services lies
a large and fertile middle ground: XHTML. Information that you publish
in XHTML can be directly consumed by browsers, and it's much
friendlier to spiders than ill-formed HTML. If you hope people will
mine your intranet, make the job as easy as it can be. [Full story at
InfoWorld.com]
I sometimes worry that I harp too much on these kinds of simple home
truths. But Mike Champion's
review of my XML 2003
keynote was a nice bit of validation:
...Mining for memes
Mining for memes
06/17/2005 07:21 PM
Bruce Schneier
wonders if the
ongoing
reports of identity loss are creating a boy-who-cried-wolf
situation. Are people starting to tune this stuff out? And will that
result in less pressure for reform?
...EPIC on data mining
EPIC on data mining
05/27/2004 07:38 PMp2pnet.net May 27 2004 11:07PM GMT
Is Space Mining Feasible?
Is Space Mining Feasible?
11/19/2003 04:36 PMRoland Piquepaille writes "There is a large amount of precious
minerals on the Moon and Mars. Would it be feasible to bring these
valuable materials back on ...
Data Mining Resources
Data Mining Resources
05/28/2004 05:09 AM
Data
Mining Resourceshttp://www.DataMiningResou
rces.info/Data Mining Resources is a Subject
Tracer™ Information Blog developed and created by the
Virtual Private
Library™. It is designed to bring together the latest
resources and sources on an ongoing basis for data mining information.
We always welcome suggestions of additional sites and resources to be
added to this comprehensive listing and please submit by clicking
here. This site has been developed and is
maintained by
Marcus P.
Zillman, M.S., A.M.H.A.. Additional links and resources by Marcus
are available by clicking
here.
GAO: Fed Data Mining Extensive
GAO: Fed Data Mining Extensive
05/28/2004 04:51 AMIn a new report, the investigative arm of the government finds that
data mining by federal agencies is ubiquitous. A watchdog group offers
a second report suggesting ways to protect privacy. By Kim Zetter.
Data Mining For the Masses
Data Mining For the Masses
07/29/2004 08:17 PMA proposed data mining spec gets the green light for J2EE-compliant
application servers.
EU data mining hacks available for U.S.
EU data mining hacks available for U.S.
09/04/2004 06:57 AMMining Mars from Houston
Mining Mars from Houston
03/15/2003 07:05 AMMining the Campaign War Chests
Mining the Campaign War Chests
05/19/2004 11:45 PMFundrace.org tracks contributions made by from individuals to
Democratic and Republican presidential candidates since January 2003.
It does not include contributions to third-party candidates or
Congressional campaigns, or the far larger amounts given by political
action committees, companies, unions and other groups. But other sites
enable you to follow the money further and deeper.
EMC Archives Unstructured Data