Text Mining Begins Digging Through Unstructured Data
Grok Headline matches for Text Mining Begins Digging Through Unstructured Data
A Roadmap to Text Mining and Web Mining
A Roadmap to Text Mining and Web Mining
12/30/2003 10:53 AMA Roadmap to Text Mining and Web Mininghttp://www
.cs.utexas.edu/users/pebronia/text-mining/A
comprehensive portal covering all protocols and sources related to
text mining and web mining. Text mining is about looking for
regularities, patterns or trends in natural language text, and usually
is about analyzing text for particular purposes. Inspired by data
mining, which discovers prominent patterns from highly structured
databases, text mining aims to extract useful knowledge from
unstructured or semi-structured text. Text Mining is a
cross-disciplinary field including, but not limited to:
*
Information Extraction(IE)
* Natural Language Processing(NLP) and
Computational Linguistics(CL)
* Machine Learning(ML)
*
Information Retrieval(IR)
* Data Mining(DM) or Knowledge
Discovery from Databases(KDD)
* Information Management and
Visualization
EMC Archives Unstructured Data
EMC Archives Unstructured Data
07/26/2004 02:06 PMEMC Corp. is uniting configurable policy-based content management with
storage resource allocation tools to help customers cut costs by
automating archiving capabilities.
FAST Lassos Unstructured Data
FAST Lassos Unstructured Data
01/27/2004 05:19 PMInternet News Jan 27 2004 10:00PM GMT
Business Intelligence from Unstructured
Data
Business Intelligence from Unstructured
Data
12/20/2003 07:31 AMBusiness Intelligence from Unstructured Data by Sundar
Kadayamhttp://intelliseek.com/w
hitepapers.aspAn excellent white paper on Business
Intelligence from Unstructured Data by my good friend Sundar Kadayam.
The real difference between structured
and unstructured data (XML.org)
The real difference between structured
and unstructured data (XML.org)
06/11/2002 07:06 PMKazeon's storage appliance handles
unstructured data
Kazeon's storage appliance handles
unstructured data
03/22/2005 03:35 PMKazeon Systems says it is set to introduce a storage appliance that
not only analyzes the unstructured data on file servers but also
decides where to store that data, and determines who can retrieve it
and when, based on rules that IT creates.
The Word on Text Mining
The Word on Text Mining
12/10/2003 10:21 AMThe Word on Text Mining by Seth Grimeshttp://www.intelligententerprise.com/031210/619decision1_1.shtml
Text analytics provide concept discovery, automated
classification, and innovative displays for volumes of unstructured
documents. This article written by
Seth Grimes for
Intelligent Enterprise
gives the latest and most current happenings on text mining on the
Internet.
theConcept text mining software debuts
theConcept text mining software debuts
07/22/2004 05:02 PMMesa Dynamics has released
theConcept, a text mining application for Mac OS X that analyzes
documents for keyword identification, content categorization and
contextual research. It can pull text from local files and Web sites,
and can process results from search engines using an XML plug-in
architecture. TheConcept is also designed to work with Beholder, an
image mining software application introduced by Mesa Dynamics in 2003.
TheConcept is available for trial download from the Web site; it
requires Mac OS X v10.2 or later, and costs US$39.95.
MedScan - Automated Scientific Text
Mining Tool
MedScan - Automated Scientific Text
Mining Tool
09/16/2004 11:14 AMMedScan - Automated Scientific Text Mining Toolhttp://www
.ariadnegenomics.com/products/medscan.htmlMedScan is
advanced scientific text mining software tool, automatically
extracting biological facts from scientific literature and MEDLINE
abstracts. MedScan extracts functional associations between proteins,
cell processes and small molecules, recognizes types of regulatory
mechanisms involved and the effects of regulation, and can be
customized to extract other information. Captured data is presented as
a datasheet, an XML file or a pathway diagram. You can use MedScan to
automatically extract information from:
*
MEDLINE abstracts and full text articles
* MS Office, PDF and
TXT files
* Catalogs and archives
* Web pages and
HTML documents.
This has been added to
Data Mining Resources
Subject Tracer™ Information Blog. This has also been added to
Biological
Informatics Subject Tracer™ Information Blog.
Edmunds.com deploys text mining tool for
user forms
Edmunds.com deploys text mining tool for
user forms
08/06/2004 06:17 PMEdmunds.com, an online service for vehicle information, unveiled its
latest tool to mine the potentially invaluable data stored as
unstructured content in its user forums, consumer ratings, and reviews
archives.
Data Mining Goes 3D
Data Mining Goes 3D
07/11/2004 12:59 PMData Mining Resources
Data Mining Resources
05/28/2004 05:09 AM
Data
Mining Resourceshttp://www.DataMiningResou
rces.info/Data Mining Resources is a Subject
Tracer™ Information Blog developed and created by the
Virtual Private
Library™. It is designed to bring together the latest
resources and sources on an ongoing basis for data mining information.
We always welcome suggestions of additional sites and resources to be
added to this comprehensive listing and please submit by clicking
here. This site has been developed and is
maintained by
Marcus P.
Zillman, M.S., A.M.H.A.. Additional links and resources by Marcus
are available by clicking
here.
GAO: Fed Data Mining Extensive
GAO: Fed Data Mining Extensive
05/28/2004 04:51 AMIn a new report, the investigative arm of the government finds that
data mining by federal agencies is ubiquitous. A watchdog group offers
a second report suggesting ways to protect privacy. By Kim Zetter.
Tantek goes a data mining
Tantek goes a data mining
06/25/2004 04:01 AM"There's GOLD in them thar blogs!" Tantek Celik was overheard
yelling at the dinner party two night ago. "I'm a going a data
mining..."
As Scoble tells it.....
It's public now, so wanted to let everyone know that Tantek Çelik
is leaving Microsoft and going to work for Technorati. This is a loss
for Microsoft and a major win for Technorati. Tantek did the rendering
engine for IE on the Mac, represented Microsoft on the W3C, among
other things, during his tenure at Microsoft (some of his more recent
work won't be seen until Longhorn ships).
I learned it earlier this week (Tantek IM'd me). Tantek is one of
those developers that you just wish you could bottle up and duplicate.
I'm happy for him and Technorati, though.
Is the weblogging and RSS world picking up steam? Can you feel it
yet?
[Scobeliz
er]
EPIC on data mining
EPIC on data mining
05/27/2004 07:38 PMp2pnet.net May 27 2004 11:07PM GMT
Data Mining For the Masses
Data Mining For the Masses
07/29/2004 08:17 PMA proposed data mining spec gets the green light for J2EE-compliant
application servers.
EU data mining hacks available for U.S.
EU data mining hacks available for U.S.
09/04/2004 06:57 AMSQL Server Data Mining Programmability
SQL Server Data Mining Programmability
04/14/2005 01:40 AMWith Microsoft SQL Server 2005, the collection of statistics
techniques and machine learning algorithms generically known as data
mining is brought up to a new level. The most important change is
that, in SQL Server 2005, data mining changes its target audience.
Besides being a scientific lab instrument, addressing a limited number
of highly skilled individuals, SQL Server Data Mining now gains
ubiquity as a handy tool for developers, ready to be used in a wide
range of applications. Most applications, from spreadsheets to
Internet games, from peer-to-peer communication systems to application
servers, share one common characteristic: they have to process data.
In doing this, they use certain standard APIs for accessing data. With
SQL Server 2005 Data Mining, the same kind of APIs can be used to
embed machine intelligence in the data processing systems.
Data Mining by Government Rampant
Data Mining by Government Rampant
05/28/2004 12:46 PMThe General Accounting Office,
Congress' investigative arm, has issued a new report showing that
government agencies are using data-mining in all kinds of ways and
without any serious scrutiny for abuse. It's all in the name of
protecting against terrorism, but the wider use of these techniques is
blatantly obvious, and dangerous.
And this report doesn't have anything about what the CIA and NSA are
doing, because they didn't bother to respond to Congress on this
question. Secrecy rules if you're the government. Privacy is dying if
you're not the government. This is a dangerous imbalance.
Government data-mining lives on
Government data-mining lives on
06/01/2004 08:55 AMU.S. data mining remains unchecked
U.S. data mining remains unchecked
05/31/2004 11:15 AMSQL, Data Mining, & Genetic Programming
SQL, Data Mining, & Genetic Programming
04/23/2004 11:06 PMDDJ Apr 24 2004 2:20AM GMT
DataFerrett - Data Mining Tool
DataFerrett - Data Mining Tool
06/22/2005 02:48 AM
DataFerrett - Data Mining Toolhttp://dataferrett.census.gov/
a>
DataFerrett is a data mining tool that accesses data
stored in TheDataWeb through the internet. DataFerrett can be
installed as an application on your desktop or use a java applet with
an internet browser. DataFerrett is compatible with Windows operating
systems: 95, 98, 2000, NT, ME and XP. DataFerrett is a unique data
mining and extraction tool. DataFerrett allows you to select a
databasket full of variables and then recode those variables as you
need. You can then develop and customize tables. Selecting your
results in your table you can create a chart or graph for a visual
presentation into an html page. Save your data in the databasket and
save your table for continued reuse. DataFerrett helps you locate and
retrieve the data you need across the Internet to your desktop or
system, regardless of where the data resides. This has been added to
Data Mining
Resources Subject Tracerâ„¢ Information Blog. This has been added
to
Knowledge
Discovery Resources Subject Tracerâ„¢ Information Blog. This has
been added to
Statistics Resources
Subject Tracerâ„¢ Information Blog. This has been added to the
tools section of
Research
Resources Subject Tracerâ„¢ Information Blog.
GAO Studies U.S. Government Data Mining
GAO Studies U.S. Government Data Mining
05/28/2004 10:50 AMDatabase developer/Data mining
programmer
Database developer/Data mining
programmer
08/09/2004 12:53 PM - United States, DC, Washington DC (2004-08-09)
Old School Data Mining, Maritime Style?
Old School Data Mining, Maritime Style?
12/29/2003 02:58 PMArcSight injects data mining into
security
ArcSight injects data mining into
security
05/25/2004 06:01 PMArcSight this week detailed a new software product, TruThreat
Discovery, that combines data mining technology with security to more
effectively evaluate security threats.
Panel Seeks Protections From Data Mining
Panel Seeks Protections From Data Mining
06/27/2004 07:25 PMAP via Los Angeles Times Jun 27 2004 11:09PM GMT
NSF, Intelligence Community Work on
Data-Mining Research
NSF, Intelligence Community Work on
Data-Mining Research
03/20/2003 01:05 PMPrompted by homeland security issues, the U.S. intelligence community
and the National
Science Foundation are researching innovative data-mining techniques
designed primarily
to aid law enforcement agencies at various levels.
Survey finds U.S. agencies engaged in
data mining
Survey finds U.S. agencies engaged in
data mining
05/27/2004 10:45 AMCNET May 27 2004 1:28PM GMT
Data Mining Algorithms: Microsoft SQL
Server 2000
Data Mining Algorithms: Microsoft SQL
Server 2000
09/13/2004 08:34 PMData-mining for terrorists sparks U.S.
privacy fears
Data-mining for terrorists sparks U.S.
privacy fears
05/20/2004 09:51 PMCNET May 21 2004 1:36AM GMT
GAO Report Reveals Rampant Federal Data
Mining
GAO Report Reveals Rampant Federal Data
Mining
05/27/2004 09:11 PMAlthough Congress put an end to the Pentagon's Terrorism Information
Awareness project, a GAO report shows that nearly 200 data mining
initiatives are under way or in the works.
Using Data Mining to Discover Web-Based
Scholarly Research Works
Using Data Mining to Discover Web-Based
Scholarly Research Works
12/22/2004 01:18 AMBibliomining for Automated Collection Development in a Digital
Library Setting: Using Data Mining to Discover Web-Based Scholarly
Research Works by Dr. Scott Nicholson
http://dlist.sir
.arizona.edu/archive/00000625/
http://www.BiblioMining.com/
<
br />
Abstract:
This research creates an
intelligent agent for automated collection development in a digital
library setting. It uses a predictive model based on facets of each
Web page to select scholarly works. The criteria came from the
academic library selection literature, and a Delphi study was used to
refine the list to 41 criteria. A Perl program was designed to analyze
a Web page for each criterion and applied to a large collection of
scholarly and non-scholarly Web pages. Bibliomining, or data mining
for libraries, was then used to create different classification
models. Four techniques were used: logistic regression, non-parametric
discriminant analysis, classification trees, and neural networks.
Accuracy and return were used to judge the effectiveness of each model
on test datasets. In addition, a set of problematic pages that were
difficult to classify because of their similarity to scholarly
research was gathered and classified using the models. The resulting
models could be used in the selection process to automatically create
a digital library of Web-based scholarly research works. In addition,
the technique can be extended to create a digital library of any type
of structured electronic information. This has been added to
Data Mining Resources
Subject Tracer™ Information Blog.
Data mining firm names 'statistically
likely' terrorists
Data mining firm names 'statistically
likely' terrorists
05/21/2004 08:18 AMJuly 2004 Zillman Column - Data Mining
Resources on the Internet
July 2004 Zillman Column - Data Mining
Resources on the Internet
06/23/2004 12:19 PMJuly 2004 Zillman Column - Data Mining Resources on the
Internethttp://virtualprivatelibrary.blogspot.com/Data Mining
Resources.pdfhttp://www.zillmancolumns.com/
a>
The July 2004 Zillman Column is now available and is
titled Data Mining Resources on the Internet. This
July 2004 Zillman Column is a comprehensive listing of online data
mining sites and subject guides currently available on the Internet.
Download this excellent 12 page free .pdf document today and stay
current in the ever changing exciting data mining field!
Northwest Denies Knowledge of Secret
Gov't Data-Mining Study
Northwest Denies Knowledge of Secret
Gov't Data-Mining Study
01/19/2004 03:58 AMNorthwest Airlines said Sunday that an executive and a company
spokesman were not aware of the company's role in a secret government
study when they denied that the airline gave away passenger
information.
CFP 2004: Data mining allowing insurance
companies to do high-tech redlining
CFP 2004: Data mining allowing insurance
companies to do high-tech redlining
04/21/2004 02:20 PMBirny Birnbaum, Executive Directorfor the Center for Economic Justice
in Texas, just gave an astounding presentation at CFP 2004 about how
insurance companies are using data mining to do "high-tech redlining,"
denying coverage or charging excess rates for insurance when...
Northwest Gave Passenger Info to Secret
NASA Data-Mining Study
Northwest Gave Passenger Info to Secret
NASA Data-Mining Study
01/19/2004 03:58 AMNorthwest Airlines gave information on passengers to the federal
government for a secret air-security project after the Sept. 11
terrorist attacks, the airline said Saturday.
Grok Description matches for Text Mining Begins Digging Through Unstructured Data
GrokA matches for Text Mining Begins Digging Through Unstructured Data
Text Mining Begins Digging Through Unstructured Data