stargeek
PHP news website logo.
home    PHP scripts    articles    seo tools    links    search    contact    shop    realtors


Taxonomies Tackle Content Classification







Taxonomies Tackle Content Classification

Taxonomies Tackle Content Classification 05/09/2004 07:18 AM

Taxonomies Tackle Content Classification
http://www.transformmag.com/enterprise/showArticle .jhtml?articleID=19200201&pgno=1

Finding the information you need is a daunting challenge that consumes about one-third of the typical workday. There is so much information, so many contexts in which documents may be relevant, and so many different file formats, from Office documents to graphics to PDFs. A single business document may cover hundreds, even thousands of subjects, have many authors, and have been created in different contexts for a variety of audiences. Enterprise content management (ECM) systems try to centralize content and enforce the assignment of metadata to simplify the task of finding information. Although ECM systems can bring order to chaos, you'll get more accurate and efficient information retrieval by planning your classification and taxonomy strategy. Creating a taxonomy is the process of classifying information and the associated metadata that further describes the information according to a logical system. There are several ways to create taxonomies, but most organizations build them manually, buy a pre-existing system or apply automated taxonomy/classification tools to their data. Each approach has advantages and disadvantages. Whichever you choose, advance planning is critical. Develop an information-management strategy, understand your organization's business needs and know what types of information your users require. After these step are complete, you can move on to creating a taxonomy using one of the approaches discussed above.




This is a GrokNews Entry: (what is grok?)





Similar Items

Taxonomies Tackle Content Classification

Grok Headline matches for Taxonomies Tackle Content Classification

Spam Blockers Tackle Broader
Content-Security Issues


Spam Blockers Tackle Broader
Content-Security Issues
08/30/2004 06:31 AM
FrontBridge, others rev filtering tools.

Of Taxonomies and Crumbtrails


Of Taxonomies and Crumbtrails 08/15/2004 03:33 PM

I've had an eternal struggle with taxonomies and crumbtrails and I'll share it with you now in the hopes of find some resolution that will let me sleep. (Okay, it's not THAT bad, but I have been tossing this around for days now with no solution.)

A taxonomy is a parent-child classification system. Most every site has one whether it was planned or not. I work for a commercial real estate firm, and we have a simple taxonomy, some of which looks like this:


Home
  Property
    Office
    Industrial
    Land
    Retail
    Investment

For the most part, this works fine. It's simple, and it makes sense.

Taxonomies also lend themselves nicely to crumbtrail navigation. If I'm looking at a property in the Office category, I can get a crumbtrail like this:


Home > Property > Office

However, there are situations that require a piece of property to fit into more than one category. For instance, there are many buildings that can legitimately be used for both office and retail. Therefore, the property ends to appear under both categories because people browsing either would be interested in it. This is no problem, as taxonomies are supposed to be able to do this.

But what about the crumbtrail? If I'm looking at a property that appears in both Office and Retail, which crumbtrail do I get:



Home > Property > Office
Home > Property > Retail

I can think of two things:

Primary and Secondary Classification
Pick one "true" classification for the property. Just make an arbitrary decision if its Office and Retail and classify it as such. Let it appear in the other category as well, but the crumbtrail should reflect its "true" classification.

A couple problems here:

(a) Maybe some legitimately fits equally in two places. Say one side of taxonomy classifies by property type (Office, Retail, etc.) and anothert by size (less than 5,000 sq. ft.; 5,000 - 20,000 sq. ft., more than 20,000 sq. ft.). No matter how hard to you try, any property is going to fit in more than one category.

(b) If someone browses to a category from a "secondary" trail, they're going to be confused because the crumtrail doesn't reflect where they came from. For instance, say I give a building a classification of Office but also let it appear as Retail. If someone browses to the property through the Retail trail, then tries to walk back up the trail, they're going to be sent back to Office, instead of Retail where they came from.

Dynamic Crumbtrails
You can always create the crumbtrail based on the trail the use came from. So if a user browses to our property through the Retail trail, display a trail based on that. If they came from Office, display that crumbtrail.

This seems good, but what if the user didn't browse and was linked directly? Then what do you use?


So, there you have my quandry. If anyone has a resolution or thoughts, let's hear them.

Click here to comment on this entry


Playing with Taxonomies


Playing with Taxonomies 12/19/2004 03:25 PM
Playing with Taxonomies
http://www.econtentmag.com/Articles/ArticleReader .aspx?ArticleID=7357&CategoryID=21

Taxonomies are critical to good information systems. But can non-librarians develop effective taxonomies? Several Web sites now being built use socially developed taxonomies. Del.icio.us and Flickr both attract large groups of people describing their content in a way that they all can share. Del.icio.us lets participants share Web bookmarks; Flickr offers online photo sharing. Finding information on either site demands some agreed-upon, dynamic way of classifying content, and changing that classification as the content grows exponentially. Feedback is immediate; you see whether others agree (use) or disagree (don't use) your tags. Stewart Butterfield of Ludicorp, developer of Flickr, thinks this user-driven approach has advantages. "If you can hire enough excellent librarians, you will get better keyword results than with social approaches. However, as the content grows, tagging (and retagging) becomes an order of magnitude more difficult. In other words, social approaches are 80% as good as and 10 times easier than top-down approaches." Would Flickr's approach work in the buttoned-down corporate world? Butterfield says, "Anticipate resistance in the CIO crowd who don't want to risk losing control in a social self-correcting process and do not want anything to get lost."

Are Taxonomies Dead?


Are Taxonomies Dead? 01/09/2004 09:58 PM

The taxonomy was always supposed to be the be-all and end-all of information architecture. A good, solid category structure was how all the information in an enterprise was supposed to fit together.

But they're harder to build than you think. There are shades of gray and complications. You need related categories so people can jump from branch to branch; you can slice information so many different ways; who can agree where something fits, etc. I've tried to build a half-dozen, but I can't point to any major successes.

Is the ideal of taxonomy possible? Or is it just better to invest in a good search engine? Think about it, when you visit a site, do you ever browse a taxonomy, or do you just go right to search? If you're looking for something you've seen on this site, do you wade through the category list, or just hit the search engine?

When was the last time you actually browsed Yahoo! or DMOZ? I know they're there, but I haven't visited them in ages. Last time I did visit, what was the first thing I did? That's right — typed something into the search box.

Search is a lazy man's taxonomy. It's not as organized or structured as a taxonomy, but human beings — imperfect creatures than we are — tend to settle to what's easier. So, as an information architect, do you stand on principle, or do you cater to the lazy way your users are going to look for information?

This comes from my current infatuation with wikis. There is no categorizing of pages in wikis (even after my railin g against all their shortcomings a few months ago), there's just search and linking between pages. But the search is good, and it always seems to work. Same with the search on this site — when I'm looking for a previous post, it just always seems to work, and that search is nothing but a SQL "LIKE" query, the dumbest search of all.

So, are taxonomies an ideal that just don't survive the reality test?

Click here to comment on this entry


Stanford: Taxonomies of Love


Stanford: Taxonomies of Love 04/04/2005 04:45 AM
Ever since my little affair (no link), I have become considerably less interested in relationships. Indeed, I’ve been puzzled at…

Extended Faceted Taxonomies for Web
Catalogs


Extended Faceted Taxonomies for Web
Catalogs
12/02/2002 07:12 AM

sharing photoshop album taxonomies


sharing photoshop album taxonomies 05/24/2004 03:50 PM
the fascinating thing here is how album uses metatagging instead of folders

Features: Formal Taxonomies for the U.S.
Government


Features: Formal Taxonomies for the U.S.
Government
02/01/2005 08:52 PM
Mike Daconta, Metadata Program Manager at the Department of Homeland Security, introduces the notion of a formal taxonomy in the context of the Federal Enteriprise Architecture's Data Reference Model.

Stanford: Taxonomies of Love (Aaron
Swartz: The Webl0g)


Stanford: Taxonomies of Love (Aaron
Swartz: The Webl0g)
04/17/2005 10:05 PM
Stanford- Taxonomies of Love (Aaron Swartz: The Weblog)

aaronsw.com/weblog/001661
track this site | 2 links


Meta-classification


Meta-classification 08/28/2004 11:51 AM
At BiologyBrowser.org you'll find the following branch of their tree: Home > Subject > Systematics > Taxonomy > Classification...

Classification of Computers


Classification of Computers 07/13/2004 01:35 AM
WebDevInfo Jul 13 2004 5:28AM GMT

Defying classification


Defying classification 01/23/2004 02:21 PM

What is Planet ApacheWhat is Technorati?  These questions fascinate me.

Mailing list vs comments as a venue for dialog?  To me, that's like comparing board meetings vs pubs for as venues for making business decisions.

I am told that using information at the IP layer to affect behaviour two layers up isn't going to work?  And yet I run firewall programs filter outgoing connections based on the path of the main program which issued the request.  Despite coming up with "perl.exe" for a fair number of commands, it does seem to catch a lot of spyware.

Classifications are so impossible, yet so darn useful.


Threat Classification


Threat Classification 07/28/2004 04:34 PM

How to Make a Faceted Classification and
Put It On the Web


How to Make a Faceted Classification and
Put It On the Web
01/06/2004 08:06 AM

miskatonic.org/library/facet-web-howto.html
track this site | 5 links


Faceted Classification bibliography


Faceted Classification bibliography 10/30/2003 11:31 PM
William Denton created Putting Facets on the Web: An Annotated Bibliography: a classified, annotated bibliography about how to design faceted classification systems and make them usable on the World Wide Web. (Yes, the facet goodness is back on!)...

Sorting things out: Classification and
its consequences


Sorting things out: Classification and
its consequences
11/14/2003 11:26 AM

POPFile - Automatic Email Classification


POPFile - Automatic Email Classification 10/31/2003 10:37 AM
My Usenix LISA talk

WASC Releases Web Security Threat
Classification


WASC Releases Web Security Threat
Classification
07/28/2004 04:49 PM
Jeremiah Grossman (Jul 28 2004)

Information Architecture: Using Card
Sorting for Web Classification Design


Information Architecture: Using Card
Sorting for Web Classification Design
09/23/2002 05:28 AM

Folksonomies - Cooperative
Classification and Communication
Through Shared Metadata


Folksonomies - Cooperative
Classification and Communication
Through Shared Metadata
01/03/2005 10:00 AM
"Folksonomies - Cooperative Classification and Communication Through Shared Metadata" .. folksonomies .. del.icio.us

adammathes.com/academic/computer-mediated-communication/ folksonomies.html
track this site | 3 links


Adobe Jumps The Shark, Reader's
Classification As Spyware Not Far Behind


Adobe Jumps The Shark, Reader's
Classification As Spyware Not Far Behind
03/14/2005 06:07 PM

I'm not working from a Windows box as much as I used to these days, so maybe I'm a little too blissfully isolated from the sheer volume of crap that most of you put up with from your computer each day.

Tonight, though, I was working from my wife's PC at home, and I fired up Adobe Reader (The Reader Formerly Known As Acrobat) to look at a PDF. I've become accustomed to the annoying popup window in the last few versions that remind you of "important updates", and I saw this:

Abode
Update Screenshot

What the crap, man?! Since when is Photoshop Album an "Update" to Reader?! And Yahoo Toolbar?! They've co-opted the at least (dubiously) useful update program to shill their crap.

Don't get me wrong. I think Photoshop Album is probably the best photo management program available today. In fact, I already have the full version installed, which makes the update suggestion even more idiotic.

It's insane to dilute the interface with this sort of garbage. I will now dismiss anything that ever appears in the update list as marketing BS, which means that the update system is now broken because I'll never use it. Even if it wasn't, the popup window was a horrible way to notify for updates anyway. Firefox's subtle icon in the toolbar is a far better way to handle this.

As an app, Reader should be basically invisible. The ultimate goal of the app should be that, whenever I click on a PDF, it starts instantly and immediately displays me that PDF in a manner that's optimal for onscreen reading, and provides just a few simple tools that I might need. Anything else is serving Adobe, not me, and that makes the app less valuable. The splash screen is lame, and the popup update notice is asinine. Adding other apps to the software update notice is "Pulling a RealPlayer".


Information Architecture: Carrying out a
Classification Situation Analysis


Information Architecture: Carrying out a
Classification Situation Analysis
09/09/2002 06:29 AM

Experimental Mozilla Thunderbird Build
with Improved Junk Mail Classification
Available


Experimental Mozilla Thunderbird Build
with Improved Junk Mail Classification
Available
02/17/2004 01:25 AM

Hot Banana Wins 2005 e-Content Award -
Best Content Management System - CMS


Hot Banana Wins 2005 e-Content Award -
Best Content Management System - CMS
04/08/2005 04:55 AM
Hot Banana Software Inc., a leading North American Web Content Management Suite (CMS) company, announced today that it has won the 2005 e-Content award for the best Content Management System. The Canadian e-Content Awards are sponsored by the e-Content Institute and were created to recognize and honor e-content products and services used by Canadian organizations and individuals. [PRWEB Apr 8, 2005]

The Difference Between Online Content
And Broadcast Content


The Difference Between Online Content
And Broadcast Content
02/10/2004 02:46 PM
Major League Baseball made news last year for claim ing to own all in-progress game data - saying they were going to go after websites that reported what was happening at a game in real-time. It didn't matter that the law is pretty clear that you can't copyright facts - MLB believes that just presenting the data is a "rebroadcast" of the game. That said, I guess it's no surprise to hear that they now believe that web audio and video broadcasts of games should work the same way as television broadcasts with a content provider paying a huge upfront fee for the rights to the games, and then telling them they can make it back in ad revenue and subscription fees. Of course, the various internet sites they've approached with this plan have been laughing them out the door, and pointing out that they're not television stations, and they just want to provide something useful to their users - but aren't going to lose money to do so. While MLB has been at the forefront of offering streaming video and audio, it appears they still look on this as a broadcast medium, and not the interactive medium it actually is. They're doing their best to squeeze more money out of existing fans, rather than attract new fans, which is dangerous for the future of the sport. Not only do you anger your biggest fans, you also make it less likely that you're going to pick up new fans.

Usenet Content Up For Grabs On Content
Hungry Web


Usenet Content Up For Grabs On Content
Hungry Web
12/19/2004 03:08 PM
The age old question of copyright and Usenet comes up again.

The C# Programming Techniques Content
Area of Premium Content Aggregator
Braintique.com, www.braintique.com, is
Now Open


The C# Programming Techniques Content
Area of Premium Content Aggregator
Braintique.com, www.braintique.com, is
Now Open
02/01/2005 09:17 PM
C# Programming Techniques features articles, tips, techniques, and source code created by well-known author and programmer Harold Davis. Davis is the author of more than twenty books about programming and technology, including most recently Building Research Tools with Google For Dummies published by John Wiley. [PRWEB Jan 30, 2005]

Sun, Sybase, IBM Tackle RFID


Sun, Sybase, IBM Tackle RFID 08/16/2004 12:25 PM
Sun, Sybase and IBM are developing middleware in a contest for leadership in RFID management.

Drive to tackle MoT backlog


Drive to tackle MoT backlog 08/12/2004 02:46 AM
MoT centre staff prepare to tackle a backlog of vehicle tests as they return to work after three months on strike.

New bid to tackle superbug rates


New bid to tackle superbug rates 07/10/2004 10:13 PM
The government is set to introduce new measures to tackle hospital superbugs.

Use your PC to tackle financial problems


Use your PC to tackle financial problems 10/31/2003 10:39 PM
Scotsman Online Oct 31 2003 9:17PM ET

Centre to tackle net paedophiles


Centre to tackle net paedophiles 03/31/2005 09:53 PM
Police, customs officers and child welfare experts will join a unit to help protect children from online paedophiles.

Firms tackle web abuse


Firms tackle web abuse 11/14/2003 06:55 AM
vnunet.com Nov 14 2003 5:53AM ET

New tackle for amateur phishers


New tackle for amateur phishers 08/20/2004 04:43 PM
Sympatico Aug 20 2004 8:19PM GMT

Tackle wireless development


Tackle wireless development 05/23/2002 10:39 PM

UN talks tackle hunger problems


UN talks tackle hunger problems 04/15/2005 05:11 AM
News.bbc.co.uk - Thu Apr 14, 09:52 am GMT

Techies tackle Billy Bass


Techies tackle Billy Bass 12/04/2003 01:16 AM
iafrica.com Dec 4 2003 0:18AM ET

Strategy to tackle domestic abuse


Strategy to tackle domestic abuse 03/30/2005 02:01 AM
The Welsh Assembly Government unveils a three-year plan with £1.1m funding to tackle domestic abuse.

Tech Summit to Tackle Net Issues


Tech Summit to Tackle Net Issues 11/10/2003 11:14 PM
World leaders from 56 nations will attend the first U.N. confab on information technology to discuss such questions as who governs the Internet and free speech guidelines for the global medium. President Bush is not among them.
Grok Description matches for Taxonomies Tackle Content Classification
GrokA matches for Taxonomies Tackle Content Classification

Taxonomies Tackle Content Classification

The following phrases have been identified by the grok system as matching this entry:

















Also check out:


Grok

Ipod Porn on the
Rise

Brief Abstract of
Wikipedia's
Mesothelioma Cancer
page

Get first aid
instructions in your
cell phone

IE is crap
JSPWiki gains
podcasting support

Health InterNetwork
Open Source Resource
Center

Small Business
Online Resources

Public Library
Personal Investing
Resources

Google's IPO
German newspaper
iPod/torture mashup

ageproject :: Home
UMLGraph 2.0
Convulsion 0.0.1
E-Keyboard 0.1
QT RssApp 0.0.1
UUID 1.7.1
LaTeX Indexbuild 1.0
awEventMgr 040420
asm2class 0.0.6
XUL PHP libs 0.2
sYnergy Groupware
1.0.6

Threat of 'Dirty
Bomb' Growing,
Officials Say (Los
Angeles Times)

Two Foreigners
Stoned to Death in
Afghanistan
(Reuters)

Sharon Cancels U.S.
Trip After Gaza Plan
Setback (Reuters)

Putin Says
Chechnya's Kadyrov
Dead (Reuters)

Pacers Crush Heat
91-80 to Take 2-0
Lead (AP)

Open Borges
Labour peer urges
Blair to quit

AM warns of more
fuel protests

Cockling victims are
remembered

Trawlers 'caught' in
Canada's net

Officials: Teen
admitted creating
computer worm

German teen arrested
for hatching
'Sasser' bug

Sasser worm author
arrested in Germany

50 years later,
milers run on
computers

Microsoft names new
General Manager for
North Gulf

Movable Type/Safari
Bug

It's really bad in
Iraq

Teen Confesses to
Creating
'Sasser'
Worm (AP)

Utah Gov. Out of
Running for
Nomination (AP)

MTV spot
Kathryn Cramer:
Halliburton Pulling
the Plug on GI
Communications

Wired News:
Academics Patent P2P
Spoofing

David Von Drehle
Halls of Ivy May
Receive Miracle-Gro
in Germany

Chechen President
Injured in Explosion
at Stadium

Chechnya blast: Your
views

Security tightened
up at Cannes

M&S 'starts search
for new boss'

Ceremony to mark
leader's death

Children escape
house fire

Football: Hero's
send off for Smith

Football: Van Gaal
Blues link

Obese 'cheated' by
NHS targets

what is grok?