| Overview | SIMS 202 Information Organization and Retrieval
Assignment 4 Assigned 9/14. Due 9/21.
Readings: Read
Clark & Clark, Fellbaum and Miller (WordNet), Bates.
The goal of this assignment is to give you hands-on experience with a
large database of lexical relations (WordNet) and to further explore
the idea of hierarchical versus faceted organization of categories.
WordNet
(2)
This is an exploratory exercise, and there is no "correct" answer!
Don't spend too long, not more than an hour. (Idea for question from
Philip Resnik.)
2. break a bone
Try to do this without a dictionary if you can, but if you're not a native
speaker of English, use a dictionary if you need to.
(b) Go to one of the web pages for using
WordNet online,
and look up
the verb senses for "break".
Which WordNet senses do your senses from part (a) match, if any? (One
of your senses might match more than one WordNet sense, of course.) For
example,
Sense 1: Matches WN senses 2,3,4,5
(c) Do any of your senses group naturally into a class with common elements
of meaning? How would you group them? (Use a hierarchy if that makes more
sense.)
(d) How do these variations in word meaning relate to our discussion
of centrality of category membership and characteristic features? (3) Comparing WordNet to other systems of organization.
Optionally, (just for fun) try to find a
really interesting relationship between two terms. My favorite is
doing a "Connection" search on "hearst" and "berkeley" (be sure to
click on More Paths).
Hierarchical Classification vs. Facets
(5) Yahoo! (a catalog service for the World Wide Web) employs
human categorizers to assign selected web pages to categories. A (variation
of a) small fragment of the Yahoo category system is shown here.
(Names surrounded by * indicate actual web pages, as opposed to categories
of web pages.)
(b) Convert the category system from its given structure (hierarchical
or faceted) to the other of these two kinds of structure. Show where each
of the categories and web pages appears in your converted structure. (You
may refer to line numbers rather than writing out the entire name of each
category to save time if you like. The numbers have no meaning except to
act as a quick way to refer to the categories.)
(c) Is this new arrangement better? Why or why not? |