Term Weighting and Ranking Algorithms

10/20/98



Table of Contents

Term Weighting and Ranking Algorithms

Review

Documents in 3D Space

Vector Space Model

Documents in Vector Space

Vector Space Documents and Queries

Similarity Measures

Text Clustering

Agglomerative Clustering

Agglomerative Clustering

Agglomerative Clustering

Automatic Class Assignment

PPT Slide

Today

Finding Out About

Ranking Algorithms

Structure of an IR System

PPT Slide

Vector Representation (revisited; see Salton article in Science)

Assigning Weights to Terms

Assigning Weights to Terms

Binary Weights

Raw Term Weights

Assigning Weights

tf x idf

Inverse Document Frequency

tf x idf normalization

Vector space similarity (use the weights to compare the documents)

Vector Space Similarity Measure (combine tf x idf into a similarity measure)

To Think About

Computing Similarity Scores

Computing a similarity score

Other Major Ranking Schemes

Other Major Ranking Schemes

Probabilistic Models

Probabilistic Models: Some Notation

Probabilistic Models

Probabilistic Models

Logistic Regression

Probabilistic Models: Logistic Regression

Logistic Regression

Probabilistic Models: Logistic Regression attributes

Probabilistic Models: Logistic Regression

Simplified Logistic Regression

Probabilistic Models

Vector and Probabilistic Models

Author: Ray R. Larson

Email: ray@sherlock.berkeley.edu

Home Page: http://sims.berkeley.edu/~ray
