Vamsi Pavan’s Place

When curiosity outbursts …..

Entries Tagged as 'IR/Data Mining'

Online links for eBooks

October 6th, 2006 · No Comments · Gen, IR/Data Mining

free ebook downloads
flazx.com
holyplanets.com
http://ftp.anyhost.ru/books/www.krf.bsu.by/
Placement questions
http://www.cracktheinterview.com/adfaqpublish.html
LaTeX
Using TTF fonts with LaTeX
http://www.radamir.com/tex/ttf-tex.htm
Online C reference
Programming in C UNIX System Calls and Subroutines using C.
http://www.cs.cf.ac.uk/Dave/C/
Probability Theory: The Logic of Science
Statistical Natural Language Processing: Reading list
Statistical Data Mining Tutorials by Andrew Moore from CMU
Graphical Models Reading group
Information Theory (a short and good one; gives enough information about entropy and different kinds of […]


English corpus

October 6th, 2006 · No Comments · IR/Data Mining

Vocab size


Limit robots' actions on a specific page

October 6th, 2006 · No Comments · IR/Data Mining

Yahoo robots guidance, from http://help.yahoo.com/help/us/ysearch/basics/basics-10.html
If you run a web site and do not want your content to be accessible through the cache, you can use the NOARCHIVE meta-tag. Place this in the <HEAD> section of your documents:
<META NAME="ROBOTS" CONTENT="NOARCHIVE">
This tag will tell robots not to archive the page. Our crawler will continue to index and […]
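
To check whether a page actually carries this directive, a small Python sketch using only the standard library can parse the HTML and collect whatever appears in its robots meta tag. The class name RobotsMetaParser, the helper robots_directives, and the sample page are made up for illustration; this is not how Yahoo's crawler works, only one way to inspect your own pages.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <META NAME=...> matches too.
        if tag != "meta":
            return
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() != "robots":
            return
        # The content attribute is a comma-separated list of directives.
        for directive in attr_map.get("content", "").split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def robots_directives(html_text):
    """Return the set of robots directives found in the given HTML text."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    return parser.directives


# Hypothetical sample page carrying the tag quoted above.
page = """<html><head>
<META NAME="ROBOTS" CONTENT="NOARCHIVE">
<title>Example</title>
</head><body>Hello</body></html>"""

print("noarchive" in robots_directives(page))
```

The comparison is case-insensitive on purpose, since the tag is commonly written in either upper or lower case.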


Similarity measures, text clustering or text classification

October 6th, 2006 · No Comments · IR/Data Mining

If X and Y represent the document vectors, then their similarity can be measured using different similarity measures.
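
As a minimal sketch of what such measures look like in code, the Python below computes three commonly used ones for two equal-length term-weight vectors: cosine similarity, Jaccard similarity, and Euclidean distance. The choice of these three, the function names, and the sample vectors are assumptions for illustration; the post itself does not say which measures it has in mind.

```python
import math


def cosine_similarity(x, y):
    """Cosine of the angle between x and y; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0 or norm_y == 0:
        return 0.0
    return dot / (norm_x * norm_y)


def jaccard_similarity(x, y):
    """Jaccard coefficient over the sets of terms with nonzero weight."""
    terms_x = {i for i, a in enumerate(x) if a != 0}
    terms_y = {i for i, b in enumerate(y) if b != 0}
    union = terms_x | terms_y
    if not union:
        return 0.0
    return len(terms_x & terms_y) / len(union)


def euclidean_distance(x, y):
    """Straight-line distance between x and y (a distance, so smaller means more similar)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


# Hypothetical term-count vectors for two documents over a four-term vocabulary.
X = [2, 1, 0, 1]
Y = [1, 1, 1, 0]

print("cosine:   ", cosine_similarity(X, Y))
print("jaccard:  ", jaccard_similarity(X, Y))
print("euclidean:", euclidean_distance(X, Y))
```

Cosine similarity ignores document length, which is often why it is preferred for text clustering and classification, while Euclidean distance is sensitive to it unless the vectors are normalized first.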
