Search this site
Embedded Files
yeshao's homepage
  • Home
  • More about me
  • Courses
  • Resources
yeshao's homepage
  • Home
  • More about me
  • Courses
  • Resources
  • More
    • Home
    • More about me
    • Courses
    • Resources

LaTex Templates and Publisher Policies

    • ACM Conferences
    • IEEE
    • Springer
    • Publisher's copyright and archiving policies.
    • UC eScholarship repository archive hints.

Mailing Lists

    • DDLBETA: An invitation-only mailing list for discussions of text classification, text mining, and related issues.
    • SIG-IRList: a moderated regular IR news source.
    • DBWorld: mailing list intended for messages of interest to the database research community.
    • KDNuggets: bi-weekly electronic newsletter focusing on Data Mining and Knowledge Discovery.

Software Documents and Manuals

    • Version Control with Subversion
    • GNU Octave documentation
    • Links to open source mathematical programs
    • Linux Advanced Routing & Traffic Control HOWTO
    • GNU Wget Manual: wget was part of my web crawler and gave me lots of pitfalls plus many nights of debugging.
    • Documentation of Apache HTTP Server
    • FreeBSD Handbook: FreeBSD is among the early IPv6 adopters. I use it to crawl IPv6 web sites.
    • APT HOWTO: APT is the most rapid, practical, and efficient package management system, much better than RPM.
    • Configuration of vsftpd: Probably the most secure and fastest FTP server for UNIX-like systems.
    • Oracle Berkeley DB Documentation: Every time I need a lightweight built-in database support, I go to Berkeley DB.
    • Gnuplot: Not So Frequently Asked Questions.
    • Prof. Matloff's DDD tutorial: An introduction to the secret art of debugging. Shows you the basic principles, leading you step-by-step in applying those principles to debugging two sample programs.
    • Libconfuse and libconfig: Libraries for parsing configuration files.
    • NIST Digital Library of Mathematical Functions

Programming Guides

    • Is Parallel Programming Hard, And, If So, What Can You Do About It? .
    • The C programming language, and answers to its exercises.
    • Dictionary of Algorithms and Data Structures
    • Beej's Guide to Network Programming and Unix Interprocess Communication
    • A Guide to Perl(in Chinese)
    • Perl regular expressions: This page describes the syntax of regular expressions in Perl.
    • Statistics with R
    • Python Tutorial

Suggestion Collections

    • The Elements of Style
    • Writing with Sources
    • Guide to grammar and writing
    • Guide to Grammar and Style
    • Report Dos and Don'ts
    • Time Management Talk, by Randy Pausch.
    • How To Ask Questions The Smart Way
    • Useful Things to Know About Ph. D. Thesis Research
    • The Ph.D experience, by Mihir Bellare.
    • You and Your Research, Richard Hamming.

Crawlers

    • Heritrix: A web crawler written in Java.
    • Larbin: Another web crawler which is written in c.
    • Sitemap: A crawling protocol observed by Google, Yahoo! and MSN goes beyond robots.txt.

"It is a very sad thing that nowadays there is so little useless information." -- Oscar Wilde

compass

Back to Shaozhi's Homepage

Google Sites
Report abuse
Page details
Page updated
Google Sites
Report abuse