Resources
LaTex Templates and Publisher Policies
ACM Conferences
IEEE
Springer
Publisher's copyright and archiving policies.
UC eScholarship repository archive hints.
Mailing Lists
DDLBETA
: An invitation-only mailing list for discussions of text classification, text mining, and related issues.
SIG-IRList
: a moderated regular IR news source.
DBWorld
: mailing list intended for messages of interest to the database research community.
KDNuggets
: bi-weekly electronic newsletter focusing on Data Mining and Knowledge Discovery.
Software Documents and Manuals
Version Control with Subversion
GNU Octave documentation
Links to open source mathematical programs
Linux Advanced Routing & Traffic Control HOWTO
GNU Wget Manual
: wget was part of my web crawler and gave me lots of pitfalls plus many nights of debugging.
Documentation of Apache HTTP Server
FreeBSD Handbook
: FreeBSD is among the early IPv6 adopters. I use it to crawl IPv6 web sites.
APT HOWTO
: APT is the most rapid, practical, and efficient package management system, much better than RPM.
Configuration of vsftpd
: Probably the most secure and fastest FTP server for UNIX-like systems.
Oracle Berkeley DB Documentation
: Every time I need a lightweight built-in database support, I go to Berkeley DB.
Gnuplot: Not So Frequently Asked Questions
.
Prof. Matloff's DDD tutorial
: An introduction to the secret art of debugging. Shows you the basic principles, leading you step-by-step in applying those principles to debugging two sample programs.
Libconfuse
and
libconfig
: Libraries for parsing configuration files.
NIST Digital Library of Mathematical Functions
Programming Guides
Is Parallel Programming Hard, And, If So, What Can You Do About It?
.
The C programming language
, and
answers to its exercises
.
Dictionary of Algorithms and Data Structures
Beej's Guide to Network Programming and Unix Interprocess Communication
A Guide to Perl(in Chinese)
Perl regular expressions
: This page describes the syntax of regular expressions in Perl.
Statistics with R
Python Tutorial
Suggestion Collections
The Elements of Style
Writing with Sources
Guide to grammar and writing
Guide to Grammar and Style
Report Dos and Don'ts
Time Management Talk
, by Randy Pausch.
How To Ask Questions The Smart Way
Useful Things to Know About Ph. D. Thesis Research
The Ph.D experience
, by
Mihir Bellare
.
You and Your Research
, Richard Hamming.
Crawlers
Heritrix
: A web crawler written in Java.
Larbin
: Another web crawler which is written in c.
Sitemap
: A crawling protocol observed by Google, Yahoo! and MSN goes beyond robots.txt.
"It is a very sad thing that nowadays there is so little useless information." -- Oscar Wilde
Back to
Shaozhi's Homepage
Comments
_displayNameOrEmail_
- _time_ -
Remove
_text_