Welcome to the Data Science Lab at Stony Brook!

Big Data makes for big and interesting problems!  Our lab focuses on analyzing large-scale text streams such as news, blogs, and social media to identify cultural trends around the world's people, places, and things.

Our research covers a range of topics in natural language processing.  A current focus is using Deep Learning techniques to build concise representations of the meanings of words in all significant languages, and use these powerful features to recognize entities and measure sentiment and other properties of texts.

Another focus involves analyzing Wikipedia to identify the fame and significance of historical figures as reported in our book Who's Bigger? and associated website.

Our Lydia technology has been licensed by General Sentiment, a social media analysis startup.

Recent News/Blogs

  • Three Ph.D. students graduate The Data Science Lab wishes a fond farewell to our three graduating Ph.D. students who are moving on in the world after five years at Stony Brook.  From left ...
    Posted May 22, 2015, 1:40 PM by Steven Skiena
  • Linguistic Shift Paper at WWW Lead author Vivek Kulkarni presented our paper Statistically Significant Detection of Linguistic Change today at the prestigious World Wide Web conference (WWW 2015) in Milan, Italy.   This paper has been ...
    Posted May 22, 2015, 1:27 PM by Steven Skiena
  • Best Paper Award Our lab's collaborations with social scientists have been recognized by the American Sociological Association's 2014 Clifford Geertz Prize, for the best article in the sociology of culture ...
    Posted Sep 19, 2014, 6:38 AM by Steven Skiena
Showing posts 1 - 3 of 18. View more »