Intrepid From NYIT              
 

Project at a Glance:

Microblogging has become a very popular communication tool among Internet users, and millions of people use it to share opinions on different aspects of their everyday lives. Twitter, the most popular microblogging platform, is a rich source of data for opinion mining and sentiment analysis. Some research has been devoted to this topic using Twitter; however, few studies have explored China's microblogging platform Sina MicroBlog, also known as Weibo. In our project, we focus on developing a tool that automatically collects public "tweets" from Weibo and uses the data processing techniques of the Hadoop platform (a leading large-scale data processing platform that enables parallel processing across commodity computers on a local network) to perform linguistic analysis of the collected data (Chinese characters) and to explain the discovered phenomena in general terms. By analyzing the collected data, we hope to reflect statistically how people respond to a specific topic within a given period of time, and to summarize the trends in that response with statistical tables (such as user information associated with tweets, or per-user tweet statistics) arranged along a timeline. Through a designed experimental process and evaluation, we expect to demonstrate that our proposed techniques are efficient and produce reliable analysis results that can be further developed and applied. In our project, we applied our techniques to Chinese posts in GBK and UTF-8 encodings. However, the proposed techniques can also be applied to online posts in other languages.
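As a rough illustration of the kind of per-user, per-day statistic described above, here is a minimal sketch in Python written in a map/reduce style. It is not the project's actual Hadoop job. The tab-separated record layout (`user_id`, `timestamp`, `text`) and the function names `decode_post`, `map_posts`, and `reduce_counts` are assumptions made for this example. Trying UTF-8 first and falling back to GBK is a simple heuristic, not a guaranteed encoding detector.

```python
from collections import Counter
from typing import Iterable, Iterator, Tuple

Key = Tuple[str, str]  # (user_id, day)


def decode_post(raw: bytes) -> str:
    """Decode a raw record as UTF-8, falling back to GBK (heuristic)."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: anything that is not valid UTF-8 is GBK-encoded.
        return raw.decode("gbk", errors="replace")


def map_posts(lines: Iterable[bytes]) -> Iterator[Tuple[Key, int]]:
    """Map step: emit ((user_id, day), 1) for every well-formed record."""
    for raw in lines:
        # Hypothetical layout: user_id <TAB> timestamp <TAB> text
        fields = decode_post(raw).rstrip("\n").split("\t", 2)
        if len(fields) != 3:
            continue  # skip malformed records
        user_id, timestamp, _text = fields
        day = timestamp[:10]  # assumes "YYYY-MM-DD hh:mm:ss" timestamps
        yield (user_id, day), 1


def reduce_counts(pairs: Iterable[Tuple[Key, int]]) -> Counter:
    """Reduce step: sum the counts for each (user_id, day) key."""
    totals: Counter = Counter()
    for key, value in pairs:
        totals[key] += value
    return totals


if __name__ == "__main__":
    sample = [
        "u1\t2011-04-14 10:00:00\t你好".encode("utf-8"),
        "u1\t2011-04-14 12:30:00\t天气".encode("gbk"),
        "u2\t2011-04-15 08:00:00\t微博".encode("utf-8"),
    ]
    for (user, day), count in sorted(reduce_counts(map_posts(sample)).items()):
        print(user, day, count)
```

On a real cluster the map and reduce steps would run as separate distributed tasks; here they are chained in one process only to keep the sketch self-contained.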


See our project slides at Prezi:
 

Rolling Updates

  • Ninth Week: EENG 491 Spring 2011, WEEKLY STATUS REPORT, Team Intrepid, Week ending (Thursday) 04/14/11, Report 9. Project Title: Cluster Computers & Distributed Computing. Group Leader: Hao Liu. Team Member 1 ...
    Posted May 10, 2011, 7:33 PM by KAI CHEN
  • Eighth Week: EENG 491 Spring 2011, WEEKLY STATUS REPORT, Team Intrepid, Week ending (Thursday) 03/31/11, Report 8. Project Title: Cluster Computers & Distributed Computing. Group Leader: Hao Liu. Team Member 1 ...
    Posted May 10, 2011, 7:31 PM by KAI CHEN
  • Seventh Week: EENG 491 Spring 2011, WEEKLY STATUS REPORT, Team Intrepid, Week ending (Thursday) 03/24/11, Report 7. Project Title: Cluster Computers & Distributed Computing. Group Leader: Hao Liu. Team Member 1 ...
    Posted May 10, 2011, 7:29 PM by KAI CHEN
  • Sixth Week: EENG 491 Spring 2011, WEEKLY STATUS REPORT, Team Intrepid, Week ending (Thursday) 03/17/11, Report 6. Project Title: Cluster Computers & Distributed Computing. Group Leader: Hao Liu. Team Member 1 ...
    Posted May 10, 2011, 7:27 PM by KAI CHEN
  • Fifth Week: EENG 491 Spring 2011, WEEKLY STATUS REPORT, Team Intrepid, Week ending (Thursday) 03/09/11, Report 5. Project Title: Cluster Computers & Distributed Computing. Group Leader: Hao Liu. Team Member 1 ...
    Posted May 10, 2011, 7:25 PM by KAI CHEN
Showing posts 1 - 5 of 15.

Recent Files

  • Slide for our Presentation   0k - May 10, 2011, 7:49 PM by KAI CHEN (v1)
    Slide for a Presentation
  • Team Intrepid -Final Report.docx   901k - May 10, 2011, 7:46 PM by KAI CHEN (v1)
  • weekly_report_Spring_2011 week 9.doc   49k - May 10, 2011, 7:37 PM by KAI CHEN (v1)
  • weekly_report_Spring_2011 week 8.doc   51k - May 10, 2011, 7:37 PM by KAI CHEN (v1)
  • weekly_report_Spring_2011 week 7.doc   50k - Apr 13, 2011, 10:00 PM by KAI CHEN (v1)
Showing 5 files from page Release 1.0.