Claudia Perlich

claudia@media6degrees.com

37 East 18th Street, 9th Floor, New York, NY 10003


Office: (646) 745 1574, Cell: (914) 409 5609

I am currently working as Chief Scientist at Media6Degrees. Media6Degrees is the pioneer in Social Targeting: we find the best prospective customers for your brand by targeting the people who cluster with your current customers around the web. Previously I was a member of the Predictive Modeling Group at IBM T.J. Watson Research Center. My research interests are in practical applications of Machine Learning and Data Mining approaches. I graduated in 2004 from the Information Systems Department at Stern, NYU, under the supervision of Foster Provost.


Awards and Recognitions

Winner KDD CUP 2009 Fast Challenge: “Fast Challenge for CRM”

Finalist in the INFORMS Edelman competition 2009: “Operations Research Improves Sales Force Productivity”

Winner KDD CUP 2008 Task 1 and Task 2: “Identifying Breast Cancer”

Winner INFORMS Data Mining Contest 2008: “Identifying Pneumonia Patients”

Winner KDD CUP 2007 Task 2: “Predicting movie popularity for NETFLIX”

Data Mining Practice Prize at KDD 2007: “Predictive modeling for marketing”

Winner ILP Challenge 2005: “Genetic classification”

Runner Up KDD CUP 2003 Task 1: “Predicting Citation Rates”

Selected Publications

Journal Papers

“On Cross-Validation and Stacking: Building Seemingly Predictive Models on Random Data”
Claudia Perlich, Grzegorz Swirszcz. SIGKDD Explorations 12(2) (2010) 11-15

“Social Media Analytics: The Next Generation of Analytics-Based Marketing Seeks Insights from Blogs”

R. Lawrence, P. Melville, C. Perlich, et al. Forthcoming ORMS Today 37(1) (2010)


“On Data-Driven Analysis of User-Generated Content”

C. Perlich, et al. Forthcoming IEEE Intelligent Systems 25(1) (2010) 12-17


“Medical Data Mining: Insights from Winning Two Competitions”

S. Rosset, C. Perlich, G. Swirszcz, P. Melville, Y. Liu. Forthcoming Journal of Data Mining and Knowledge Discovery 20(3) (2010) 439-468


“Winning the KDD Cup Orange Challenge with Ensemble Selection”

A. Niculescu-Mizil, C. Perlich, et al. Journal of Machine Learning Research W&CP 7 (2009) 23-34


“Operations Research Improves Sales Force Productivity at IBM”

R. Lawrence, C. Perlich, S. Rosset, et al. Interfaces 40(1) (2010) 33-46


“Breast Cancer Identification: KDD Cup Winners Report”

C. Perlich, P. Melville, Y. Liu, G. Swirszcz, S. Rosset and R. Lawrence. SIGKDD Explorations 10(2) (2008) 39-42


“Making the Most of Your Data: KDD Cup 2007 ‘How Many Ratings’ Winner’s Report”

S. Rosset, C. Perlich, Y. Liu. SIGKDD Explorations 9(2) (2007) 66-69

 

“Analytics-driven solutions for customer targeting and sales force allocation”

J. Arroyo, M. Callahan, M. Collins, A. Ershov, I. Khabibrakhmanov, R. Lawrence, S. Mahatma, M. Niemaszyk, C. Perlich, S. Rosset, S. Weiss. IBM Systems Journal 46(4) (2007)

 

“A Market-Based Framework for Bankruptcy Prediction”

Reisz, A.S. and C. Perlich. Journal of Financial Stability 3(2) (2007) 85-131

 

“Ranking-Based Evaluation of Regression Models”

Rosset, S., C. Perlich, and B. Zadrozny. Knowledge and Information Systems 12(3) (2006) 331-329

 

“ACORA: Distribution-Based Aggregation for Relational Learning from Identifier Attributes”

Perlich, C. and F. Provost. Journal of Machine Learning 62 (2006) 65-105

 

“Temporal Resolution of Uncertainty and Corporate Debt Yields: An Empirical Investigation”

Reisz, A.S. and C. Perlich. Journal of Business 79 (2006) 731-770

 

“Predicting Citation Rates for Physics Papers: Constructing Features for an Ordered Probit Model”

Perlich, C., F. Provost, and S. Macskassy. In SIGKDD Explorations (2004) 154-155

 

“Tree Induction vs. Logistic Regression: A Learning Curve Analysis”

Perlich, C., F. Provost, and J. Simonoff. Journal of Machine Learning Research 4 (2003) 211-255

 

Conference and Workshop Papers

“Cross-Validation: Bias Alert - Proceed with Caution”

C. Perlich and G. Swirszcz. Under Review at SIGKDD International Conference on Knowledge Discovery and Data Mining 2010


“A Predictive Perspective on Measures of Influence in Social Networks”

P. Melville, C. Perlich, E. Meliksetian, R. Lawrence. Under Review at SIGKDD International Conference on Knowledge Discovery and Data Mining 2010


“Machine Learning for Social Media Analytics”

P. Melville, et al. 4th Annual Machine Learning Symposium, New York Academy of Science, 2009


“Predicting Links in Dyadic Domains”

C. Perlich, G. Swirszcz and R. Lawrence. The 1st Workshop on Information in Networks, NYU, 2009


“Content-based Link Prediction for Patent Marketing”

C. Perlich, G. Swirszcz and R. Lawrence. International Workshop on Recommendation-based Industrial Applications at RECSYS 2009


“Spatial-temporal causal modeling for climate change attribution”

A. Lozano, H. Li, A. Niculescu-Mizil, Y. Liu, C. Perlich, J. Hosking, N. Abe. SIGKDD International Conference on Knowledge Discovery and Data Mining 2009


“Winners Report: KDD Cup Breast Cancer Identification”

C. Perlich, P. Melville, Y. Liu, G. Swirszcz, S. Rosset and R. Lawrence. The KDD CUP and Workshop on Mining Medical Data at SIGKDD 2008


“Graphical Models for Workforce Classification”

Y. Liu, Z. Kou, C. Perlich, R. Lawrence. 

SIGKDD International Conference on Knowledge Discovery and Data Mining 2008

 

“Mining Political Blog Networks”

W. Gryc, Y. Liu, C. Perlich, R. D. Lawrence. 

Networks in Political Science Conference at Harvard 2008

 

“Making the Most of Your Data: KDD Cup 2007 ‘How Many Ratings’ Winner’s Report”

S. Rosset, C. Perlich, Y. Liu 

KDD Cup and Workshop at SIGKDD 2007

 

“A Data Mining Case Study: Analytics-driven solutions for customer targeting and sales force allocation”

R. Lawrence, C. Perlich, S. Rosset, I. Khabibrakhmanov, S. Mahatma, S. Weiss. 

Second Workshop on Data Mining Case Studies and Practice Prize at SIGKDD 2007

 

“Looking for Great Ideas: Analyzing the Innovation Jam”

Mary Helander, Rick Lawrence, Yan Liu, Claudia Perlich, Chandan Reddy, Saharon Rosset. Workshop on Web Mining and Social Network Analysis at SIGKDD 2007

 

“High Quantile Modeling for Customer Wallet Estimation with Other Applications”

Perlich, C., S. Rosset, R. Lawrence, and B. Zadrozny. 13th SIGKDD International Conference on Knowledge Discovery and Data Mining 2007

 

“Identifying Bundles of Product Options using Mutual Information Clustering”

Perlich, C. SIAM International Conference on Data Mining 2007

 

“Discriminative Embedding for Classification Tasks in Complex Relational and Network Domains”

Perlich, C. Workshop on Novel Applications of Dimensionality Reduction at NIPS 2006

 

“Quantile Modeling for Marketing”

Perlich, C., S. Rosset and B. Zadrozny. Workshop on Data Mining for Business Applications at 12th SIGKDD International Conference on Knowledge Discovery and Data Mining 2006

 

“A New Multi-View Regression Approach with an Application to Customer Wallet Estimation”

Merugu, S., S. Rosset and C. Perlich. 12th SIGKDD International Conference on Knowledge Discovery and Data Mining 2006

 

“Wallet Estimation Models”

Rosset, S., C. Perlich, B. Zadrozny, S. Merugu, S. Weiss and R. Lawrence. International Workshop on Customer Relationship Management: Data Mining Meets Marketing, NYU 2005

 

“Relational Learning for Customer Relationship Management”

Perlich, C., and Z. Huang. International Workshop on Customer Relationship Management: Data Mining Meets Marketing, NYU 2005

 

“Approaching the ILP Challenge 2005: Class-Conditional Bayesian Propositionalization for Genetic Classification”

Perlich, C. Inductive Logic Programming (ILP) 2005

 

“Gene Classification: Issues and Challenges for Relational Learning”

Perlich, C. and S. Merugu. Workshop on Multi-Relational Data Mining (MRDM), at 11th SIGKDD International Conference on Knowledge Discovery and Data Mining 2005

 

“Ranking-Based Evaluation of Regression Models”

Perlich, C., S. Rosset and B. Zadrozny. International Conference on Data Mining (ICDM) 2005

 

“Learning from Identifier Attributes: Distribution-Based Aggregation for Relational Learning”

Perlich, C. and F. Provost. Dagstuhl Seminar 05051, 2005

 

“Aggregation-Based Feature Invention and Relational Concept Classes”

Perlich, C. and F. Provost. Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2003, 167-176

 

“Citation-Based Document Classification”

Perlich, C. Workshop on Information Technology and Systems (WITS) 2003


“Aggregation and Concept Complexity in Relational Learning”

Perlich, C. and F. Provost. Workshop on Learning Statistical Models from Relational Data (SRL), at IJCAI 2003

 

“Relational Learning Problems and Simple Models”

Provost, F., C. Perlich and S. Macskassy. Workshop on Learning Statistical Models from Relational Data (SRL), at IJCAI 2003

 

“ACORA: Automated Construction of Relational Attribute”

Perlich, C. Prototype Track at Workshop on Information Technology and Systems (WITS) 2003

 

“Discovering Knowledge from Relational Data Extracted from Business News”

Bernstein, A., S. Clearwater, S. Hill, C. Perlich and F. Provost. Workshop on Multi-Relational Data Mining (MRDM), at Eighth SIGKDD International Conference on Knowledge Discovery and Data Mining 2002

 

“A Modular Approach to Relational Data Mining”

Perlich, C. and F. Provost. American Conference on Information Systems (AMCIS) 2002

 

“Modeling of Scholastic Aptitude Tests”

Weigend, A.S., C. Perlich and M. Brehler. 

International Conference on Neural Information Processing (ICONIP) 1996

 

Invited Book Chapters

“Database Mining for Marketing”

Perlich, C. and M. Saar-Tsechansky. In Encyclopedia of Marketing, 2010


“Learning Curves in Machine Learning”

Perlich, C. In Encyclopedia of Machine Learning, C. Sammut and G. Webb Editors, Springer 2009


“Quantile Modeling for Wallet Estimation”

Perlich, C. and S. Rosset 

Statistical Methods in eCommerce Research

 

“Aggregation for Predictive Modeling with Relational Data”

Perlich, C. and F. Provost 

In Encyclopedia of Data Warehousing and Mining 2004


“Modeling Quantiles”

Perlich, C., S. Rosset and B. Zadrozny. Forthcoming, Encyclopedia of Data Warehousing and Mining, Second Edition


“Robust Regression Evaluation”

Perlich, C., S. Rosset and B. Zadrozny. Forthcoming, Encyclopedia of Data Warehousing and Mining, Second Edition


Tutorial

“Predictive Modeling in the Wild: Success Factors in Data Mining Competitions and Real-Life Projects”

At SIGKDD International Conference on Knowledge Discovery and Data Mining 2009


Patents

YOR820050714 Ranking-Based Method for Evaluating Customer Wallet Models

YOR820060081 Method for Predicting Customer Wallet

YOR820060057 Method for Customer-Choice Based Bundling of Product Options

YOR920090427 Model for Market Impact Analysis of Part Removal from Complex Products


                                                  
