Claudia Perlich
claudia@media6degrees.com 37 East 18th Street, 9th Floor New York, NY 10003 Office: (646) 745 1574 Cell: (914) 409 5609 I am currently working as Chief Scientist at Media6Degrees. Media6Degrees is the pioneer in Social Targeting. We find the best prospective customers for your brand by targeting the people who cluster with your current customers around the web. Previously I was a member of the Predictive Modeling Group at IBM T.J. Watson Research Center. My research interests are in practical applications of Machine Learning and Data Mining approaches. I graduated in 2004 from the Information Systems Department at Stern, NYU under the supervision of Foster Provost. Awards and RecognitionsWinner KDD CUP 2009 Fast Challenge: “Fast Challenge for CRM” Finalist in the INFORMS Edelman competition 2009: "Operations Research Improves Sales Force Productivity" Winner KDD CUP 2008 Task 1 and Task 2: “Identifying Breast Cancer” Winner INFORMS Data Mining Contest 2008: “Identifying Pneumonia Patients" Winner KDD CUP 2007 Task 2: “Predicting movie popularity for NETFLIX” Data Mining Practice Prize at KDD 2007: “Predictive modeling for marketing” Winner ILP Challenge 2005: “Genetic classification” Runner Up KDD CUP 2003 Task 1: “Predicting Citation Rates” Selected PublicationsJournal Papers “On
Cross-Validation and Stacking: Building Seemingly Predictive Models on Random
Data” Claudia Perlich, Grzegorz Swirszcz. SIGKDD Explorations 12(2) (2010) 11-15 “Social Media Analytics: The Next Generation of Analytics-Based Marketing Seeks Insights from Blogs”
R. Lawrence, P. Melville,C. Perlich, et al. Forthcoming ORMS Today 37(1) (2010) “On Data-Driven Analysis of User-Generated Content” C. Perlich, et al. Forthcoming IEEE Intelligent Systems 25(1) (2010) 12-17 “Medical Data Mining: Insights from Winning Two Competitions” S. Rosset, C. Perlich, G. Swirszcz, P. Melville, Y. Liu. Forthcoming Journal of Data Mining and Knowledge Discovery 20 (3) (2010), 439-468 “Winning the KDD Cup Orange Challenge with Ensemble Selection” A. Niculescu-Mizil, C.Perlich, et al. Journal of Machine Learning Research W&CP 7 (2009) 23-34 “Operations Research Improves Sales Force Productivity at IBM” R. Lawrence, C.Perlich, S.Rosset, et al. Interfaces 40(1) (2010) 33-46 “Breast Cancer Identification: KDD Cup Winners Report” C. Perlich, P. Melville, Y. Liu, G. Swirszcz, S. Rosset and R. Lawrence. SIGKDD Explorations 10(2) (2008) 39-42 “Making the Most of Your Data: KDD Cup 2007 ‘How Many Ratings’ Winner’s Report” S. Rosset, C. Perlich, Y. Liu, In SIGKDD Explorations 9(2) (2007) 66-69
“Analytics-driven solutions for customer targeting and sales force allocation” J. Arroyo, M. Callahan, M. Collins, A. Ershov, I. Khabibrakhmanov, R. Lawrence, S.Mahatma, M. Niemaszyk, C. Perlich, S. Rosset, S. Weiss, IBM Systems Journal 46 (4) (2007)
“A Market-Based Framework for Bankruptcy Prediction” Reisz, A.S. and C. Perlich., Journal of Financial Stability 3(2) (2007) 85-131
“Ranking-Based Evaluation of Regression Models” Rosset, S., C. Perlich, and B. Zadrozny, Knowledge and Information Systems 12 (3) 2006 331-329
“ACORA: Distribution-Based Aggregation for Relational Learning from Identifier Attributes” Perlich, C. and F. Provost. Journal of Machine Learning 62 (2006) 65-105
“Temporal Resolution of Uncertainty and Corporate Debt Yields: An Empirical Investigation” Reisz, A.S. and C. Perlich. Journal of Business 79 (2006) 731-770
“Predicting Citation Rates for Physics Papers: Constructing Features for an Ordered Probit Model” Perlich, C., F. Provost, and S. Macskassy. In SIGKDD Explorations (2004) 154-155
“Tree Induction vs. Logistic Regression: A Learning Curve Analysis” Perlich, C., F. Provost, and J. Simonoff. Journal of Machine Learning Research 4 (2003) 211-255
Conference and Workshop Papers “Cross-Validation: Bias Alert - Proceed with Caution” C. Perlich and G. Swirszcz. Under Review at SIGKDD International Conference on Knowledge Discovery and Data Mining 2010 “A Predictive Perspective on Measures of Influence in Social Networks” P. Melville, C. Perlich, E. Meliksetian, R. Lawrence. Under Review at SIGKDD International Conference on Knowledge Discovery and Data Mining 2010 “Machine Learning for Social Media Analytics” P. Melville, et al. 4th Annual Machine Learning Symposium, New York Academy of Science, 2009 “Predicting Links in Dyadic Domains” C. Perlich, G. Swirszcz and R. Lawrence. The 1st Workshop on Information in Networks, NYU, 2009 “Content-based Link Prediction for Patent Marketing” C. Perlich, G. Swirszcz and R. Lawrence. International Workshop on Recommendation-based Industrial Applications at RECSYS 2009 “Spatial-temporal causal modeling for climate change attribution” A. Lozano, H. Li, A. Niculescu-Mizil, Y. Liu, C. Perlich, J. Hosking, N. Abe. SIGKDD International Conference on Knowledge Discovery and Data Mining 2009 “Winners Report: KDD Cup Breast Cancer Identification” C. Perlich, P. Melville, Y. Liu, G. Swirszcz, S. Rosset and R. Lawrence. The KDD CUP and Workshop on Mining Medical Data at SIGKDD 2008 Y. Liu, Z. Kou, C. Perlich, R. Lawrence. SIGKDD International Conference on Knowledge Discovery and Data Mining 2008
“Mining Political Blog Networks” W. Gryc, Y. Liu, C. Perlich, R. D. Lawrence. Networks in Political Science Conference at Harvard 2008
“Making the Most of Your Data: KDD Cup 2007 ‘How Many Ratings’ Winner’s Report” S. Rosset, C. Perlich, Y. Liu KDD Cup and Workshop at SIGKDD 2007
“A Data Mining Case Study: Analytics-driven solutions for customer targeting and sales force allocation” R. Lawrence, C. Perlich, S. Rosset, I. Khabibrakhmanov, S. Mahatma, S. Weiss. Second Workshop on Data Mining Case Studies and Practice Prize at SIGKDD 2007
“Looking for Great Ideas: Analyzing the Innovation Jam” Mary Helander, Rick Lawrence, Yan Liu, Claudia Perlich, Chandan Reddy, Saharon Rosset. Workshop on Web Mining and Social Network Analysis at SIGKDD 2007
“High Quantile Modeling for Customer Wallet Estimation with Other Applications” Perlich, C., S. Rosset, R. Lawrence, and B. Zadrozny, 13th SIGKDD International Conference on Knowledge Discovery and Data Mining 2007
“Identifying Bundles of Product Options using Mutual Information Clustering” Perlich, C., SIAM International Conference on Data Mining 2007
“Discriminative Embedding for Classification Tasks in Complex Relational and Network Domains” Perlich, C., Workshop on Novel Applications of Dimensionality Reduction at NIPS 2006
“Quantile Modeling for Marketing” Perlich, C., S. Rosset and B. Zadrozny. Workshop on Data Mining for Business Applications at 12th SIGKDD International Conference on Knowledge Discovery and Data Mining 2006
“A New Multi-View Regression Approach with an Application to Customer Wallet Estimation” Merugu, S. S.Rosset and C. Perlich. 12th SIGKDD International Conference on Knowledge Discovery and Data Mining 2006
“Wallet Estimation Models” Rosset, S., C. Perlich, B. Zadrozny, S. Merugu, S. Weiss and R. Lawrence. International Workshop on Customer Relationship Management: Data Mining Meets Marketing, NYU 2005
“Relational Learning for Customer Relationship Management” Perlich, C., and Z. Huang. International Workshop on Customer Relationship Management: Data Mining Meets Marketing, NYU 2005
“Approaching the ILP Challenge 2005: Class-Conditional Bayesian Propositionalization for Genetic Classification” Perlich, C. Inductive Logic Programming (ILP) 2005
“Gene Classification: Issues and Challenges for Relational Learning” Perlich, C, and S. Merugu. Workshop on Multi-Relational Data Mining (MRDM), at 11th SIGKDD International Conference on Knowledge Discovery and Data Mining 2005
“Ranking-Based Evaluation of Regression Models” Perlich, C., S. Rosset and B. Zadrozny. International Conference on Data Mining (ICDM) 2005
“Learning from Identifier Attributes: Distribution-Based Aggregation for Relational Learning” Perlich, C. and F. Provost. Dagstuhl Seminar 05051, 2005
“Aggregation-Based Feature Invention and Relational Concept Classes” Perlich, C. and F. Provost. Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2003, 167-176
“Citation-Based Document Classification” Perlich, C. Workshop on Information Technology and Systems (WITS) 2003 “Aggregation and Concept Complexity in Relational Learning” Perlich, C. and F. Provost. Workshop on Learning Statistical Models from Relational Data (SRL), at IJCAI 2003
“Relational Learning Problems and Simple Models” Provost, F., C. Perlich and S. Macskassy. Workshop on Learning Statistical Models from Relational Data (SRL), at IJCAI 2003
“ACORA: Automated Construction of Relational Attribute” Perlich, C. Prototype Track at Workshop on Information Technology and Systems (WITS) 2003
“Discovering Knowledge from Relational Data Extracted from Business News” Bernstein, A., S. Clearwater, S. Hill, C. Perlich and F. Provost. Workshop on Multi-Relational Data Mining (MRDM), at Eighth SIGKDD International Conference on Knowledge Discovery and Data Mining 2002
“A Modular Approach to Relational Data Mining” Perlich, C. and F. Provost. American Conference on Information Systems (AMCIS) 2002
“Modeling of Scholastic Aptitude Tests” Weigend, A.S., C. Perlich and M. Brehler. International Conference on Neural Information Processing (ICONIP) 1996
Invited Book Chapters “Database Mining for Marketing” Perlich, C. and M. Saar-Tsechansky. In Encyclopedia of Marketing, 2010 “Learning Curves in Machine Learning” Perlich, C. In Encyclopedia of Machine Learning, C. Sammut and G. Webb Editors, Springer 2009 “Quantile Modeling for Wallet Estimation” Perlich, C. and S. Rosset Statistical Methods in eCommerce Research
“Aggregation for Predictive Modeling with Relational Data” Perlich, C. and F. Provost In Encyclopedia of Data Warehousing and Mining 2004 “Modeling Quantiles” Perlich, C., S. Rosset and B.Zadrozny. Forthcoming, Encyclopedia of Data Warehousing and Mining, Second Edition “Robust Regression Evaluation” Perlich, C., S. Rosset and B.Zadrozny. Forthcoming, Encyclopedia of Data Warehousing and Mining, Second Edition Tutorial “Predictive Modeling in the Wild: Success Factors in Data Mining Competitions and Real-Life Projects” At SIGKDD International Conference on Knowledge Discovery and Data Mining 2009 Patents YOR820050714 Ranking-Based Method for Evaluating Customer Wallet Models YOR820060081 Method for Predicting Customer Wallet YOR820060057 Method for Customer-Choice Based Bundling of Product Options YOR920090427 Model for Market Impact Analysis of Part Removal from Complex Products |