
Analytics & Insight: Learning, Optimization, Discovery

Our research in Analytics & Insight spans several areas, from Machine Learning to Optimization to Knowledge Discovery & Data Mining.

Research Topics

Some of the recent research topics the labs have been working on include:

- Enterprise Search and Knowledge Management
- Data Anonymization
- Retail Data Mining: Individual Consumer Modeling, Consumer Behavior Prediction
- Text Mining: Text Classification, Information Extraction, NLP
- Semi-Supervised & Active Learning: Combining Labeled and Unlabeled Data
- Modeling & Simulation: Agent-Based Modeling, Swarm Intelligence, Discrete-Event Simulation, System Dynamics, Numerical Simulation Methods

Projects

Some of our recent projects include:

- Data Anonymization
- SABLE
- Expert Finding
- Technology Lifecycle Analysis
- Product Attribute Extraction (Andrew Fano, Rayid Ghani, Marko Krema, Katharina Probst)
- Multiple Sensor Indoor Surveillance (Valery Petrushin, Gang Wei, Anatole Gershman, Rayid Ghani)
- Auction Price Prediction & Insurance (Rayid Ghani)
- Individual Consumer Modeling (Chad Cumby, Andrew Fano, Rayid Ghani, Marko Krema)
- Intelligent Promotion Planning (Chad Cumby, Andrew Fano, Rayid Ghani, Marko Krema)
- Business Event Monitoring (Alex Kass, Chris Cowell Shah)
- Online Audience Analysis (Gary Boone)
- Industry Complexity Analysis and Simulation Tool (Cem Baydar, Kishore Swaminathan)
- Complex Risk Dynamics (Cem Baydar, Kishore Swaminathan)
- Knowledge Discovery Tool (Edy Liongosari, Mitu Singh)
- Personalized Pricing (Cem Baydar)
- Product Profiler (Andrew Fano, Rayid Ghani)

People

- Chad Cumby
- Divna Djordjevic
- Andrew Fano
- Rayid Ghani
- Alex Kass
- Marko Krema
- Mohit Kumar
- Irina Matveeva
- Yaron Rachlin
- Peter Yeh

Publications

2006 Papers

Semi-Supervised Learning to Extract Attribute-Value Pairs from Product Descriptions on the Web
Katharina Probst, Rayid Ghani, Marko Krema, Andrew Fano, and Yan Liu.
Workshop on Web Mining, held with the European Conference on Machine Learning (ECML/PKDD 2006)
[Paper (PDF)]

Text Mining for Product Attribute Extraction
Rayid Ghani, Katharina Probst, Marko Krema, Andrew Fano, and Yan Liu.
SIGKDD Explorations (2006)
[Paper (PDF)]

2005 Papers

Learning Individual Consumer Models for Personalized Promotions: A Data Mining Case Study.
Chad Cumby, Andrew Fano, Rayid Ghani, and Marko Krema.
Workshop on Data Mining for Business, held with the European Conference on Machine Learning (ECML/PKDD 2005)

Multiple Sensor Integration for Indoor Surveillance.
Valery Petrushin, Gang Wei, Rayid Ghani and Anatole Gershman.
Multimedia Data Mining Workshop, held with the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2005)

Price Prediction and Insurance for Online Auctions
R. Ghani
11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August, 2005
Chicago, IL

Mining Rare and Frequent Events in Multi-camera Surveillance Video using Self-organizing Maps
V.A. Petrushin
11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August, 2005
Chicago, IL

Building Intelligent Shopping Assistant using Individual Consumer Models
C. Cumby, A. Fano, R. Ghani and M. Krema
Proceedings of the 2005 International Conference on Intelligent User Interfaces
January 9-12, 2005
San Diego, California
[Paper (PDF)]

2004 Papers

Predicting the End-Price of Online Auctions
R. Ghani and H. Simmons
International Workshop on Data Mining and Adaptive Modeling Methods for Economics and Management. Held in conjunction with the 15th European Conference on Machine Learning (ECML/PKDD 2004)
September, 2004
Pisa, Italy
[Abstract] [Paper (PDF, 270K)]

Mining the Web to Add Semantics to Retail Data Mining
R. Ghani
Invited Paper. "Web Mining: From Web to Semantic Web". Springer Lecture Notes in Artificial Intelligence, Vol. 3209. Berendt, B.; Hotho, A.; Mladenic, D.; van Someren, M.; Spiliopoulou, M.; Stumme, G. (eds.)
2004
Seattle, Washington

Predicting Customer Shopping Lists from Point-of-sale Purchase Data
C. Cumby, A. Fano, R. Ghani and M. Krema
10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August, 2004
Seattle, Washington
[Abstract] [Paper (PDF, 270K)]

Supporting Drug Discovery Through Knowledge Modeling and Integration
E. S. Liongosari, A. Gershman and M. Singh
Appeared in Fourth International Conference on Knowledge Management
June 30 – July 2, 2004
Graz, Austria

A Review of Operational Risk Quantitative Methodologies Within the BASEL II Framework
J. Aparicio-Navarro and E. Keskiner
May, 2004
[Paper (PDF, 203K)]

Agent-based Modelling for Optimal Trading Decisions
C. Baydar
March, 2004
[Abstract] [Paper (PDF, 33K)]

A New Generation of Digital Library to Support Drug Discovery Research
E. S. Liongosari, A. Gershman and M. Singh
Sixth International Conference on Enterprise Information Systems
April 14-17, 2004
Porto, Portugal

The Role of Knowledge Modeling in Drug Discovery Research
A. Gershman, E. S. Liongosari and M. Singh
Appeared in UK Academy for Information Systems 2004 Conference
May, 2004
Glasgow, United Kingdom

2003 Papers

Agent-based Modeling and Simulation of Store Performance for Personalized Pricing
C. Baydar
2003 Winter Simulation Conference
December 7-10, 2003
New Orleans, Louisiana
[Abstract] [Paper (PDF, 164K)]

Supporting Intelligent Browsing of Disparate Bio-medical Data
E. S. Liongosari, A. Gershman and M. Singh
The Third Annual Mouse Genome Retreat
December 4-6, 2003
Nashville, Tennessee

Knowledge Discovery by Uncovering Hidden Linkages Among Disparate Sources
E. S. Liongosari and A. Gershman
Biomedical Information Science and Technology Initiative 2003 Symposium
National Institutes of Health
November 6-7, 2003
Bethesda, Maryland

Can We Do Better than Google? Using Semantics to Explore Large Heterogeneous Knowledge Sources
A. Gershman and E. Liongosari
First International Workshop on Semantic Web and Databases
September 7-8, 2003
Berlin

Uncovering Hidden Linkages Among Disparate Sources
E. S. Liongosari and M. Singh
The Eleventh International Conference on Intelligent Systems for Molecular Biology
June 29-July 3, 2003
Brisbane, Australia

Integrating Disparate Information Through Knowledge Modeling
E. S. Liongosari and L. Hunter
The 26th Annual Meeting of Research Society on Alcoholism
June, 2003
Ft. Lauderdale, Florida

On Kernel Methods for Relational Learning
C. Cumby and D. Roth
International Conference on Machine Learning (ICML 2003)
August 21-24, 2003
Washington DC

Active Learning for Information Extraction with Multiple View Feature Sets
R. Ghani, R. Jones, T. Mitchell and E. Riloff
Workshop on Adaptive Text Extraction & Mining at the European Conference on Machine Learning (ECML 2003)
Dubrovnik, Croatia
[Abstract] [Paper (PDF, 248K)]

Building Minority Language Corpora by Learning to Generate Web Search Queries
R. Ghani, R. Jones and D. Mladenic
Journal of Knowledge and Information Systems (KAIS)
2003
[Abstract] [Paper (PDF, 1.2MB)]

2002 Papers

Accelerating Drug Discovery Process Through Model-Based Knowledge Integration
Edy S. Liongosari
KM World and Intranets 2002, Santa Clara, CA, 29 - 31 October 2002
[Paper (PDF, 3.1MB)]

Creating Emotion Recognition Agents for Speech Signal
Valery A. Petrushin
In Dautenhahn K., Bond A.H., Canamero L., and Edmonds B. (eds.) Socially Intelligent Agents. Creating Relationships with Computers and Robots. Kluwer Academic Publishers, 2002, pp. 77-84.
[Abstract] [Paper (PDF, 38K)]

RUSLANA: A Database of Russian Emotional Utterances
Valery A. Petrushin
International Conference on Spoken Language Processing (ICSLP 2002), 17-20 September 2002, Denver, Colorado
[Abstract] [Paper (PDF, 454K)]

Student Response Evaluation for Spoken Language Learning: A Case Study of Learning Chinese Tones
Valery A. Petrushin
IEEE International Conference on Advanced Learning Technologies (ICALT 2002), 9-12 September 2002, Kazan, Russia
[Abstract] [Paper (PDF, 214K)]

A Learning Environment for Creating Media Processing Systems
Gang Wei, Valery A. Petrushin and Anatole V. Gershman
IEEE International Conference on Advanced Learning Technologies (ICALT 2002), 9-12 September 2002, Kazan, Russia
[Abstract] [Paper (PDF, 227K)]

The Community of Multimedia Agents Project
Gang Wei, Valery A. Petrushin and Anatole V. Gershman
IEEE International Conference on Multimedia and Expo (ICME2002), 26-29 August 2002, Lausanne, Switzerland
[Abstract] [Paper (PDF, 268K)]

Diagnosis of Complex Failures in Robotic Assembly Systems using Virtual Factories
Cem Baydar
AAAI/KDD/UAI-2002 Joint Workshop on Real-Time Decision Support and Diagnosis Systems, 29 July 2002, Edmonton, Alberta
[Abstract] [Paper (PDF, 205K)]

From Data to Insight: The Community of Multimedia Agents
Gang Wei, Valery A. Petrushin and Anatole V. Gershman
3rd International Workshop on Multimedia Data Mining (MDM KDD 2002) in conjunction with The 8th ACM SIGKDD Intl. Conf. on Knowledge Discovery & Data Mining, 23-26 July 2002, Edmonton, Alberta, Canada

A Hybrid Parallel Simulated Annealing Algorithm to Optimize Store Performance
Cem Baydar
Workshop on Evolutionary Computing for Optimisation in Industry at the Genetic and Evolutionary Computation Conference (GECCO-2002), 9 July 2002, New York
[Abstract] [Paper (PDF, 130K)]

Combining Labeled and Unlabeled Data for MultiClass Text Categorization
Rayid Ghani
International Conference on Machine Learning (ICML 2002), 8-12 July 2002, Sydney, Australia
[Abstract] [Paper (PDF, 133K)]

Building Recommender Systems Using a Knowledge Base of Product Semantics
Rayid Ghani and Andrew Fano
Workshop on Recommendation and Personalization in ECommerce (RPEC 2002) at the Second International Conference on Adaptive Hypermedia and Adaptive Web-based Systems (AH 2002), 28 May 2002, Malaga, Spain
[Abstract] [Paper (PDF, 136K)]

A Comparison of Efficacy and Assumptions of Bootstrapping Algorithms for Training Information Extraction Systems
Rayid Ghani and Rosie Jones (Carnegie Mellon University)
Workshop on Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Data at the Linguistic Resources and Evaluation Conference (LREC 2002), 27 May 2002, Las Palmas, Spain
[Abstract] [Paper (PDF, 87K)]

Learning to Change Taxonomies
Elena Eneva and Valery A. Petrushin
SPIE 2002 Conference on Data Mining and Knowledge Discovery: Theory, Tools, and Technology IV, 1-5 April 2002, Orlando
[Abstract] [Paper (PDF, 42K)]

One-to-One Modeling and Simulation: A New Approach in Customer Relationship Management for Grocery Retail
Cem Baydar
SPIE Conference on Data Mining and Knowledge Discovery: Theory, Tools, and Technology IV, 1-5 April 2002, Orlando
[Abstract] [Paper (PDF, 216K)]

2001 Papers

Organizing Information to Support Knowledge Discovery
Edy S. Liongosari
Enterprise Content Management 2001, Los Angeles, CA, 9-12 October 2001

PERSEUS: Personalized Multimedia News Portal
Victor O. Kulesh, Valery A. Petrushin, Ishwar K. Sethi
IASTED International Conference on Artificial Intelligence Applications (AIA 2001), 4-7 September 2001, Marbella, Spain

Using Speech Analysis Techniques for Language Learning
Valery A. Petrushin
The IEEE International Conference on Advanced Learning Technologies (ICALT 2001), 6-8 August 2001, Madison, WI

eShopper Modeling and Simulation
Valery A. Petrushin
International Society for Optical Engineering (SPIE 2001) Conference on Data Mining and Knowledge Discovery: Theory, Tools, and Technology III, 16-20 April 2001, Orlando

2000 Papers

Data Mining for Targeted Marketing
Valery A. Petrushin, James M. Britton.
Artificial Neural Networks in Engineering (ANNIE 2000), 5-8 November 2000, St. Louis
[Abstract] [Paper (PDF, 278K)]

Emotion Recognition Agents in Real World
Valery A. Petrushin.
2000 AAAI Fall Symposium on Socially Intelligent Agents: Human in the Loop, 3-5 November 2000, North Falmouth, MA
[Abstract] [Paper (PDF, 117K)]

Hidden Markov Models: Fundamentals and Applications. Part 1: Markov Chains and Mixture Models
Valery A. Petrushin
Online Symposium for Electronics Engineers 2000, November 2000
[Abstract] [Paper (PDF, 273K)]

Hidden Markov Models: Fundamentals and Applications. Part 2: Discrete and Continuous Hidden Markov Models
Valery A. Petrushin
Online Symposium for Electronics Engineers 2000, November 2000
[Abstract] [Paper (PDF, 281K)]

Emotion Recognition In Speech Signal: Experimental Study, Development, And Application
Valery A. Petrushin.
Sixth International Conference on Spoken Language Processing (ICSLP 2000), 16-20 October 2000, Beijing
[Abstract] [Paper (PDF, 247K)]

Applying Knowledge Integration Technology
Edy S. Liongosari and Richard J. Stuckey
Comdex 2000, Chicago, IL, 17-21 April 2000
[Paper (PDF, 746K)]

1999 Papers

Emotion in Speech: Recognition and Application to Call Centers
Valery A. Petrushin
Artificial Neural Networks in Engineering (ANNIE '99), 7-10 November 1999, St. Louis
[Abstract] [Paper (PDF, 504K)]

Opportunistic Exploration of Large Consumer Product Spaces
Douglas Bryan and Anatole Gershman
ACM Conference on Electronic Commerce (EC '99), 3-5 November 1999, Denver
[Abstract] [Paper (PDF, 1.6MB), (ZIP, 1.6MB)]

Modeling Organizations Using Agent-Based Simulations
M. V. Nagendra Prasad and Donald A. Chartier
A Workshop on Agent Simulation: Applications, Models, and Tools, 15-16 October 1999, Chicago
[Abstract] [Paper (PDF, 322K)]

In Search of A New Generation of Knowledge Management Applications
Edy S. Liongosari, Kelly L. Dempski and Kishore S. Swaminathan
ACM SIGGROUP Bulletin, July 1999
[Abstract] [Paper (PDF, 2MB), (ZIP, 340K)]

Use of Recurrent Neural Networks for Strategic Data Mining of Sales Information
Jayavel Shanmugasundaram, M. V. Nagendra Prasad, Sanjeev Vadhavkar and Amar Gupta
1999 Information Resources Management Association International Conference (IRMA '99), 16-19 May 1999, Hershey, PA
[Abstract] [Paper (PDF, 21MB), (ZIP, 1.4MB)]

Integrating Disparate Knowledge Sources
Adam B. Brody, Kelly L. Dempski, Joseph E. Kaplan, Scott W. Kurth, Edy S. Liongosari and Kishore S. Swaminathan
Second International Conference on The Practical Application of Knowledge Management (PAKeM '99), 21-23 April 1999, London
[Abstract] [Paper (PDF, 4.6MB), (ZIP, 619K)]

Opportunistic Exploration of Large Consumer Product Spaces (demonstration)
Douglas Bryan and Anatole Gershman
1999 International Conference on Intelligent User Interfaces (IUI '99), 5-8 January 1999, Redondo Beach, California
[Abstract] [Paper (PDF, 100K)]

1998 Papers

Distributed Case-Based Learning
M. V. Nagendra Prasad
Fifteenth National Conference on Artificial Intelligence (AAAI '98), 27 July 1998, Madison, Wisconsin
[Abstract] [Paper (PDF, 316K)]

InfoScout: An Active Recommender Agent
M. V. Nagendra Prasad and Theodore D. Anagnost
Fifteenth National Conference on Artificial Intelligence (AAAI '98), 26 July 1998, Madison, Wisconsin
[Abstract] [Paper (PDF, 144K)]