Research Day

Computer Science Department
University of Vermont

October 10 (Friday), 2003

Mining Data Streams

Philip S. Yu
Manager, Software Tools and Techniques
IBM T.J. Watson Research Center
19 Skyline Drive
Hawthorne, NY 10532

ABSTRACT

With the advance of data gathering and communication technologies, it becomes increasingly possible to support real-time monitoring of large amount of information from diverse information sources. Examples include trade surveillance for security fraud and money laundering, network monitoring for intrusion detection, bio-surveillance for terrorist attacks, and various sensor network based monitoring applications. Data is viewed as a continuous stream in these kinds of applications. Problems such as data mining which have been widely studied for traditional data sets cannot be easily applied to the data stream domain. This is because the large volume of data arriving in a stream renders most algorithms too inefficient as most mining algorithms require multiple scans of data which is unrealistic for stream data. More importantly, the characteristics of the data stream can change over time and the evolving pattern needs to be captured. In this talk, I'll provide an overview, discuss the issues and focus on how to mine evolving data streams.

Bio

Dr. Philip S. Yu is the manager of the Software Tools and Techniques group at the IBM Thomas J. Watson Research Center . The current focuses of the project include the development of advanced algorithms and optimization techniques for data mining, anomaly detection and personalization, and the enabling of Web technologies to facilitate E-commerce and pervasive computing.

Dr. Yu’s research interests include data mining, Internet applications and technologies, database systems, multimedia systems, parallel and distributed processing, disk arrays, computer architecture, performance modeling and workload analysis. Dr. Yu has published more than 340 papers in refereed journals and conferences. He holds or has applied for more than 200 US patents. Dr. Yu is an IBM Master Inventor.

Dr. Yu is a Fellow of the ACM and a Fellow of the IEEE. He will become the Editor-in-Chief of IEEE Transactions on Knowledge and Data Engineering on Jan. 2001. He is an associate editor of ACM Transactions of the Internet Technology and also Knowledge and Information Systems Journal. He is a member of the IEEE Data Engineering steering committee. He also serves on the steering committee of IEEE Intl. Conference on Data Mining. He received an IEEE Region 1 Award for "promoting and perpetuating numerous new electrical engineering concepts".

Philip S. Yu received the B.S. Degree in E.E. from National Taiwan University, Taipei, Taiwan, the M.S. and Ph.D. degrees in E.E. from Stanford University, and the M.B.A. degree from New York University.


Related Web sites on publications

o DBLB for database publications

o Webbib for Web related publications