IMPRINT
Conventional stream mining methods assume that each data instance is seen only once and is forgotten after being processed. Consider for example a classifier that distinguishes between normal network accesses and attacks. This classifier reads each data instance (access operation) once and must adapt to new types of attack. However, the data to be analyzed in many business applications are not simple instances, but complex, nested objects that contain streams of data instances. Customer data are such an example: they encompass some stationary information, as well as transactions like purchases, service requests, product reviews etc. To learn and maintain customer segments, a company needs learning methods that derive and adapt models upon the complex objects and the streams feeding them.
In IMPRINT we distinguish between perennial objects, which contain data instances, and the stream of data instances themselves. The challenges of mining perennial objects are manifold. They include learning upon objects that grow as new transactions arrive, the comparison of objects that differ in size and age, and their efficient maintenance. In IMPRINT, we will design, develop and evaluate adaptive learning methods that deal with the above challenges.
Publications
- Learning Relational User Profiles and Recommending Items as Their Preferences Change. International Journal on Artificial Intelligence Tools, (24)02:31 pages, 2015. URL
- Learning and inspecting classification rules from longitudinal epidemiological data to identify predictive features on hepatic steatosis. Expert Systems with Applications, (41)11:5405-5415, Elsevier BV, September 2014. URL
- Subpopulation Discovery in Epidemiological Data with Subspace Clustering. Foundations of Computing and Decision Sciences (FCDS), (39)4:271-300, 2014. URL
- Discovering and Monitoring Product Features and the Opinions on them with OPINSTREAM. Neurocomput., Elsevier Science Publishers B. V., Amsterdam, The Netherlands, The Netherlands, 2014.
- Adaptive Semi Supervised Opinion Classifier With Forgetting Mechanism. Proc. of the 29th Annual ACM Symposium on Applied Computing, ACM, 2014.
- Using Participant Similarity for the Classification of Epidemiological Data on Hepatic Steatosis. Proc. of the 27th IEEE Int. Symposium on Computer-Based Medical Systems (CBMS14), IEEE, Mount Sinai, NY, 2014. URL
- Classification of Benign and Malignant DCE-MRI Breast Tumors by Analyzing the Most Suspect Region. In Hans-Peter Meinzer, Thomas Martin Deserno, Heinz Handels, and Thomas Tolxdorff (Eds.), Bildverarbeitung für die Medizin 2013, 45-50, Springer Berlin Heidelberg, 2013. URL
- Can we distinguish between benign and malignant breast tumors in DCE-MRI by studying a tumor's most suspect region only?. Computer-Based Medical Systems (CBMS), 2013 IEEE 26th International Symposium on, 77-82, June 2013. URL
- Extracting Opinionated (Sub)Features from a Stream of Product Reviews. In Johannes Fürnkranz, Eyke Hüllermeier, and Tomoyuki Higuchi (Eds.), Discovery Science, (8140):340-355, Springer Berlin Heidelberg, 2013. URL
- Framework for Storing and Processing Relational Entities in Stream Mining. In Jian Pei, Vincent S. Tseng, Longbing Cao, Hiroshi Motoda, and Guandong Xu (Eds.), Advances in Knowledge Discovery and Data Mining, (7819):497-508, Springer Berlin Heidelberg, 2013. URL
- Where Are We Going? Predicting the Evolution of Individuals. In Jaakko Hollmén, Frank Klawonn, and Allan Tucker (Eds.), Advances in Intelligent Data Analysis XI, (7619):357-368, Springer Berlin Heidelberg, 2012. URL
- Advanced Topics on Data Stream Mining: Part II. Mining Multiple Streams. Bristol, UK, 24-28 09 2012.
- Discovering Global and Local Bursts in a Stream of News. Proceedings of the 27th Annual ACM Symposium on Applied Computing, 807--812, ACM, New York, NY, USA, 2012. URL
- A Semi-supervised Incremental Clustering Algorithm for Streaming Data. Proc. of 16th Pacific-Asian Conf. on Knowledge Discovery (PAKDD 2012), Kuala Lumpur, Malaysia, May 2012.
- Online Clustering of High-Dimensional Trajectories under Concept Drift. In Dimitrios Gunopulos, Thomas Hofmann, Donato Malerba, and Michalis Vazirgiannis (Eds.), Machine Learning and Knowledge Discovery in Databases, (6912):261-276, Springer Berlin Heidelberg, 2011. URL
- Classification Rule Mining for a Stream of Perennial Objects.. In Nick Bassiliades, Guido Governatori, and Adrian Paschke (Eds.), RuleML Europe, (6826):281-296, Springer, 2011. URL
- A data generator for multi-stream data. The second International Workshop on Mining Ubiquitous and Social Environments, MUSE '11, September 5, 2011 Athens, Greece, 2011. URL