
The first Cuba-UK (Liverpool) School of Data Mining took place at CENATAV in Havana, Cuba, on January 8-18, 2007, as a joint effort between the University of Liverpool and the Advanced Technologies Application Centre (CENATAV). The event was directed at all those interested in data mining techniques in general, and multimedia data mining in particular, with the intention of establishing a dialogue to foster ongoing academic collaboration between data mining researchers in Cuba and the United Kingdom.
Location: CENATAV, 7a # 21812 e/ 218 y 222, Rpto. Siboney, Playa, Ciudad de la Habana, Cuba, Phone: (+537) 2721670, Fax: (+537) 2730045
Hours: 9:00-13:00 (Coffee break: 11:00-11:30).
| DAY | SEMINAR | SEMINAR |
|---|---|---|
| Monday 8 | Introduction: Aims and objectives, some assumptions about the target audience, data mining in the UK and at the University of Liverpool, and an overview of "what's to come". | Data: Data sources, UCI data repository, IBM QUEST data generator, LUCS-KDD ARM data generator (demonstration of software), normalisation and/or discretisation for (a) ARM, (b) CARM (demonstration of LUCS-KDD-DN software). |
| Tuesday 9 | Association Rule Mining 1: Association Rule Mining (ARM), what is it? Brute-force algorithm (software demonstration), the Apriori algorithm, the T-tree and the Apriori-T algorithm (demonstration of Apriori-T). | Association Rule Mining 2: Lattices and the negative border approach, Dynamic Itemset Counting (DIC), software demonstration of negative border and DIC algorithms, organising the input data, FP-trees and the FP-growth algorithm, P-trees and the TFP algorithm, demonstration of FP-growth and TFP algorithms. |
| Wednesday 10 | Association Rule Mining - The Wider Picture: Vertical v. horizontal data, Maximal Frequent Itemsets (MaxMiner), Frequent Closed Patterns (CloSet), understanding your association rules (ordering, clustering, visualisation), emerging (jumping) patterns, more ARM, and future directions. | Association Rule Mining For Very Large Data Sets: Overview of distributed and parallel ARM, partitioning and segmentation (especially vertical partitioning), experiments with the Distributed Apriori-T Algorithm (DATA), and mining VLDB using partitioning and segmentation. |
| Thursday 11 | Classification Association Rule Mining 1: The classification problem, Classification Association Rules (CARs), some popular Classification Association Rule Mining (CARM) algorithms, demonstration of a number of CARM algorithms (FOIL, PRM, CPAR, CMAR and CBA), evaluation strategies. | Classification Association Rule Mining 2: Fast and effective Classification Association Rule Mining - the Total From Partial Classification (TFPC) approach, software demonstration. |
| Monday 12 | Multi-Agent Data Mining (MADM) 1: What is an agent? (features, categorisation, advantages and disadvantages), Multi-Agent System (MAS) technologies, a Multi-Agent Data Mining (MADM) vision, some research issues, thoughts on ARM in a MADM setting, MADM operation, some more thoughts (argumentation and the Semantic Web). | Multi-Agent Data Mining (MADM) 2 - Incremental Data Mining: The challenge of incremental ARM (IARM), some IARM algorithms (FUP, AFPIM, EFPIM, NFUP, CATS trees, CAN trees), incremental TFP algorithm, more thoughts. |
| Tuesday 13 | Text Mining 1: Text mining techniques, text representation, and benchmark data sets. | Text Mining 2: Text mining research at Liverpool, (a) identification of significant words and phrases, (b) experiments using different phrase-based mining approaches, (c) software demonstration. |
| Wednesday 14 | Multi-Media Data Mining (MMDM): Current research directions at Liverpool, Multimedia Data Mining (MDM), image mining, the LUCS-KDD random image generator, image preprocessing for data mining (image representation): the tesseral and quadtree representations. | Image Mining 1: Issues with image representations, (a) image preprocessing software demonstration (tesseral and quadtree representations), (b) image mining demonstration (tesseral and quadtree representations), (c) graph mining and time series analysis, (d) Dynamic Time Warping (software demonstration), (e) "Blobs", (f) concept graphs and (g) segmentation. |
| Thursday 15 | Image Mining 2: Research work at Liverpool, image primitives (what are they?), comparing primitives, similarity matrices and lattices, similarity weighting, comparator grid, software demonstration. | Summary and Conclusions: Summary of seminar series highlighting main ideas, review of possible future directions relating to individual topics discussed, main findings. |
Delegates at the School of Data Mining, 15 January 2007