Hierarchical Data Topology Based Selection for Large Scale Learning

Hmida, Hmida; Ben Hamida, Sana; Borgi, Amel; Rukoz, Marta (2016), Hierarchical Data Topology Based Selection for Large Scale Learning, in El Baz, Didier; Bourgeois, Julien, 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress, IEEE - Institute of Electrical and Electronics Engineers : Piscataway, NJ, p. 1221-1226. 10.1109/UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld.2016.0186

Type
Conference paper
Date
2016
Conference title
UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld 2016
Conference date
2016-07
Conference city
Toulouse
Conference country
France
Book title
2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress
Book author
El Baz, Didier; Bourgeois, Julien
Publisher
IEEE - Institute of Electrical and Electronics Engineers
Published in
Piscataway, NJ
ISBN
978-1-5090-2770-5
Number of pages
1242
Pages
1221-1226
Publication identifier
10.1109/UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld.2016.0186
Author(s)
Hmida, Hmida
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Ben Hamida, Sana
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Borgi, Amel
Laboratoire d'Informatique, Programmation, Algorithmique et Heuristique [LIPAH]
Rukoz, Marta
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Abstract (EN)
The amount of data available for data mining and knowledge discovery continues to grow rapidly in the era of Big Data. Genetic Programming (GP) algorithms, which are efficient machine learning techniques, face a new challenge: dealing with the mass of data provided. Active Sampling, already used for Active Learning, may be a good way to improve the training of Evolutionary Algorithms (EA) on very large data sets. This paper investigates the adaptation of Topology Based Selection (TBS) to massive learning datasets by means of Hierarchical Sampling. We propose combining Random Subset Selection (RSS) with TBS to create the RSS-TBS method. Two variants are implemented and applied to the KDD intrusion detection problem. They are compared with the original RSS and TBS techniques. The experimental results show that the high computational cost incurred by the original TBS on large datasets can be reduced with Hierarchical Sampling.
Subjects / Keywords
Sampling; machine learning; decision support systems; Big data

Related items

Showing items related by title and author.

  • Scale Genetic Programming for large Data Sets: Case of Higgs Bosons Classification
    Hmida, Hmida; Ben Hamida, Sana; Borgi, Amel; Rukoz, Marta (2018) Article accepted for publication or published
  • Adaptive sampling for active learning with genetic programming
    Ben Hamida, Sana; Hmida, Hmida; Borgi, Amel; Rukoz, Marta (2019) Article accepted for publication or published
  • Sampling Methods in Genetic Programming Learners from Large Datasets: A Comparative Study
    Hmida, Hmida; Ben Hamida, Sana; Borgi, Amel; Rukoz, Marta (2017) Conference paper
  • A new adaptive sampling approach for Genetic Programming
    Hmida, Hmida; Ben Hamida, Sana; Borgi, Amel; Rukoz, Marta (2019) Conference paper
  • Genetic Programming over Spark for Higgs Boson Classification
    Hmida, Hmida; Ben Hamida, Sana; Borgi, Amel; Rukoz, Marta (2019) Conference paper