Among downloadable public datasets that come with ground-truth labels, the following are highly reliable training datasets that can be applied to business use cases.


1) HTTP CSIC 2010 Dataset for Intrusion detection (Security) - http://www.isi.csic.es/dataset/ 

2) Multi-Source Cyber-Security Events Dataset (Security) - http://csr.lanl.gov/data/cyber1/ 

3) Air Quality Dataset (Public sector) - http://archive.ics.uci.edu/ml/datasets/Air+Quality# 

4) Gas Sensors for Home activity monitoring Dataset (Smart Home) - https://github.com/thmosqueiro/ENose-Decorr_Humdt_Temp 

5) Bank Marketing Dataset (Marketing, Retail) - http://archive.ics.uci.edu/ml/datasets/Bank+Marketing# 

6) Human Activity Recognition using smartphones Dataset (Marketing, Retail) - http://archive.ics.uci.edu/ml/datasets/Smartphone-Based+Recognition+of+Human+Activities+and+Postural+Transitions 

7) Default of Credit Card Clients in Taiwan (6 months) Dataset (Marketing, Finance) - http://archive.ics.uci.edu/ml/datasets/default+of+credit+card+clients 

8) Online Retail Dataset (Marketing, e-Commerce) - http://archive.ics.uci.edu/ml/datasets/Online+Retail

9) MIMIC (Medical database)  - https://github.com/MIT-LCP/mimic-code  /  https://mimic.physionet.org/about/mimic/ 

Health-related data for over 40,000 patients who stayed in the critical care units of Beth Israel Deaconess Medical Center between 2001 and 2012.

Includes demographics, vital sign measurements (~1 data point per hour), laboratory test results, procedures, medications, caregiver notes, imaging reports, and mortality.
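
Because the vital signs arrive at roughly one data point per hour, a common first step before modeling is to align the readings to an hourly grid. The following is only a minimal sketch of that idea in Python: the `hourly_means` helper and the `(timestamp, value)` input layout are assumptions made for illustration, not the actual MIMIC schema, and the sample readings are invented.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def hourly_means(readings):
    """Group (timestamp, value) pairs by clock hour and average each group.

    `readings` is a hypothetical layout chosen for this sketch; real MIMIC
    tables are organized differently and need their own extraction step.
    """
    buckets = defaultdict(list)
    for ts, value in readings:
        # Truncate the timestamp to the start of its hour.
        hour = ts.replace(minute=0, second=0, microsecond=0)
        buckets[hour].append(value)
    # Return the hours in chronological order.
    return {hour: mean(values) for hour, values in sorted(buckets.items())}


# Invented heart-rate readings, not real patient data.
sample = [
    (datetime(2020, 1, 1, 8, 5), 80.0),
    (datetime(2020, 1, 1, 8, 50), 84.0),
    (datetime(2020, 1, 1, 9, 10), 90.0),
]
result = hourly_means(sample)
for hour, value in result.items():
    print(hour.isoformat(), value)
```

Hours with no readings are simply absent from the result, so any gap filling would have to be decided separately.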

 

