ResearchResearchDatasetDatasetExperimentsExperiments
GitHubGitHub (opens in a new tab)
  • Abstract
  • Research Objectives
  • Problem definition
  • Limitations
  • Project Architecture
  • DL Pipeline
  • Technology Stack
  • Data Collection
  • Scraping
    • Streaming Service APIs
    • Quality of data
    • Store Data
    • Transformation & Cleaning
  • Dataset
    • Design Principles
    • Taxonomy Construction
    • Descriptive Statistics
  • Exploratory data analysis
  • Audio Analysis
  • Feature Extraction
  • Pre-processing
  • Model
  • Convolutional Neural Network (CNN)
  • Architecture
  • Training
  • Performance
  • Conclusions
  • Contribution
  • Discussion
  • Endnotes
  • Sources
  • References
  • Audio Datasets
  • Research
    • Abstract
    • Research Objectives
    • Problem definition
    • Limitations
    • Project Architecture
    • DL Pipeline
    • Technology Stack
    • Data Collection
    • Scraping
      • Streaming Service APIs
      • Quality of data
      • Store Data
      • Transformation & Cleaning
    • Dataset
      • Design Principles
      • Taxonomy Construction
      • Descriptive Statistics
    • Exploratory data analysis
    • Audio Analysis
    • Feature Extraction
    • Pre-processing
    • Model
    • Convolutional Neural Network (CNN)
    • Architecture
    • Training
    • Performance
    • Conclusions
    • Contribution
    • Discussion
    • Endnotes
    • Sources
    • References
    • Audio Datasets
      • Audio Datasets
  • Dataset
  • Experiments
    • Experiments
    • Archive
    • Binary Classification
      • Training
      • Experiment 1
      • Experiment 2
      • Experiment 3
      • Experiment 4
      • Experiment 5
  • Technology
    • Pytorch (opens in a new tab)
    • torch audio (opens in a new tab)

On This Page

  • Audio Datasets
Research
Audio Datasets

Audio Datasets

GTZAN Genre Dataset https://datasets.activeloop.ai/docs/ml/datasets/gtzan-genre-dataset/ (opens in a new tab)

Audio datasets https://towardsdatascience.com/40-open-source-audio-datasets-for-ml-59dc39d48f06 (opens in a new tab)

Image dataset https://www.cityscapes-dataset.com/dataset-overview/ (opens in a new tab)

Video Datasets YouTube-8M Segments Dataset https://research.google.com/youtube8m/ (opens in a new tab)

References

MLVC 2023 - This website is an online documentation of Antonis kalagkatsis' MSc thesis in the National and Kapodistrian University of Athens, Department of Communication and Media Studies, Digital Communication Media and Interaction Environments