[13th ToBig's Conference] OK Mugle! - From Genre to Melody, Content-based Music Recommendation

Last update: Oct 09, 2022

Related tags

Deep Learning OkMugle

Overview

Ok Mugle! 🎵

From Genre to Melody, Content-based Music Recommendation

'Ok Mugle!' is a music recommendation project presented at the 13th ToBig's Conference (2022.01.15).

Description 📖

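This project uses the Melon Playlist Continuation data provided by Kakao Arena to recommend songs similar to the song a user searches for.

[Model] 'Similarity' is defined in terms of melody, mood, situation, genre, and so on
- Models reflecting these factors were built, including Music2Vec, Time Convolutional AutoEncoder, and CosineEmbeddingLoss Multimodal
[Retrieval] Retrieval is built on the cosine similarity between embeddings
[Ranking] Several ranking methods are applied → the recommendation results are ensembled
[Serving] Finally, the recommendations from the Score Total Top 10 ranking method are used to build the web app and serve the model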
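
As an illustration of the [Retrieval] step above, here is a minimal sketch of top-k retrieval by cosine similarity. It is not the project's code: the function names `cosine_similarity` and `retrieve_top_k`, the dictionary layout of the embeddings, and the toy vectors are all assumptions made for this example, and it uses only the Python standard library.

```python
import heapq
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention for zero vectors (an assumption of this sketch)
    return dot / (norm_a * norm_b)


def retrieve_top_k(query_id, embeddings, k=50):
    """Return the k songs most similar to query_id as (song_id, score) pairs."""
    query_vec = embeddings[query_id]
    scored = (
        (song_id, cosine_similarity(query_vec, vec))
        for song_id, vec in embeddings.items()
        if song_id != query_id  # never recommend the query song itself
    )
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Toy embeddings (made-up values, not project data).
toy_embeddings = {
    "song_a": [1.0, 0.0],
    "song_b": [0.9, 0.1],
    "song_c": [0.0, 1.0],
    "song_d": [-1.0, 0.0],
}
top2 = retrieve_top_k("song_a", toy_embeddings, k=2)
print([song_id for song_id, _ in top2])  # ['song_b', 'song_c']
```

With real embeddings, a vectorized implementation over a matrix of all songs would likely be much faster; the loop form is used here only to keep the idea visible.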

Usage ✔️

Run the following commands in a Windows shell.

set FLASK_APP=server
set FLASK_ENV=development
flask run

Result (Web) 💻

Go to the web app (ToBigs 13th Conference Music Recommendation)
Web main screen

Result screen after searching for '비투비 - 비밀 (Insane) (Acoustic Ver.)' in the search box

Presentation 🙋

The conference presentation video and report. For details of the analysis, please see the links below!

Contributor 🧑‍🤝‍🧑

Members of ToBig's, a leading intercollegiate club for big data analysis and artificial intelligence, took part in this project.

Cohort	Name
15th	이성범
16th	김권호
16th	박한나
16th	이승주
16th	이예림
16th	주지훈
7th	이광록 (Mentor)

File Directory 📂

Ok Mugle!
├── 1. preprocessig
│   ├── make_song_meta_and_playlist.ipynb       # Preprocess song and playlist data
│   ├── make_mel_data.ipynb                     # Preprocess mel data
│   └── make_mel_batch_data.ipynb               # Preprocess mel data in batches
│
├── 2. model
│   ├── genre_embedding_model.ipynb             # Music2Vec
│   ├── mel_embedding_model.ipynb               # Time Convolutional Autoencoder
│   └── genre_and_mel_embedding_model.ipynb     # CosineEmbeddingLoss Multimodal
│
├── 3. embedding-visualization
│   └── embedding_visualization_tsne.ipynb      # t-SNE visualization of each embedding
│
├── 4. ranking
│   ├── make_ranking_data_preprocessig.ipynb    # Build a Top-50 cosine-similarity dataset for each embedding
│   ├── make_ranking_data_multiprocessig.py     # Functions used for multiprocessing by make_ranking_data_preprocessig
│   ├── make_ranking_data.ipynb                 # Rank-weighted ranking, Top-3 ranking per embedding
│   └── cos_sim_music_serving.ipynb             # Results for each embedding and ranking method
│
└── 5. web
    ├── crawling                                # Data collection for the result page
    │   └── melon_crawling.py
    │
    ├── data                                    # Data used to build the web app
    │    ├── ranking_song_id2playlist.json
    │    ├── song_id2artist_name_basket.json
    │    ├── song_id2song_name.json
    │    └── song_name_artist_name2song_id.json
    │ 
    ├── static                                  # css, fonts, images, and js used to build the web app
    │    ├── css
    │    ├── fonts
    │    ├── images
    │    └── js
    │ 
    ├── templates                               # Front end
    │    ├── about.html
    │    ├── index.html
    │    ├── people.html
    │    └── result.html
    │ 
    ├── server.py                               # Back end
    │
    └── requirements.txt                        # Required packages
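
The rank-weighted ranking noted for make_ranking_data.ipynb could be sketched roughly as follows. This is only an illustration under stated assumptions: the linear weight (a candidate at 0-based position r in a list of length n earns n - r points), the function name `rank_weighted_ensemble`, the embedding labels, and the toy song ids are hypothetical and are not taken from the notebook.

```python
from collections import defaultdict


def rank_weighted_ensemble(ranked_lists, top_n=10):
    """Merge several ranked candidate lists into one list by summed rank weights.

    ranked_lists maps an embedding name to its candidates, best first.
    A candidate at 0-based position r in a list of length n earns n - r points
    (a linear weighting chosen for this sketch, not the project's scheme).
    """
    scores = defaultdict(float)
    for candidates in ranked_lists.values():
        n = len(candidates)
        for rank, song_id in enumerate(candidates):
            scores[song_id] += n - rank
    # Sort by total score (descending); break ties by song id for determinism.
    ordered = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [song_id for song_id, _ in ordered[:top_n]]


# Toy candidate lists per embedding (made-up ids, not project data).
toy_lists = {
    "genre": ["s1", "s2", "s3"],
    "mel": ["s2", "s4", "s1"],
    "genre_and_mel": ["s2", "s1", "s5"],
}
merged = rank_weighted_ensemble(toy_lists, top_n=3)
print(merged)  # ['s2', 's1', 's4']
```

Songs that appear near the top of several lists accumulate the most points, so the merged list favors candidates that multiple embeddings agree on.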

Owner

SeongBeomLEE
