Auxiliary Web Scraper For The Database Application For Tunes

This script populates an instance of the database used in The Database Application For Tunes. It queries the Spotify API for metadata and uses the Invidious API to provide the audio source(s) for hashing. To speed up parsing the metadata into the appropriate form, multiprocessing routines are employed.
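As a rough illustration of the multiprocessing step, the sketch below fans a list of raw track records out to a pool of worker processes. The `parse_track` and `parse_all` helpers and the output row layout are hypothetical, and the input fields are assumed to follow the shape of a Spotify track object; the actual script may organise this differently.

```python
from multiprocessing import Pool


def parse_track(raw: dict) -> dict:
    """Hypothetical helper: flatten one raw metadata record into a database row."""
    return {
        "title": raw["name"].strip(),
        "artist": raw["artists"][0]["name"],
        "duration_s": raw["duration_ms"] // 1000,  # whole seconds
    }


def parse_all(raw_tracks: list[dict], workers: int = 4) -> list[dict]:
    """Parse records in parallel; Pool.map returns results in input order."""
    with Pool(processes=workers) as pool:
        return pool.map(parse_track, raw_tracks)
```

When trying this out, call `parse_all(...)` from under an `if __name__ == "__main__":` guard, since multiprocessing needs it on platforms that spawn worker processes rather than fork them.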

Installing Dependencies

While there is more than one way to skin this cat, we recommend the Mamba project for a quick and frictionless experience. After installing Miniforge, use the included enviroment.yaml file to create the Python environment with the following command:

mamba env create --file enviroment.yaml

If that completes without errors, run:

mamba activate daft_scraper

And you should be good to go!

Usage

Begin by cloning the repository to your local machine.

Spotify Credentials

Before running the script, you need to provide valid Spotify application credentials in order to make calls to the Spotify API:

Begin by creating a Spotify Developer account.
Once you have logged in, go to the Developer Dashboard and create a new application. Be sure to select the tickbox labeled "Web API".
Wait a bit for your application to be approved. Once that happens, open the application settings. There you should be able to see both your Client ID and your Client Secret.

Finally, create a .env file in the project repository structured as follows:

spotify_id=[REPLACE EVERYTHING AFTER EQUAL SIGN WITH CLIENT ID]
spotify_secret=[REPLACE EVERYTHING AFTER EQUAL SIGN WITH CLIENT SECRET]
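For orientation, here is a minimal sketch of how a file in this format could be read and turned into the HTTP Basic header that Spotify's client-credentials token request expects. The `parse_env` and `basic_auth_header` names are hypothetical, and the script itself may well rely on a library such as python-dotenv instead of a hand-rolled parser.

```python
import base64


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")  # split on the first '=' only
        values[key.strip()] = value.strip()
    return values


def basic_auth_header(client_id: str, client_secret: str) -> dict[str, str]:
    """Build the Authorization header: Basic base64("id:secret")."""
    token = base64.b64encode(f"{client_id}:{client_secret}".encode()).decode()
    return {"Authorization": f"Basic {token}"}
```

Under these assumptions, reading the file's text and calling `basic_auth_header(env["spotify_id"], env["spotify_secret"])` yields the header to send along with `grant_type=client_credentials` when requesting an access token.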

Running the Script

The script can be run with the following command:

python web_scraper.py [PATH TO GENRE LIST]

The genre list is a text file containing the genres to query, one genre per line. For each genre, the discographies of the top 50 artists will be compiled.
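The expected file format can be sketched with a small parser like the one below. The `parse_genres` name is hypothetical, and trimming whitespace, skipping blank lines, and dropping repeated genres are assumptions about sensible behaviour rather than something the script is known to do.

```python
def parse_genres(text: str) -> list[str]:
    """Return one genre per non-empty line, trimmed, keeping first occurrences."""
    stripped = (line.strip() for line in text.splitlines())
    # dict.fromkeys drops duplicates while preserving the original order
    return list(dict.fromkeys(genre for genre in stripped if genre))
```

A genre.txt containing the lines `rock`, `jazz`, and `hip hop` would, under these assumptions, produce a three-element list with multi-word genres kept intact.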

As a final note, while the script is running, the project directory should look something like the following:

.
├ output/
├ temp/
├ .env
├ .gitignore
├ enviroment.yaml
├ genre.txt
├ README.md
└ web_scraper.py

me11203sci/Database-Application-For-Tunes-Webscraper
