Python Script Hanging During NLTK Data Download on TrueNAS SCALE

Strickland · June 30, 2024, 1:18pm

I’m running my own Python application on my TrueNAS SCALE system. The script processes YouTube audio by downloading it, transcribing it, and correcting the text. It uses the Whisper and TextBlob libraries, and it watches a directory on my laptop for new CSV files containing YouTube URLs.

The issue is that the script hangs while downloading the NLTK corpora. It begins the process but doesn’t proceed beyond downloading and unzipping the initial datasets. Although I have sufficient resources (47GB RAM with 5GB spare, over 1TB of HDD space, and the script uses 12% CPU), the application remains unresponsive at this stage.
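For anyone trying to narrow down a similar hang, here is a minimal sketch (not the actual script) of one way to tell whether the download step itself is stalling. It wraps a call in a timeout so a stuck step is reported instead of blocking forever. The names `run_with_timeout` and `download_corpora` are hypothetical helpers, and the corpus names in the usage comment are assumptions, since the exact corpora the application requests are not listed here.

```python
import threading


def run_with_timeout(func, timeout_s, *args, **kwargs):
    """Run func in a daemon thread and wait at most timeout_s seconds.

    Returns (finished, value): finished is False if func was still running
    when the timeout expired; otherwise value is func's return value, or
    the exception it raised.
    """
    outcome = {}

    def target():
        try:
            outcome["result"] = func(*args, **kwargs)
        except Exception as exc:  # report the failure to the caller
            outcome["error"] = exc

    worker = threading.Thread(target=target, daemon=True)
    worker.start()
    worker.join(timeout_s)
    if worker.is_alive():
        return False, None  # still running: likely stuck or very slow
    if "error" in outcome:
        return True, outcome["error"]
    return True, outcome.get("result")


def download_corpora(names):
    """Hypothetical helper: download each named NLTK package in turn."""
    import nltk  # imported lazily so the helper above works without NLTK

    # nltk.download returns True on success and False on failure.
    return {name: nltk.download(name, quiet=True) for name in names}


# Example usage (corpus names are assumptions, adjust to your application):
#   finished, value = run_with_timeout(download_corpora, 120, ["punkt", "brown"])
#   if not finished:
#       print("NLTK download did not finish within 120 seconds")
```

If the call times out here as well, the stall is probably in the download itself (for example network access from the TrueNAS SCALE environment) rather than in the rest of the pipeline. Note that the daemon thread is not cancelled on timeout; it is simply abandoned, which is acceptable for a diagnostic run but not for production code.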

Any insights or suggestions to resolve this would be greatly appreciated!

Strickland · July 6, 2024, 2:41pm

I solved it