I’m running my own Python application on my TrueNAS SCALE, the script processes YouTube audio by downloading, transcribing, and correcting text. It uses the libraries Whisper and TextBlob, and it watches a directory on my laptop for new CSV files containing YouTube URLs.
The issue I have is the script hangs during the download of NLTK corpora. It begins the process but doesn’t proceed beyond downloading and unzipping the initial datasets. I have sufficient resources (47GB RAM with 5GB spare, over 1TB HDD space, and the script uses 12% CPU), the application remains unresponsive at this stage.
Any insights or suggestions to resolve this would be greatly appreciated!