-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The data link is broken #21
Comments
Hi, that's a pity. It seems the repository does not exist anymore. It was a nice place to get the data. So now, you'll need to gather the data from somewhere else. If you find links and folder structures, I might be able to help you with renaming and arranging |
I already found all the datasets, but unfortunately, none of them correspond to NPZ.≥﹏≤ |
Alright, so you got the data that is great! Now, you'll just need to prepare the
Once you have all of these arrays, you just need to save them (let me show you an example): # create the arrays
data = {
"num_words": ....,
"num_chars": ....,
"file_name": ....,
"text": ...
}
# now we save everything
with open("destination.npz", 'wb') as f:
numpy.savez_compressed(f, **data) |
We can't find the original data, so we don't know their original appearance and filename。 Therefore, we don't know how to rename and how to arrange the path.
The text was updated successfully, but these errors were encountered: