-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Several tensorflow-related issues #14
Comments
Hello. This file should have been created by the first step of the modelling pipeline, and contains the indices of the data for the training, validation and test set. The easiest in these situations is usually to run that part of the pipeline in isolation. Once, you have activated the conda environment you can run this step with
where Try that and see if the split file is generated or if you get another error message. |
Hello, I'm encountering a new problem during the model_training step with this output: I don't understand because in the configs.py, the file is specified as : training_checkpoint: str = "keras_model.hdf5" |
This seems to be an issue with a new Tensorflow version. I will try to find where this is coming from. But you could try changing the filename in the config file and see if that helps. It doesn't have to end with "hdf5", that we used historically, but if TF now want it to end with ".keras" you can change the config file. |
It works with ".keras" instead of ".hdf5" |
Glad you got it working. It is clear that the Tensorflow code needs to be updated to support the latest version, so thank you for your patience and for reporting these issues. |
Thank you for your prompt reply. I'm encountering an error when converting the Keras model to ONNX. I've tried to change the way top10_acc & top50_acc are registered and haven't found a solution to load the model with keras.models load_model() |
Hard to help on this last one unless you post a more specific error trace or share the trained model. |
I am curious to what versions of You can reproduce the behavior in the pipeline with
and this worked for me with But then I realized that you changed the file format, and I am wondering if this is causing the issues. How big is your model file? You could share it via e.g. Dropbox |
You are right, the versions of tensorflow (2.16.2) & keras (3.0.5) in my environment are different from the poetry.lock file. I encountered difficulties installing tensorflow from poetry and the installation of tensorflow followed #13 |
Hello, To follow up what Jeremy wrote, I reinstalled everything, from scratch, on my machine and, with Jeremy, was able to apply the whole process from downloading the uspto data to generating the models uspto_expansion.onnx and uspto_ringbreaker_expansion.onnx. But I have some interrogations ? During the installation process I saw the following message several times : In addition, when I first used aizynthtrain.pipelines.expansion_model_pipeline I got a message about tensorflow :
Dowgrading protobuf from 5.26 to 3.20.2 solved this problem. Restarting the pipeline I got the new message :
I think the script is not able to find cuda 11 on my machine because cuda 12 is installed. In my opinion, the problems are linked to the tensorflow version. To solve my problems I had to downgrade some libraries. I've seen that another solution might be to update tensorflow, but in that case I'll probably get the same problems as Jeremy. Do you plan to integrate the latest versions of tensorflow and keras, although I suppose it could mean a lot of work to adapt your code? Regards |
Thanks for the feedback and your willingness to live with a few bumps. It has made us realize that we need to substantially re-factor the code, most likely move tensorflow models to pytorch, which are easier to maintain. |
Hello,
I'm having a problem during model training (step 5. in USAGE) for the split_data step.
A file uspto_split_indices.npz was not found. After moving from python files to python files, I had difficulty understanding how it was generated and why.
During the previous step to generate the final reaction and reaction templates files (step 4 in USAGE), I followed the instructions of reaction_selection.py in a notebook to understand the process.
Is there the same type of python files to understand the model training process (step 5)?
Thanks for your help.
The text was updated successfully, but these errors were encountered: