Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Setup Issues with Capreolus on Canada Compute #205

Open
jasper-xian opened this issue Apr 13, 2022 · 0 comments
Open

Setup Issues with Capreolus on Canada Compute #205

jasper-xian opened this issue Apr 13, 2022 · 0 comments

Comments

@jasper-xian
Copy link

I ran into mostly the same errors when setting up Capreolus on CC (Cedar) with setup-cc.md as this issue here. Thus, I followed the setup described in this comment within that issue.

Some of the new issues I noticed were that scipy-stack now uses python 3.8, so the python version changes to 3.8.10 when running module load scipy-stack. I also noticed that the TensorFlow version installed was 2.8.0, despite the requirements.txt listing it to be between 2.3 and 2.5.


Additionally, when running pip install tensorflow-ranking==0.4.2 --no-cache-dir I ran into the error:

ERROR: Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

This was solved by manually importing pyarrow with the commands below:

module load gcc/9.3.0 arrow python scipy-stack
python -c "import pyarrow"

Despite this, I was able to verify my installation as per these commands. However, when running

python -m capreolus.run rerank.train with file=docs/reproduction/config_msmarco.txt

to fine-tune a mini MS MARCO, I run into this set of warnings/errors:

2022-04-12 17:37:07.215423: I tensorflow/core/platform/cpu_feature_guard.cc:151] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2022-04-12 17:37:09.466046: F tensorflow/core/platform/default/env.cc:73] Check failed: ret == 0 (11 vs. 0)Thread tf_numa_-1_Eigen creation via pthread_create() failed.
Aborted

Am a bit stuck here (perhaps it is a mismatching TensorFlow version issue?), so any help would be much appreciated!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant