No to install repo error message when trying to install and insufficient guidance on preparing inputs for the codes... #1

sjscotti opened this issue Apr 29, 2022 · 4 comments


I am very interested in trying out your repo, but I have had trouble installing it on my Windows PC.
I created an anaconda environment and installed all the dependencies in the requirements.txt file.
I cloned your repo using the command:

git clone

Changed directories using:

cd citlab-article-separation-new

But when I tried to install it in my environment, using:

pip install -e .

I got the error message:

ERROR: File "" not found. Directory cannot be installed in editable mode: D:\citlab-article-separation-new

Is the file missing, or is there another way the repo should be installed in my environment?
Thanks in advance!
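A note on the error above: pip's editable mode needs a build definition (`setup.py` or `pyproject.toml`) in the directory being installed. The sketch below only checks whether a checkout has one. `find_build_files` is a hypothetical helper, not part of the repo, and the PYTHONPATH fallback it prints is a general workaround rather than the maintainers' recommendation.

```python
from pathlib import Path

# Build-definition files that pip looks for when asked to do
# "pip install -e ." in a directory.
BUILD_FILES = ("setup.py", "pyproject.toml")


def find_build_files(repo_dir):
    """Return the build-definition files present in repo_dir (hypothetical helper)."""
    root = Path(repo_dir)
    return [name for name in BUILD_FILES if (root / name).is_file()]


if __name__ == "__main__":
    found = find_build_files(".")
    if found:
        print("Editable install should be possible; found:", ", ".join(found))
    else:
        # Assumption: without a build file, the package can still be imported
        # by putting the repo root on PYTHONPATH.
        print("No setup.py or pyproject.toml here; try adding the repo root to PYTHONPATH.")
```

Run it from the cloned directory before calling `pip install -e .`.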

sjscotti commented May 2, 2022

I was able to run some of the python codes by putting the article_separation folder from the repo in the site-packages folder for the environment I set up. The for article separation seems to work well for my sample newspaper images. However, it was unclear how to prepare inputs to run the Text Block Detection python scripts. The only information in the for the repo was that a PAGE-XML file is needed, but there was no guidance on what specific contents are required for this file and how to obtain them. Is there more detailed guidance on how to go from an image of a newspaper page to obtaining text blocks and to progress on to the subsequent stages of the process?
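For anyone with the same question, here is a minimal sketch for inspecting what a given PAGE-XML file contains, using only the standard library. It does not say what the Text Block Detection scripts require, since the thread does not state that. It only counts the common layout elements (`TextRegion`, `TextLine`, `Baseline`) so you can see what your file provides. The sample document and the function names are illustrative assumptions, and the namespace shown is one of several published PAGE schema versions.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def summarize_page_xml(xml_text):
    """Return (image filename, element counts) for a PAGE-XML document."""
    root = ET.fromstring(xml_text)
    counts = {"TextRegion": 0, "TextLine": 0, "Baseline": 0}
    image = None
    for elem in root.iter():
        name = local_name(elem.tag)
        if name == "Page":
            image = elem.get("imageFilename")
        elif name in counts:
            counts[name] += 1
    return image, counts


# Tiny hand-written sample, for illustration only.
SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="page_0001.jpg" imageWidth="2000" imageHeight="3000">
    <TextRegion id="r1">
      <Coords points="100,100 900,100 900,400 100,400"/>
      <TextLine id="r1l1">
        <Coords points="100,100 900,100 900,150 100,150"/>
        <Baseline points="100,140 900,140"/>
        <TextEquiv><Unicode>Example headline</Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""

if __name__ == "__main__":
    image, counts = summarize_page_xml(SAMPLE)
    print(image, counts)
```

To check a real file, read it with `open(path, encoding="utf-8").read()` and pass the text to `summarize_page_xml`.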

@sjscotti sjscotti changed the title No to install repo error message when trying to install No to install repo error message when trying to install and insufficient guidance on preparing inputs for the codes... May 2, 2022
Hi @SJScott, how did you set up the environment? Did you end up using tensorflow-gpu?

It has been quite a while since I ran the article separation, but I think these are the packages in the environment...

oforomar commented Jun 20, 2023

Here is my env too:

pip list

absl-py 1.4.0
affine 2.4.0
astor 0.8.1
attrs 23.1.0
certifi 2022.12.7
click 8.1.3
click-plugins 1.1.1
cligj 0.7.2
cssutils 2.7.1
cycler 0.11.0
et-xmlfile 1.1.0
fonttools 4.38.0
gast 0.5.4
google-pasta 0.2.0
grpcio 1.54.2
h5py 3.8.0
importlib-metadata 6.7.0
joblib 1.2.0
JPype1 1.4.1
Keras-Applications 1.0.8
Keras-Preprocessing 1.1.2
kiwisolver 1.4.4
lxml 4.9.2
Markdown 3.4.3
MarkupSafe 2.1.3
matplotlib 3.5.3
numpy 1.21.6
opencv-python
openpyxl 3.0.10
packaging 23.1
Pillow 9.5.0
pip 22.3.1
protobuf 3.20.3
pyparsing 3.1.0
python-dateutil 2.8.2
rasterio 1.2.10
scikit-learn 1.0.2
scipy 1.7.3
setuptools 65.6.3
Shapely 1.7.0
six 1.16.0
snuggs 1.4.7
tensorboard 1.14.0
tensorflow 1.14.0
tensorflow-estimator 1.14.0
termcolor 2.3.0
threadpoolctl 3.1.0
tqdm 4.65.0
typing_extensions 4.6.3
Werkzeug 2.2.3
wheel 0.38.4
wrapt 1.15.0
zipp 3.15.0

I also copied the article segmentation directory to site-packages, and I tried to run it using:

python article_separation/image_segmentation/net_post_processing/ \
    --path_to_image_list "list.txt" \
    --path_to_pb "nets/separator_detection_net.pb" \
    --mode "separator" --num_processes 1

The list.txt file contains only one line:

But I keep getting this error:
/home/ai/miniconda3/envs/news/lib/python3.7/site-packages/tensorboard/compat/tensorflow_stub/ FutureWarning: Passing (type, 1) or '1type' as a synonym of type is deprecated; in a future version of numpy, it will be understood as (type, (1,)) / '(1,)type'.
_np_qint16 = np.dtype([("qint16", np.int16, 1)])
/home/ai/miniconda3/envs/news/lib/python3.7/site-packages/tensorboard/compat/tensorflow_stub/ FutureWarning: Passing (type, 1) or '1type' as a synonym of type is deprecated; in a future version of numpy, it will be understood as (type, (1,)) / '(1,)type'.
_np_quint16 = np.dtype([("quint16", np.uint16, 1)])
/home/ai/miniconda3/envs/news/lib/python3.7/site-packages/tensorboard/compat/tensorflow_stub/ FutureWarning: Passing (type, 1) or '1type' as a synonym of type is deprecated; in a future version of numpy, it will be understood as (type, (1,)) / '(1,)type'.
_np_qint32 = np.dtype([("qint32", np.int32, 1)])
/home/ai/miniconda3/envs/news/lib/python3.7/site-packages/tensorboard/compat/tensorflow_stub/ FutureWarning: Passing (type, 1) or '1type' as a synonym of type is deprecated; in a future version of numpy, it will be understood as (type, (1,)) / '(1,)type'.
np_resource = np.dtype([("resource", np.ubyte, 1)])
WARNING:tensorflow:From /home/ai/miniconda3/envs/news/lib/python3.7/site-packages/article_separation/image_segmentation/net_post_processing/ The name tf.gfile.GFile is deprecated. Please use instead.

WARNING:tensorflow:From /home/ai/miniconda3/envs/news/lib/python3.7/site-packages/article_separation/image_segmentation/net_post_processing/ The name tf.GraphDef is deprecated. Please use tf.compat.v1.GraphDef instead.

WARNING:tensorflow:From /home/ai/miniconda3/envs/news/lib/python3.7/site-packages/article_separation/image_segmentation/net_post_processing/ calling import_graph_def (from tensorflow.python.framework.importer) with op_dict is deprecated and will be removed in a future version.
Instructions for updating:
Please file an issue at if you depend on this feature.
WARNING:tensorflow:From /home/ai/miniconda3/envs/news/lib/python3.7/site-packages/article_separation/image_segmentation/net_post_processing/ The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead.

WARNING:tensorflow:From /home/ai/miniconda3/envs/news/lib/python3.7/site-packages/article_separation/image_segmentation/net_post_processing/ The name tf.Session is deprecated. Please use tf.compat.v1.Session instead.

2023-06-21 01:58:54.747478: I tensorflow/core/platform/] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2023-06-21 01:58:54.767670: I tensorflow/core/platform/profile_utils/] CPU Frequency: 3299990000 Hz
2023-06-21 01:58:54.768067: I tensorflow/compiler/xla/service/] XLA service 0x13946b0 executing computations on platform Host. Devices:
2023-06-21 01:58:54.768084: I tensorflow/compiler/xla/service/] StreamExecutor device (0): ,
2023-06-21 01:58:54.848930: W tensorflow/compiler/jit/] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile.

Do you remember any other steps you took before running it?
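A note on the output above: the lines shown are NumPy FutureWarnings, TensorFlow deprecation warnings, and I/W-level runtime log lines. No traceback appears in what was posted, so if the script produces no results the cause may lie elsewhere. The sketch below quiets this noise so that a real error, if there is one, is easier to spot. It assumes TensorFlow 1.x, where Python-side messages should go through the `tensorflow` logger. `quiet_tensorflow_logs` is a hypothetical helper and must be called before TensorFlow is imported.

```python
import logging
import os
import warnings


def quiet_tensorflow_logs():
    """Reduce log noise; call this before importing tensorflow (hypothetical helper)."""
    # C++ runtime logs: 0 = all, 1 = hide INFO, 2 = hide INFO and WARNING,
    # 3 = also hide ERROR.
    os.environ["TF_CPP_MIN_LOG_LEVEL"] = "2"
    # Hide the NumPy dtype FutureWarning lines raised during import.
    warnings.filterwarnings("ignore", category=FutureWarning)
    # Assumption: TF 1.x emits its "WARNING:tensorflow:..." deprecation
    # messages through the standard logger named "tensorflow".
    logging.getLogger("tensorflow").setLevel(logging.ERROR)


if __name__ == "__main__":
    quiet_tensorflow_logs()
    # import tensorflow as tf  # import only after the call above
    print("TF_CPP_MIN_LOG_LEVEL =", os.environ["TF_CPP_MIN_LOG_LEVEL"])
```

One way to use it is to place the call at the top of a small wrapper script that then imports and runs the repo's entry point, so the repo's own files stay unmodified.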

