Scrape all the bulletins #1

Open
pandres opened this issue Mar 18, 2017 · 2 comments

Comments


pandres commented Mar 18, 2017

Scrape all the bulletins and load them into the DB.

Bonus points: a script that accepts a list of URLs.

Collaborator

mgaitan commented Mar 20, 2017

Although it doesn't scrape the URLs yet, the command to import the text into the DB already exists:

python manage.py importar_seccion <url_al_pdf>  

It accepts multiple URLs.
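
For the "bonus points" script, a minimal sketch might look like the following. It assumes the URLs live in a plain text file, one per line, and that `manage.py` is in the current directory; the helper names `read_urls` and `build_command` are hypothetical and not part of the repo. Since the command accepts multiple URLs, everything goes into a single call.

```python
import subprocess
import sys


def read_urls(lines):
    """Return the stripped, non-empty lines that are not '#' comments."""
    urls = []
    for line in lines:
        line = line.strip()
        if line and not line.startswith("#"):
            urls.append(line)
    return urls


def build_command(urls):
    """Build the argv for one importar_seccion call covering all URLs."""
    return [sys.executable, "manage.py", "importar_seccion", *urls]


# Only runs when a file path is given, e.g.: python import_list.py urls.txt
if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], encoding="utf-8") as fh:
        urls = read_urls(fh)
    if urls:
        # check=True raises CalledProcessError if the import command fails
        subprocess.run(build_command(urls), check=True)
```

Blank lines and lines starting with `#` are skipped, so the list file can carry notes about which bulletins were already imported.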

Member

pdelboca commented Mar 20, 2017

The idea is to migrate part of the functionality we have in the other repo into this one, so that we end up with a single repo (this one). Concretely, we need to bring over the script that collects all the URLs and integrate it with the command that imports the text into the DB.

We can create a new command that obtains all the URLs, and then pass those URLs to this command so that it imports all the texts.
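
As a rough sketch of the URL-collecting half, assuming the bulletins are linked as PDF files from an HTML index page (the thread does not say how the existing script in the other repo finds them), the links could be pulled out with the standard library alone. `PdfLinkParser` and `extract_pdf_urls` are hypothetical names, and the page layout is an assumption.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkParser(HTMLParser):
    """Collect absolute URLs of <a href="..."> links that end in .pdf."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href and href.lower().endswith(".pdf"):
            # Resolve relative links against the page URL
            url = urljoin(self.base_url, href)
            if url not in self.urls:  # keep first occurrence, drop repeats
                self.urls.append(url)


def extract_pdf_urls(html, base_url):
    """Return the de-duplicated PDF URLs found in html, in page order."""
    parser = PdfLinkParser(base_url)
    parser.feed(html)
    return parser.urls
```

The resulting list could then be handed to `python manage.py importar_seccion` as arguments, since that command accepts multiple URLs.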
