some questions #4

Open
sshzhang opened this issue Aug 9, 2018 · 12 comments

Comments

@sshzhang

sshzhang commented Aug 9, 2018

Can you explain the meaning of the dataset? I am a little confused.

@Leavingseason
Owner

Sorry for the late response. Did you mean the data format? It's the field-wise format, in the form of FieldID:featureID:featureValue. A field is a group of features, such as gender, location, occupation, etc.
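
For readers who want to see the field-wise format in code, here is a minimal sketch of a parser for `FieldID:featureID:featureValue` tokens. The names `Feature`, `parse_feature`, and `parse_line` are hypothetical, and the assumption that each line starts with a label followed by space-separated tokens is not confirmed in this thread; check the sample data in the repository before relying on it.

```python
from typing import List, NamedTuple, Tuple


class Feature(NamedTuple):
    field_id: int     # which field (group of features) this entry belongs to
    feature_id: int   # which feature inside the overall feature space
    value: float      # the feature value


def parse_feature(token: str) -> Feature:
    """Parse one 'FieldID:featureID:featureValue' token."""
    field_id, feature_id, value = token.split(":")
    return Feature(int(field_id), int(feature_id), float(value))


def parse_line(line: str) -> Tuple[float, List[Feature]]:
    """Parse a line ASSUMED to look like '<label> f:i:v f:i:v ...'."""
    parts = line.split()
    label = float(parts[0])  # assumption: the first column is the label
    return label, [parse_feature(tok) for tok in parts[1:]]


# Made-up example line, not taken from the real dataset.
label, feats = parse_line("1 1:5:1 2:17:1 3:42:0.5")
```

Here field 1 could be gender, field 2 location, and field 3 occupation, with each feature ID picking out one concrete value inside that field.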

@sshzhang
Author

Thank you very much!

@sshzhang
Author

sshzhang commented Sep 1, 2018

After reading the article in detail, I have some further questions. Can you explain how you preprocessed the Criteo dataset for your experiments? I want to run the CIN model on the Criteo dataset, but I don't know how to preprocess it. Another question: is the dataset in this repository artificial? Thank you!

@Leavingseason
Owner

The Criteo dataset is frequently used by research groups. I don't think there are many ways to preprocess it: just transform the numerical values into categorical values (by log 2) and filter out low-frequency categorical values. You can leave an email address and I will send you our scripts. The sample dataset on GitHub is a real-world dataset.
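
The two preprocessing steps mentioned above can be sketched roughly as follows. This is not the owner's script. It assumes that "by log 2" means a base-2 logarithm bucket, `floor(log2(v + 1))`, that negative values are clamped to zero, and that rare categories are replaced by a shared placeholder rather than dropped. The function names, the `min_count` threshold, and the `<rare>` token are all hypothetical.

```python
import math
from collections import Counter
from typing import List


def bucketize(value: float) -> int:
    """Turn a numeric value into a categorical bucket via floor(log2(v + 1)).

    Assumption: negative values are clamped to 0 before the transform.
    """
    return int(math.floor(math.log2(max(value, 0.0) + 1.0)))


def filter_rare(values: List[str], min_count: int,
                rare_token: str = "<rare>") -> List[str]:
    """Replace categories seen fewer than min_count times with rare_token."""
    counts = Counter(values)
    return [v if counts[v] >= min_count else rare_token for v in values]


# Made-up values for illustration only.
buckets = [bucketize(v) for v in [0, 1, 7, 1000, -3]]
cleaned = filter_rare(["a", "a", "b", "c", "a"], min_count=2)
```

After both steps every column is categorical, so each distinct bucket or category can be assigned a feature ID within its field.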

@sshzhang
Author

sshzhang commented Sep 3, 2018

Thanks! Here is my email [email protected]

@cowry5

cowry5 commented Sep 23, 2018

Hi, I have the same problem now. Would you email me the scripts? Thanks!
Here is my email [email protected]

@anzhizh

anzhizh commented Oct 14, 2018

I also need the scripts. Also, is the Criteo dataset used in your experiments 45 MB in size?
Thank you very much! Here is my email [email protected]

@Leavingseason
Owner

Since the missing Criteo script seems to be blocking a few readers, I have uploaded the script to the codebase.

@Decalogue

Decalogue commented Feb 19, 2019

Happy Lantern Festival! I also need the scripts, and I would like to know whether TensorFlow 1.12.0 is OK. Thanks!
Here is my email [email protected]

@wanesta

wanesta commented Jun 4, 2019

Thank you very much! [email protected]

@wenjuanxu

Hi, I have the same problem now. Would you email me the scripts? Thanks!
Here is my email [email protected]

@ccfccl

ccfccl commented Sep 5, 2019

Hello, I want to know how the MovieLens dataset is processed to meet the required format. Thank you.
