You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think as stated in their paper they use the dataset in this (it's not released when they publish their paper, so they might reproduce the data collection) paper: https://huggingface.co/datasets/shailja/Verilog_GitHub ... which might be from you? XD
There is some processing and filtering though ... I can not get the number of samples after the processing to 8502 as stated in their paper, which maybe due to the difference in raw data and pre-processing steps.
Could you also please share the training data used in fine-tuning codegen models?
The text was updated successfully, but these errors were encountered: