dégradé
ecornamu committed Dec 20, 2024
1 parent 70ad4a8 commit 6c2238f
Showing 1 changed file with 2 additions and 3 deletions.
5 changes: 2 additions & 3 deletions README.md
@@ -58,9 +58,8 @@ We tried another approach to detect unusual trends in name counts following a ke

### Name detection

-To identify the main characters in our movies, we processed the plot_summaries.txt file, which contains plot summaries for 42,306 movies extracted from English-language Wikipedia. Each entry in the file follows a consistent structure:
-
-Wikipedia ID \t Plot Summary \n
+To identify the main characters in our movies, we processed the plot_summaries.txt file, which contains plot summaries for 42,306 movies extracted from English-language Wikipedia.
+Each line in the file represents one movie, with its Wikipedia ID and plot summary separated by a tab character.

Using this format, we extracted both the Wikipedia ID and the plot summary, linking each movie’s name to its corresponding Wikipedia ID and release year.
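
The parsing step described above could be sketched as follows. This is a minimal illustration, not the repository's actual code: the function name `parse_plot_summaries` and the sample lines are hypothetical, and it assumes that every valid line holds an integer Wikipedia ID, a tab, and then the summary.

```python
from typing import Dict, Iterable


def parse_plot_summaries(lines: Iterable[str]) -> Dict[int, str]:
    """Map each Wikipedia ID to its plot summary (hypothetical helper)."""
    summaries: Dict[int, str] = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        # Split on the first tab only, in case a summary contains tabs itself.
        wiki_id, sep, summary = line.partition("\t")
        if not sep:
            continue  # no tab found: malformed line, skipped in this sketch
        summaries[int(wiki_id)] = summary
    return summaries


# Made-up sample lines in the same "ID<TAB>summary" layout.
sample = [
    "101\tAlice travels to Paris and meets Bob.\n",
    "202\tA detective investigates a theft.\n",
]
summaries = parse_plot_summaries(sample)
```

Because the function accepts any iterable of lines, an open file handle for plot_summaries.txt could be passed in directly. The resulting dictionary, keyed by Wikipedia ID, can then be joined with movie names and release years from the metadata.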
