Friday, August 19, 2016

5000 Italian Sentences Sorted from Easiest to Hardest

Here's how this list was made:

1) I grabbed a list with the 5000 most frequently used Italian words -- sorted from the most frequently used, to the least.

2) I grabbed 60 000 translated italian sentences from the internet

3) I wrote a program that assigns a Frequency Rank Number to each word from every sentence, this Frequency Rank Number based on the list mentioned on item "1)"

4) This program calculates the average value of the all the words' Frequency Rank Numbers. And assigns this value to the sentence.

The result is that if a sentence contains advanced words, the sentence will have a high Average Frequency Rank Number. If a sentence contains only beginner words, the AFRN will be  low.

5) Finally, I sorted the sentences: from the ones with the lowest AFRN, to the highest.


The end result is that this list begins with very, very simple sentences, and new words get slowly introduced as you progress.


The best part is, an expanded version of this list(with audio!) is available for you to study on the learning software Anki:
https://ankiweb.net/shared/info/1713927804
For free, of course.

If you don't have Anki, you can download here: http://ankisrs.net/


To view in a big screen, click the "See Original" button.

7 comments:

  1. I think this is very useful, but there are a lot of identical sentences, why is that? Some other sentences are kind of alike, and one gets kind of bored (I'm sick and tired of Tom :P), but overall is a very useful resource for learning Italian, so thank you very much.

    ReplyDelete
    Replies
    1. That's why on the other decks, I filtered out the word "Tom"-- I just didn't do it for the Italian deck. (yet)

      Glad you like it~

      Delete
  2. Hi Neri,

    I like 'Tom', because it gives the sentences a more real meaning. If one prefers different names, it can be easily changed.

    Can you do Hungarian 5000 ?

    Do you wan the German nouns corrected ?

    Great job of yours here without doubt :)

    ReplyDelete
    Replies
    1. Hungarian was on my to-do list! Will try to do this week.

      Delete
    2. There it is!
      https://ankiweb.net/shared/info/1096999243
      https://ankiweb.net/shared/info/1442776235

      The reason that it took me long is that I thought there was no text-to-speech software for Hungarian. Turned out, there is. Now it's done.

      If it's helpful to you, please rate it, thanks~

      Delete
  3. Do you think would be possible to have the list for download not in Anki form?

    Thanks

    ReplyDelete
  4. Adrian,

    you have to scroll all the way up in the iframe on this page - then you will see a download raw option - just click on it.

    re Anki: after you download an Anki package, you can export the contents as text; there is an option in the Anki program that allows that. You can then 'clean' that text file by deleting html code until only the text strings of interest remain using a test processor such as sublime text 3 etc.

    ReplyDelete