POV-Ray: Newsgroups: povray.off-topic: Randomly-generated "Finnish" text: Re: Randomly-generated "Finnish" text

POV-Ray : Newsgroups : povray.off-topic : Randomly-generated "Finnish" text : Re: Randomly-generated "Finnish" text		Server Time 2 Jul 2025 11:24:51 EDT (-0400)

From: John VanSickle
Date: 5 Jun 2008 07:53:02
Message: <4847d39e@news.povray.org>

Orchid XP v8 wrote:
> http://blog.plover.com/lang/finnpar.html
> 
> I've never actually seen any text written in Finnish, so... does this 
> look even remotely plausible?
> 
Scientific American had an article on this in the late 1980's.  The 
author first tried making random text by selecting letters, based on 
their frequency in the English language.  The longest English word found 
in the entire result set was the word "rare".

The author then tried to generate text based on letter pairs; the 
algorithm, having one letter, searched for that letter and randomly 
selected the letter to follow based on the frequency of letters that 
followed in the source text.  This produced results that looked a lot 
like English.

He then tried using longer strings of letters, by matching pairs of 
letters and choosing the next letter based on the letters following the 
pair to be matched.  The results were much better this time; many of the 
resulting words were actual English words, and the ones that weren't 
could be taken for words with which the reader was not unfamiliar.

Setting the match string to a longer length made it more likely that the 
  random text would consists of lengthy passages from the source text.

He then tried a different tack; instead of matching letters, he decided 
to match whole words.  A word to follow the current word was randomly 
chosen from all of the words that follow the current word.  This 
produced interesting results as well.

I had fun using AmigaBASIC to test out these ideas.

Regards,
John

Post a reply to this message