how to check if a string looks randomized, or human generated and pronouncable?

前端 未结 10 1624
旧巷少年郎
旧巷少年郎 2020-12-13 03:55

For the purpose of identifying [possible] bot-generated usernames.

Suppose you have a username like \"bilbomoothof\" .. it may be nonsense, but it still contains pro

10条回答
  •  孤街浪徒
    2020-12-13 04:30

    You could use a neural network to evaluate whether the nickname looks like a natural-language nickname.

    Assemble two data-sets: one of valid nicknames, and one of bogus-generated ones. Train a simple back-progating single hidden layer neural network with the character values as inputs. The neural network will learn to discriminate between strings like "zrgssgbt" and "zargbyt", since the latter has consonants and vowels intermingled .

    It is important to use real-world examples to get a good discriminator.

提交回复
热议问题