I realized that I can avoid many further steps if I can create a delimiter using regex.
In pentaho, there\'s a transformation which allows you to split columns into rows.