Cleans the raw text extracted from the picture and keeps only the valuable part of it.
The cleaned string, free of junk and ready for validation.
The raw string from which to extract the valuable part.
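
The description does not say what counts as the "valuable part" or as "junk", so the following is only a minimal sketch under stated assumptions: junk is taken to be any character that is not a letter, digit, or whitespace, and the valuable part is taken to be the longest remaining token. The function name `clean_extracted_text` is hypothetical and is not taken from the original code.

```python
import re


def clean_extracted_text(raw: str) -> str:
    """Hypothetical cleaner for text extracted from a picture.

    Assumption: the valuable part is the longest alphanumeric token.
    """
    # Replace every character that is not a letter, digit, or whitespace
    # with a space, so that junk also separates tokens.
    without_junk = re.sub(r"[^A-Za-z0-9\s]", " ", raw)
    # Split on runs of whitespace to get the candidate tokens.
    tokens = without_junk.split()
    if not tokens:
        # Nothing usable was found in the input.
        return ""
    # Assumption: the longest token is the one worth validating.
    return max(tokens, key=len)
```

Called as `clean_extracted_text("id: x9y8z7w6 ;;")`, this sketch discards the punctuation and the short `id` token and keeps `x9y8z7w6`. A real implementation would likely use a pattern specific to the expected value rather than token length.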