Which of the following is *not* a mapping rule for tokens Canonicalization? 1) Removing characters such as hyphen, periods and accents. 2) Reducing all letters to lower case (case-folding) 3) Translating words from other languages to standard English. 4) Expands abbreviations into their full form (In4matx → informatics).