test less cases of utf-8 validation, to avoid taking forever