When I import a Stata dataset in R (using the foreign
package), the import sometimes contains characters that are not valid UTF-8
. This is unpleasant enough by itself, but it breaks everything as soon as I try to transform the object to JSON
(using the rjson
package).
How I can identify non-valid-UTF-8
-characters in a string and delete them after that?