Set encoding for actor

#1

Hi!
I am using a cheerio crawler. How can I adjust the encoding? For German “Umlaute” I get those strange ASCII symbols …
Thanks

#2

Hi, could you provide an example URL and crawler configuration?

The encoding is parsed from the Content-Type header of the HTTP response carrying the page’s HTML and cannot be configured manually.

1 Like
#3

Hi!
e.g…

https://www.nachrichten.at/nachrichten/chronik/man-liess-seinen-hund-erschiessen-laut-gericht-keine-tierquaelerei;art58,3100920

parsed headline text:

"Urteil: Hund erschie�en war keine Tierqu�lerei"

Page content type:

<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

omg strange page encoding. I will just have to convert the scraped data to utf8…

Thanks for support!