I'm running a data import using @bentael 's importer.
It looks like posts that have escape sequences render correctly as part of a posts content, but display the literal escape sequence in the post title. Screenshot:
And looking at the data via redis on the backend:
127.0.0.1:6379> hget topic:3369 title
"Исчезли деньги ?!"
127.0.0.1:6379> hget post:3369 content
"Всем привет, буквально 5.06 поменял реквизиты с помощью своего мененжера, сегодня захожу в dashboard, а на баллансе 2 доллара, было до этого почти 50 долларов, в разделе Royalties нету выплат, где тогда деньги?"
127.0.0.1:6379>
It looks like the content and title are being encoded the same way, but for some reason it renders properly in the post content but not the title.
I also tried copying and pasting some of the properly rendered characters as the title of a new post. This appears to work but produces a different encoding within the database:
127.0.0.1:6379> hget topic:3472 title
"\xd0\x92\xd1\x81\xd0\xb5\xd0\xbc \xd0\xbf\xd1\x80\xd0\xb8\xd0\xb2\xd0\xb5\xd1\x82"
127.0.0.1:6379>
Any suggestions? I suppose I could try to perform the conversion between encodings during the import... any idea the specific names of these two encoding types?