0

I am using domdocuments and using things like $div->nodeValue to obtain certain info from a web page.

On my ubuntu machine when i do php crawl.php everything is displayed properly in Chinese (the page is in UTF-8).

However on my CENTOS machine using the same code I get æ´å¤åå¸ when I print in the terminal. and when I save it to the database, the characters are also messed up.

One thing I noticed is that when I do print $content, both systems display them properly.

cpbills
  • 2,692
  • 17
  • 12

1 Answers1

1

Check mysql connection charset & collation, while you insert the content of grabbed data, Mysql should know which charset & collation will be used to store data. add the lines below to your php script. by this way your code will be much more stable to work on different OS'es & configurations.

`
$dbcharset='utf8';
$dbcollation='utf8_general_ci';
mysql_query('SET CHARACTER SET '.$dbcharset); 
mysql_query("SET NAMES '".$dbcharset."' COLLATE '".$dbcollation."'");
`
risyasin
  • 1,564
  • 9
  • 16