PHP的Excel的读者 – 忽略具有特殊符号的单元格
我使用parsing器将xls转换为csv http://code.google.com/p/php-excel-reader/
<?php set_time_limit(300); require_once 'excel_reader2.php'; $data = new Spreadsheet_Excel_Reader("file.xls", false, 'UTF-8'); $f = fopen('file.csv', 'w'); for($row = 1; $row <= $data->rowcount(); $row++) { $out = ''; for($col = 1; $col <= $data->colcount(); $col++) { $val = $data->val($row,$col); // escape " and \ characters inside the cell $escaped = preg_replace(array('#”#u', '#\\\\#u', '#[”"]#u'), array('"', '\\\\\\\\', '\"'), $val); if(empty($val)) $out .= ','; else $out .= '"' . $escaped . '",'; } // remove last comma (,) fwrite($f, substr($out, 0, -1)); fwrite($f, "\n"); } fclose($f); ?>
从一些奇怪的原因,它跳过特殊符号的单元格 – 如°或®。 如何解决?
utf8_decode
和html_entity_decode
适用于我:
<?php set_time_limit(300); require_once 'excel_reader2.php'; $data = new Spreadsheet_Excel_Reader("file.xls", false, 'UTF-8'); $f = fopen('file.csv', 'w'); for($row = 1; $row <= $data->rowcount(); $row++) { $out = ''; for($col = 1; $col <= $data->colcount(); $col++) { $val = $data->val($row,$col); // escape " and \ characters inside the cell $escaped = preg_replace(array('#”#u', '#\\\\#u', '#[”"]#u'), array('"', '\\\\\\\\', '\"'), $val); $escaped = utf8_decode($escaped); //$escaped = html_entity_decode($escaped); if(empty($val)) $out .= ','; else $out .= '"' . $escaped . '",'; } // remove last comma (,) fwrite($f, substr($out, 0, -1)); fwrite($f, "\n"); } fclose($f); ?>
输出:
"1","2","3","4","5" "a","b","c","d","e" "6","7","°","9","10" "q","w","e","r","t" "®","12","13","14","15" "z","x","c","v","b"