Automan
2014-09-16 01:03:59 +08:00
function rplace($filename) {
$content = file_get_contents($filename);
$content = str_replace('<meta name=Generator content="Microsoft Word 11 (filtered)">', ' <link rel="stylesheet" type="text/css" href="css/book.css">
', $content);
$content = preg_replace('/<\/?SPAN[^>]*>/i', '', $content);
// Class
$content = preg_replace('/<(\w[^>]*) class=([^ |>]*)([^>]*)/i', "<$1$3", $content);
// Style
$content = preg_replace('/<(\w[^>]*) style=([^ |>]*)([^>]*)/i', "<$1$3", $content);
// Lang
$content = preg_replace('/<(\w[^>]*) lang=([^ |>]*)([^>]*)/i', "<$1$3", $content);
$content = str_replace('<p> </p>', '', $content);
$content = preg_replace('/<style>([\s\S]*)<\/style>/i', "", $content);
$content = preg_replace('/<(\w[^>]*) height=([^ |>]*)([^>]*)/i', "<$1$3", $content);
echo $content;
file_put_contents($filename, $content);
}
因为我还要保留其他格式,所以只去除了一部分标签,如果你只要 Table,用strip_tags可能更好一点。
比如 strip_tags($data, '<table><b><p><td><tr><th><tbody>');