ClearWord: a function for removing redundant HTML code pasted from Word

Source: 百度文库 (Baidu Wenku)  Editor: 九乡新闻网  Date: 2024/07/07 13:23:10
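The FCKeditor editor includes a feature, written in JavaScript, that strips the redundant HTML that comes along when content is pasted from Word. With it, page content can be copied and pasted straight from Word without a pile of extra markup taking up database space and hurting page performance. Following FCKeditor's JavaScript implementation, we wrote a CFScript version, the ClearWord function (declared as CleanWord in the listing below), so that the cleanup can be applied directly when page content is saved to the database.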

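    function CleanWord(html)
    {
        // Remove empty <o:p> tags; replace non-empty ones with a non-breaking space.
        html = REReplaceNocase(html,'<o:p>\s*</o:p>','','all');
        html = REReplaceNocase(html,'<o:p>.*?</o:p>','&nbsp;','all');
        // Remove mso-xxx styles.
        html = REReplaceNocase(html,'\s*mso-[^:]+:[^;"]+;?','','all');
        // Remove zero margin and indent declarations, plus text-align,
        // page-break-before and font-variant declarations that end a style attribute.
        html = REReplaceNocase(html,'\s*MARGIN: 0cm 0cm 0pt\s*;','','all');
        html = REReplaceNocase(html,'\s*MARGIN: 0cm 0cm 0pt\s*"','"','all');
        html = REReplaceNocase(html,'\s*TEXT-INDENT: 0cm\s*;','','all');
        html = REReplaceNocase(html,'\s*TEXT-INDENT: 0cm\s*"','"','all');
        html = REReplaceNocase(html,'\s*TEXT-ALIGN: [^\s;]+;?"','"','all');
        html = REReplaceNocase(html,'\s*PAGE-BREAK-BEFORE: [^\s;]+;?"','"','all');
        html = REReplaceNocase(html,'\s*FONT-VARIANT: [^\s;]+;?"','"','all');
        html = REReplaceNocase(html,'\s*tab-stops:[^;"]*;?','','all');
        html = REReplaceNocase(html,'\s*tab-stops:[^"]*','','all');
        // Remove font face attributes and font-family styles.
        html = REReplaceNocase(html,'\s*face="[^"]*"','','all');
        html = REReplaceNocase(html,'\s*face=[^ >]*','','all');
        html = REReplaceNocase(html,'\s*FONT-FAMILY:[^;"]*;?','','all');
        // Remove class and style attributes, keeping the rest of the tag.
        html = REReplaceNocase(html,'<(\w[^>]*) class=([^ |>]*)([^>]*)','<\1\3','all');
        html = REReplaceNocase(html,'<(\w[^>]*) style="([^"]*)"([^>]*)','<\1\3','all');
        // Remove empty style attributes.
        html = REReplaceNocase(html,'\s*style="\s*"','','all');
        // Collapse spans that hold only a non-breaking space; drop empty spans.
        html = REReplaceNocase(html,'<SPAN\s*[^>]*>\s*&nbsp;\s*</SPAN>','&nbsp;','all');
        html = REReplaceNocase(html,'<SPAN\s*[^>]*></SPAN>','','all');
        // Remove lang attributes.
        html = REReplaceNocase(html,'<(\w[^>]*) lang=([^ |>]*)([^>]*)','<\1\3','all');
        // Unwrap attribute-less <SPAN> and <FONT> tags.
        html = REReplaceNocase(html,'<SPAN\s*>(.*?)</SPAN>','\1','all');
        html = REReplaceNocase(html,'<FONT\s*>(.*?)</FONT>','\1','all');
        // Remove XML elements and declarations.
        html = REReplaceNocase(html,'<\\?\?xml[^>]*>','','all');
        // Remove tags with XML namespace prefixes, such as <o:p></o:p>.
        html = REReplaceNocase(html,'</?\w+:[^>]*>','','all');
        // Remove empty headings, then turn H1..H6 into bold, sized <div> blocks.
        html = REReplaceNocase(html,'<H\d>\s*</H\d>','','all');
        html = REReplaceNocase(html,'<H1([^>]*)>','<div\1><b><font size="6">','all');
        html = REReplaceNocase(html,'<H2([^>]*)>','<div\1><b><font size="5">','all');
        html = REReplaceNocase(html,'<H3([^>]*)>','<div\1><b><font size="4">','all');
        html = REReplaceNocase(html,'<H4([^>]*)>','<div\1><b><font size="3">','all');
        html = REReplaceNocase(html,'<H5([^>]*)>','<div\1><b><font size="2">','all');
        html = REReplaceNocase(html,'<H6([^>]*)>','<div\1><b><font size="1">','all');
        html = REReplaceNocase(html,'</H\d>','</font></b></div>','all');
        html = REReplaceNocase(html,'<(U|I|STRIKE)>&nbsp;</\1>','&nbsp;','all');
        // Remove empty tags (three passes, to catch nested empties).
        html = REReplaceNocase(html,'<([^\s>]+)[^>]*>\s*</\1>','','all');
        html = REReplaceNocase(html,'<([^\s>]+)[^>]*>\s*</\1>','','all');
        html = REReplaceNocase(html,'<([^\s>]+)[^>]*>\s*</\1>','','all');
        // Transform <P> into <DIV>.
        html = REReplaceNocase(html,'(<P)([^>]*>.*?)(</P>)','<div\2</div>','all');
        return html;
    }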
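
As a minimal sketch of the use described above, where the content is cleaned as it is saved to the database, the function could be called like this. It assumes CleanWord is defined in, or included by, the same template. The form field form.content, the datasource myDsn, and the table and column articles.body are hypothetical names chosen for illustration and do not come from the original article.

    <cfscript>
        // Hypothetical form field holding the HTML pasted into the editor.
        rawHtml = form.content;
        // Strip the Word-specific markup before the value is stored.
        cleanHtml = CleanWord(rawHtml);
    </cfscript>

    <!--- Hypothetical datasource, table and column names. --->
    <cfquery datasource="myDsn">
        INSERT INTO articles (body)
        VALUES (<cfqueryparam value="#cleanHtml#" cfsqltype="cf_sql_longvarchar">)
    </cfquery>

Passing the cleaned value through cfqueryparam keeps the markup out of the SQL text itself, so any quotes left in the HTML cannot break the statement.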

 ---★ This article is reposted from 『IT学习者』 → http://www.itlearner.com/article/2718