快捷導(dǎo)航

批處理從html格式(接收到的郵件)中讀取數(shù)據(jù)的操作方法

更新時(shí)間：2023年01月04日 09:15:40 作者：列兵

這篇文章主要介紹了批處理從html格式(接收到的郵件)中讀取數(shù)據(jù)的操作方法,本文給大家介紹的非常詳細(xì)，對(duì)大家的學(xué)習(xí)或工作具有一定的參考借鑒價(jià)值，需要的朋友可以參考下

通過(guò)第三方批處理getmail可以獲取到郵箱里的郵件。獲取后經(jīng)其自身解碼，得到一個(gè)Extract*.out文件，大致看一下其格式應(yīng)該為html的寫(xiě)法，并且內(nèi)容只分一行。

<div dir="auto">here is the content</div>

顯然我郵件發(fā)送的原內(nèi)容為：

here is the content

現(xiàn)在要提取出其中的原內(nèi)容并且將所有內(nèi)容合并到一行。問(wèn)題在于，郵箱不一定每次都接收到這樣簡(jiǎn)單的文件。例如內(nèi)容當(dāng)中出現(xiàn)換行：

<div dir="auto">abababababababab<br /><br />abababababababab<br /><br /></div>

有時(shí)內(nèi)容含特殊字符，解碼之后也無(wú)法正常顯示（某些字符）：

i wanna get these words # $ % & * @ ? !

<div dir="auto">i wanna get these words # $ % &amp; * &#64; ? !</div>

有時(shí)發(fā)送的內(nèi)容經(jīng)過(guò)復(fù)制粘貼，不小心留下了其他信息：

<div dir="auto">hey get it <span style="font-family:sans-serif">hey get it </span><br style="font-family:sans-serif" /><span style="font-family:sans-serif">hey get it </span><span style="font-family:sans-serif">hey get it </span><br style="font-family:sans-serif" /></div>

顯然需要忽略<>標(biāo)簽里的所有內(nèi)容才能得到原內(nèi)容，并且需要還原未正常顯示的符號(hào)。
可能要用到第三方了……

到此這篇關(guān)于批處理從html格式(接收到的郵件)中讀取數(shù)據(jù)的操作方法的文章就介紹到這了,更多相關(guān)批處理html格式讀取數(shù)據(jù)內(nèi)容請(qǐng)搜索腳本之家以前的文章或繼續(xù)瀏覽下面的相關(guān)文章希望大家以后多多支持腳本之家！

您可能感興趣的文章: