remove comments from html source code
Try PHP DOM*:
$html = '<html><body><!--a comment--><div>some content</div></body></html>'; // put your cURL result here
$dom = new DOMDocument;
$dom->loadHtml($html);
$xpath = new DOMXPath($dom);
foreach ($xpath->query('//comment()') as $comment) {
$comment->parentNode->removeChild($comment);
}
$body = $xpath->query('//body')->item(0);
$newHtml = $body instanceof DOMNode ? $dom->saveXml($body) : 'something failed';
var_dump($newHtml);
Output:
string(36) "<body><div>some content</div></body>"
Regex solved this problem for me as follows:
function remove_html_comments($html = '') {
return preg_replace('/<!--(.|\s)*?-->/', '', $html);
}