Repeatable unescaping of html content leads to not valid html

https://github.com/ebsco/edsapi-php-sample/blob/b3e5f321ecc5fb34dbff6f55ecad24648c0cb29c/rest/EBSCOResponse.php#L1115

This line leads to invalid HTML for some documents (for example for `/edsapi/rest/Retrieve?an=T115986&dbid=dmp`) because of double decoding of HTML content (`&amp;lt;` becomes `<` inside HTML body). 

Looks like there is no reason to decode HTML content here - it is already decoded inside `SimpleXML` object. The only thing left to decode is the content of the `<ephtml>` tags which is double encoded.
So, this line should probably be something like this:
```php
$data = preg_replace_callback('/<ephtml>(.*?)<\/ephtml>/m', function($escaped) {
            return html_entity_decode($escaped[0]);
}, $data);
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repeatable unescaping of html content leads to not valid html #5

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Repeatable unescaping of html content leads to not valid html #5

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions