$name
$name
Advisory name of injector, this is for friendly error messages.
Injector that converts http, https and ftp text URLs to actual links.
rewindOffset(bool|int $offset) : mixed
Rewind to a spot to re-perform processing. This is useful if you deleted a node, and now need to see if this change affected any earlier nodes. Rewinding does not affect other injectors, and can result in infinite loops if not used carefully.
bool|int | $offset |
prepare(\HTMLPurifier_Config $config, \HTMLPurifier_Context $context) : bool|string
Prepares the injector by giving it the config and context objects: this allows references to important variables to be made within the injector. This function also checks if the HTML environment will work with the Injector (see checkNeeded()).
\HTMLPurifier_Config | $config | |
\HTMLPurifier_Context | $context |
Boolean false if success, string of missing needed element/attribute if failure
checkNeeded(\HTMLPurifier_Config $config) : bool|string
This function checks if the HTML environment will work with the Injector: if p tags are not allowed, the Auto-Paragraphing injector should not be enabled.
\HTMLPurifier_Config | $config |
Boolean false if success, string of missing needed element/attribute if failure
forward(int $i, \HTMLPurifier_Token $current) : bool
Iterator function, which starts with the next token and continues until you reach the end of the input tokens.
int | $i | Current integer index variable for inputTokens |
\HTMLPurifier_Token | $current | Current token variable. Do NOT use $token, as that variable is also a reference |
warning |
Please prevent previous references from interfering with this functions by setting $i = null beforehand! |
---|
forwardUntilEndToken(int $i, \HTMLPurifier_Token $current, int $nesting) : bool
Similar to _forward, but accepts a third parameter $nesting (which should be initialized at 0) and stops when we hit the end tag for the node $this->inputIndex starts in.
int | $i | Current integer index variable for inputTokens |
\HTMLPurifier_Token | $current | Current token variable. Do NOT use $token, as that variable is also a reference |
int | $nesting |
None found |
backward(int $i, \HTMLPurifier_Token $current) : bool
Iterator function, starts with the previous token and continues until you reach the beginning of input tokens.
int | $i | Current integer index variable for inputTokens |
\HTMLPurifier_Token | $current | Current token variable. Do NOT use $token, as that variable is also a reference |
warning |
Please prevent previous references from interfering with this functions by setting $i = null beforehand! |
---|
<?php
/**
* Injector that converts http, https and ftp text URLs to actual links.
*/
class HTMLPurifier_Injector_Linkify extends HTMLPurifier_Injector
{
/**
* @type string
*/
public $name = 'Linkify';
/**
* @type array
*/
public $needed = array('a' => array('href'));
/**
* @param HTMLPurifier_Token $token
*/
public function handleText(&$token)
{
if (!$this->allowsElement('a')) {
return;
}
if (strpos($token->data, '://') === false) {
// our really quick heuristic failed, abort
// this may not work so well if we want to match things like
// "google.com", but then again, most people don't
return;
}
// there is/are URL(s). Let's split the string.
// We use this regex:
// https://gist.github.com/gruber/249502
// but with @cscott's backtracking fix and also
// the Unicode characters un-Unicodified.
$bits = preg_split(
'/\\b((?:[a-z][\\w\\-]+:(?:\\/{1,3}|[a-z0-9%])|www\\d{0,3}[.]|[a-z0-9.\\-]+[.][a-z]{2,4}\\/)(?:[^\\s()<>]|\\((?:[^\\s()<>]|(?:\\([^\\s()<>]+\\)))*\\))+(?:\\((?:[^\\s()<>]|(?:\\([^\\s()<>]+\\)))*\\)|[^\\s`!()\\[\\]{};:\'".,<>?\x{00ab}\x{00bb}\x{201c}\x{201d}\x{2018}\x{2019}]))/iu',
$token->data, -1, PREG_SPLIT_DELIM_CAPTURE);
$token = array();
// $i = index
// $c = count
// $l = is link
for ($i = 0, $c = count($bits), $l = false; $i < $c; $i++, $l = !$l) {
if (!$l) {
if ($bits[$i] === '') {
continue;
}
$token[] = new HTMLPurifier_Token_Text($bits[$i]);
} else {
$token[] = new HTMLPurifier_Token_Start('a', array('href' => $bits[$i]));
$token[] = new HTMLPurifier_Token_Text($bits[$i]);
$token[] = new HTMLPurifier_Token_End('a');
}
}
}
}
// vim: et sw=4 sts=4