Most stuff can be retrieved via oembed http://www.oembed.com/ https://github.com/WordPress/WordPress/blob/master/wp-includes/class-oembed.php