rss

Automatically Extracting feed links (atom, rss,etc) from webpages [closed]

独自空忆成欢 提交于 2020-01-01 00:12:08
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed last year . I have a huge list of URLs and my task is to feed them to a python script which should spit out the feed urls if there are any. Is there an API library or code out there that can help? 回答1: I second waffle paradox in recommending Beautiful Soup for parsing the HTML and then getting the <link rel="alternate"> tags,

Objective-C Library for RSS feed parsing?

China☆狼群 提交于 2019-12-31 22:41:27
问题 I'm looking for an Objective-C library for an iOS app that will parse and display articles form an RSS feed. Specifically,m I'm looking to make an app for a news website. The framework should be able to work with various RSS XML structures if possible. (Well, two different news sites. One uses Wordpress-generated RSS and one does not.) 回答1: You could use Apples NSXMLParser Class. Or you could use the C-based libXML2. 回答2: MWFeedParser is another interesting library to look into. 回答3: How

Android XML Parser skipping tags

纵然是瞬间 提交于 2019-12-31 06:03:50
问题 I am making an reader that will read RSS Feeds. At first I wanted to use a library but realized they were failing to load some data so I decided to make my own reader. But here is the issue. It is not always that my parser returns an image, depending on the site like for this RSS The image is in the content:data while for RSS like that of BBC the image is found in the content:data I fail to get in the first iteration So i have the next item getting the image of the previous item, below is my

Android XML Parser skipping tags

假如想象 提交于 2019-12-31 06:02:28
问题 I am making an reader that will read RSS Feeds. At first I wanted to use a library but realized they were failing to load some data so I decided to make my own reader. But here is the issue. It is not always that my parser returns an image, depending on the site like for this RSS The image is in the content:data while for RSS like that of BBC the image is found in the content:data I fail to get in the first iteration So i have the next item getting the image of the previous item, below is my

Python Regex doesn't work as expected

自闭症网瘾萝莉.ら 提交于 2019-12-31 04:55:06
问题 i've crafted this regular expression : <entry>\\n<(\w+)>(.+?)</\w+>\\n</entry> to parse the following RSS Feed : <?xml version="1.0" encoding="UTF-8"?>\n<feed version="0.3" xmlns="http://purl.org/atom/ns#">\n<title>Gmail - Inbox for g.bargelli@gmail.com</title>\n<tagline>New messages in your Gmail Inbox</tagline>\n<fullcount>2</fullcount>\n<link rel="alternate" href="http://mail.google.com/mail" type="text/html" />\n<modified>2011-03-15T11:07:48Z</modified>\n<entry>\n<title>con due mail...<

PHP way of parsing HTML string

坚强是说给别人听的谎言 提交于 2019-12-30 19:26:43
问题 I have a php string that contains the below HTML I am retrieving from an RSS feed. I am using simple pie and cant find any other way of splitting these two datasets it gets from <description> . If anyone knows of a way in simple pie to select children that would be great. <div style="example"><div style="example"><img title="example" alt="example" src="example.jpg"/></div><div style="example">EXAMPLE TEXT</div></div> to: $image = '<img title="example" alt="example" src="example.jpg">';

How to add System.ServiceModel.Syndication.dll in Windows Phone 8 project?

走远了吗. 提交于 2019-12-30 13:48:10
问题 I am trying to build an rss feed app. and i want to add System.Servicemodel.Syndication . But when i am trying to add, it is saying a reference to a higher version or incompatible assembly cannot be added to the project. After reading some post on how to solve it, i try to unblock the assembly but i found no unblock button. That means the project is already unblocked. So how can i solve the issue?? Is there any alternative way to read rss? 回答1: Judging from the documentation here Adding a

HTML parsing in Android

血红的双手。 提交于 2019-12-30 11:31:48
问题 I am trying to learn how to parse HTML, but as I don't have a lot of experience in either Java or Android, it's a little complicated. I have read the IBM XML parsing tutorial and have learned to parse an RSS feed. My problem is: I would like to get data from an HTML site. I have read some information on HTML cleaner, JSON, etc., but I can't find a good tutorial to help me. Do you have any tutorials that might be helpful? Thanks. 回答1: Check out the following HTML parsers. There are more out

How to detect if a page is an RSS or ATOM feed

落爺英雄遲暮 提交于 2019-12-30 10:59:48
问题 I'm currently building a new online Feed Reader in PHP. One of the features i'm working on is feed auto-discovery. If a user enters a website URL, the script will detect that its not a feed and look for the real feed URL by parsing the HTML for the proper tag. The problem is, the way im currently detecting if the URL is a feed or a website only works part of the time, and I know it can't be the best solution. Right now im taking the CURL response and running it through simplexml_load_string,

Index '5' beyond bounds of empty array crash

别等时光非礼了梦想. 提交于 2019-12-30 10:00:13
问题 I'm building an RSS-Reader and put a refreshbutton on the right corner of the navigation bar. It works fine and I get no crashes. But if if I press the refresh button during scrolling the app crashes. And I have no idea where the problem is. I analyzed the project but it couldn't find anything... So here's the error I get: 2012-01-22 16:36:48.205 GYSA[712:707] *** Terminating app due to uncaught exception 'NSRangeException', reason: '*** -[__NSArrayM objectAtIndex:]: index 5 beyond bounds for