Parse Website form into array

2013 年 8 月 3 日3530

Project Description:
I have 1 webpage that i need parsed, best options would be with a perl module or perl grep. I need the values from the form to be put into an array. So in the file below, i would need the quote, and any other information to be taken out, thanks

Additional Project Description:

08/29/2011 at 22:55 HKT

example

page code looks like this:

code.......
" id="z" title="Browse titles that begin with "Z"">ÂÂZÂÂ</div>
</div>

<div id="content" class="clearfix">
<div align="center" style="margin-bottom: 10px;">
<script type="text/javascript"><!--
google_ad_client = "pub-5749494218316031";
/* MQDB Header - 728x90, created 5/4/10 */
google_ad_slot = "6464136780";
google_ad_width = 728;
google_ad_height = 90;
//-->
</script>
<script type="text/javascript"
src="http://http://www.zjjv.com///pagead/show_ads.js">
</script>
</div>

<div style="margin-bottom: 12px;">

<b>Ratings:</b> 0, <b>Total:</b> <b>0</font></b>
</div>

<form name="quote"><input type="hidden" name="text" value="Derek Zoolander: Oh, I thought you were going to tell me what a bad eugoogalizor I am.
Matilda: What?
Derek Zoolander: A eugoogalizor, one who speaks at funerals. Or did you think I'd be too stupid to know what a eugoogoly was?"></form>

<div class="quote_nohover padded" style="margin-bottom: 0; border-bottom: 0; padding: 18px;">
<div style="padding-bottom: 10px; display: none;">
<script type="text/javascript"><!--
google_ad_client = "pub-5749494218316031";
/* MQDB Links Bar 728x15, created 6/9/10 */
google_ad_slot = "6727755440";
google_ad_width = 728;
google_ad_height = 15;
//-->
</script>
<script type="text/javascript"
src="http://http://www.zjjv.com///pagead/show_ads.js">
</script>
</div>

<font style="font-size: 14px">
<b>Derek Zoolander</b>: Oh, I thought you were going to tell me what a bad eugoogalizor I am.<br />
<b>Matilda</b>: What?<br />
<b>Derek Zoolander</b>: A eugoogalizor, one who speaks at funerals. Or did you think I'd be too stupid to know what a eugoogoly was?<br />
</font>
</div>

<div class="titlebar" align="right" style="background-color: #333; background-image: none; margin-bottom: 25px;">
<span style="float: left">
code..........

Im looking for an array to be automatically parsed containing the data:
movie[name] = zoolander
movie[character1]= Matilda
movie[character2] = Derek Zoolander
movie[quote] = A eugoogalizor, one who speaks at funerals. Or did you think I'd be too stupid to know what a eugoogoly was?

09/09/2011 at 9:40 HKT

I am using the content from the website http://www.zjjv.com/. It does appear that every page is in the same format, some pages have the character names on other parts of the page. I will just need it parsed one time (probably Grep)and added to a mysql database with as many fields as possible. i.e name, character, character quote etc. Please let me know what you feel the best way this should be done, thanks

Skills required:

HTML,

Perl,

PHP

Additional Files:

quote_26700.html

See more:

website parse perl, perl parse, price quote form website, form website data entries excel file, add form website customers upload file, quote form website, add quote form website, file sending form website, create website form submit pdf file, online quote form website, php parse webpage, create form website, python parse file database

0 0