[UPHPU] PHP parser
Aaron Luman
aaron.luman at gmail.com
Fri Oct 2 18:13:35 MDT 2009
As a matter of introduction I should say that I struggle with PHP. I
do most of my web work in PHP, but probably not how I should be doing
it, haha. Anyway, I'm trying to learn better practices as well as how
to use pre-made scripts and tools.
My current struggle with PHP is that i am trying to figure out a
simple way to parse an html file. My hope is that I will be able to
copy the source and then submit it via a form using post. I found two
pre-made scripts that looked promising, but I am not sure how good of
an option they are.
The first - http://sourceforge.net/projects/simplehtmldom/ - looked
good but then when I tried using it with the entire file it returned
memory usage errors while testing on my local machine.
The second - http://php-html.sourceforge.net/ - works without error,
but takes nearly a minute to parse the full file (nearly 5 MB of text)
I am concerned that when I get my code working and posted to my host
that they will freak out about the heavy workload imposed by parsing
the large files.
The end result of parsing the source is that I would like to be able
to find particular values so that they can be filtered out along with
their markup (a table of between 1500 and 2000 characters) and
reposted in a final, results page, which should be fairly easy once I
get the parsing concerns worked out.
Should I be worried about the memory/processor draw while using these
(or similar) parses? Do any of you have experience with another
parsing tool that is more efficient? Is there a better way to go
about doing this?
Thanks in advance for any help that you might be able to offer.
Aaron
More information about the UPHPU
mailing list