Ja procurei mas não encontrei nada que fosse claro.
Preciso ler um arquivo html e pegar conteudo entre tags html
Achei o jtidy, mas não consegui fazer rodar.
Achei tb sobre o url, mas ele atende.
o que me interessa são os dados entre as tags <td></td>
exemplo<tr bgcolor="#D5BCCD">
<td>1</td>
<td>29/09/2003</td>
<td>18</td>
<td>20</td>
<td>25</td>
<td>23</td>
<td>10</td>
<td>11</td>
<td>24</td>
<td>14</td>
<td>06</td>
<td>02</td>
<td>13</td>
<td>09</td>
<td>05</td>
<td>16</td>
<td>03</td>
<td>0,00</td>
<td>5</td>
<td>154</td>
<td>4645</td>
<td>48807</td>
<td>257593</td>
<td>49.765,82</td>
<td>689,84</td>
<td>10,00</td>
<td>4,00</td>
<td>2,00</td>
<td>0,00</td>
<td>0,00</td>
</tr>
<tr>
<td>2</td>
<td>06/10/2003</td>
<td>23</td>
<td>15</td>
<td>05</td>
<td>04</td>
<td>12</td>
<td>16</td>
<td>20</td>
<td>06</td>
<td>11</td>
<td>19</td>
<td>24</td>
<td>01</td>
<td>09</td>
<td>13</td>
<td>07</td>
<td>0,00</td>
<td>1</td>
<td>184</td>
<td>6232</td>
<td>81252</td>
<td>478188</td>
<td>596.323,70</td>
<td>1.388,95</td>
<td>10,00</td>
<td>4,00</td>
<td>2,00</td>
<td>0,00</td>
<td>0,00</td>
</tr>
</tbody></table>
</body></html>