2012-02-07 56 views
0

サイトから特定のコンテンツを取得してテキストファイルに配置しようとしています。私は処理したいと思うURLのループのためのリストボックスを使用して、もう一つはデータの出力を見ました。今私はテキストファイルのすべてのデータを "〜" sysmbolで区切られた各項目にしたい。テキストファイルに期待http://www.maxpreps.com/high-schools/abbeville-yellowjackets-(abbeville,al)/basketball/previous_seasons.htmテキストファイルにデータを書き込む

データ:

Exmapleリンクiがmy.txtファイルで使用
アブビル高校バスケットボール統計〜チーム:バーシティ11-12〜色:マルーン、グレー、ホワイト...

Imports System.IO.StreamReader 
Imports System.Text.RegularExpressions 
Imports System.IO 


Public Class Form1 
    Private Sub Button1_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles Button1.Click 
     Dim abc As String = My.Computer.FileSystem.ReadAllText("C:\Documents and Settings\Santosh\Desktop\my.txt") 
     Dim pqr As String() = abc.Split(vbNewLine) 
     ListBox2.Items.AddRange(pqr) 

    End Sub 
    Private Sub Button2_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles Button2.Click 
     For Each item In ListBox2.Items 
      Dim request As System.Net.HttpWebRequest = System.Net.WebRequest.Create(item) 
      Dim response As System.Net.HttpWebResponse = request.GetResponse 

      Dim sr As System.IO.StreamReader = New System.IO.StreamReader(response.GetResponseStream()) 
      Dim rsssource As String = sr.ReadToEnd 
      Dim r As New System.Text.RegularExpressions.Regex("<h1 id=""ctl00_NavigationWithContentOverRelated_ContentOverRelated_Header_Header"">.*</h1>") 
      Dim r1 As New System.Text.RegularExpressions.Regex("<span id=""ctl00_NavigationWithContentOverRelated_ContentOverRelated_Header_Mascot"">.*</span>") 
      Dim r3 As New System.Text.RegularExpressions.Regex("<span id=""ctl00_NavigationWithContentOverRelated_ContentOverRelated_Header_Colors"">.*</span>") 
      Dim r4 As New System.Text.RegularExpressions.Regex("<span id=""ctl00_NavigationWithContentOverRelated_ContentOverRelated_Header_GenderType"">.*</span>") 
      Dim r5 As New System.Text.RegularExpressions.Regex("<span id=""ctl00_NavigationWithContentOverRelated_ContentOverRelated_Header_AthleteDirectorGenericControl"">.*</span>") 
      Dim r6 As New System.Text.RegularExpressions.Regex("<address>.*</address>") 
      Dim r7 As New System.Text.RegularExpressions.Regex("<span id=""ctl00_NavigationWithContentOverRelated_ContentOverRelated_Header_Phone"">.*</span>") 
      Dim r8 As New System.Text.RegularExpressions.Regex("<span id=""ctl00_NavigationWithContentOverRelated_ContentOverRelated_Header_Fax"">.*</span>") 

      Dim matches As MatchCollection = r.Matches(rsssource) 
      Dim matches1 As MatchCollection = r1.Matches(rsssource) 
      Dim matches3 As MatchCollection = r3.Matches(rsssource) 
      Dim matches4 As MatchCollection = r4.Matches(rsssource) 
      Dim matches5 As MatchCollection = r5.Matches(rsssource) 
      Dim matches6 As MatchCollection = r6.Matches(rsssource) 
      Dim matches7 As MatchCollection = r7.Matches(rsssource) 
      Dim matches8 As MatchCollection = r8.Matches(rsssource) 


      For Each itemcode As Match In matches 
       Dim W As New IO.StreamWriter("C:\" & FileName.Text & ".txt") 
       W.Write(itemcode.Value.Split("""").GetValue(2)) 
       W.Close() 

       'ListBox1.Items.Add(itemcode.Value.Split("""").GetValue(2)) 
      Next 
      For Each itemcode As Match In matches1 
       ListBox1.Items.Add(itemcode.Value.Split("""").GetValue(2)) 
      Next 
     Next item 

    End Sub 
End Class 

答えて

0

このようにWriteステートメントの末尾に追加するだけです。

W.Write(itemcode.Value.Split("""").GetValue(2) & " ~ ") 
+0

各ループのコードはMatchのみです。Match〜match1〜match2〜match4 ....のようなテキストが必要です – sam

関連する問題