有 Java 编程相关的问题?

你可以在下面搜索框中键入要查询的问题!

java Jsoup选择器“非”

此代码如下:

Document doc = Jsoup.connect("http://wikitravel.org/en/San_Francisco").get();
System.out.println(doc.select("h2:contains(Get around) ~ *:not(h2:contains(See) ~ *)"));

输出http://pastebin.com/gkcCfr1F。是否有使“非”选择器包含在内的选择器?现在它正在删除“see”之后的所有内容,我想删除带有^{id1}的最后一个h2标记$

我希望获得的最终输出是:http://pastebin.com/ntpVrgui


共 (1) 个答案

  1. # 1 楼答案

    我会这样做:

    获取内容div:

     StringBuilder sb = new StringBuilder();
        boolean start = false;
        Document doc = Jsoup.connect("http://wikitravel.org/en/San_Francisco").get();
                Elements content = doc.select("#content");
                for (Element element : content) {
                    /*Pseudo code
                       if element is h3 and it contains span with id Navigating and if start is  
    false append it to stringbuilder, set start to true, else append everything in between until you reach h2 with span id See
                      */                
        }