ruby - Parse HTML with Nokogiri and get text with the closest "" -

- June 15, 2012

मेरे पास एचटीएमएल युक्त है:

  & lt; div class = "s" & gt; & Lt; p & gt; पाठ 1 & lt; / p & gt; & lt; div class = "i" & gt; & Lt; p & gt; पाठ 2 & lt; / p & gt; & Lt; / div & gt; & Lt; / div & gt;    मुझे कक्षा  "s"  के साथ निकटतम  & lt; div & gt;  के साथ सभी लेख प्राप्त करने की आवश्यकता है।  
 उदाहरण के लिए, मैं प्राप्त करने की कोशिश कर रहा हूं:  
  सरणी = [] html.css ("s")। प्रत्येक do | element | सरणी & lt; & lt; Element.text.strip end    यह सब अच्छा है, सिवाय इसके कि मेरे सरणी में  "text2"  प्रकट होता है, और मुझे यह नहीं चाहिए तो  "text2"  के लिए, सबसे निकटतम  
 gt;  वर्ग  "i"  था, और मैं इसे मेरी सरणी में नहीं देखना चाहता ।   मैं इसका समाधान कैसे कर सकता हूं? उदाहरण के लिए: विभिन्न वर्ग के नाम और गहरी घोंसले के शिकार हो सकते हैं:  
  & lt; div class = "s" & gt; & Lt; p & gt; पाठ 1 & lt; / p & gt; & Lt; div class = "i" & gt; & Lt; p & gt; पाठ 2 & lt; / p & gt; & Lt; div वर्ग = "s" & gt; & Lt; p & gt; पाठ 3 & lt; / p & gt; & lt; div class = "p" & gt; & Lt; p & gt; टेक्स्ट 4 & lt; / p & gt; & Lt; / div & gt; & Lt; / div & gt; & Lt; / div & gt; & Lt; / div & gt;    इस से, मैं इस पर एक सरणी प्राप्त करना चाहता हूं:  ["text1", "text3"]    
 < Div class = "post-text" itemprop = "text"> 
 अपडेट: बेहतर xpath-only उत्तर मेरा मूल उत्तर नीचे है।  
  # एक `नोकोगीरी :: HTML दस्तावेज़ को` html` चर में दिये गये: html.xpath ("// text () [normalize-space ()) और पूर्वज: : Div [1] [@ वर्ग = 'एस']] ")। नक्शा (और: पाठ) .मैप (और: पट्टी)    यह सिर्फ सभी रिक्त स्थान को पाता है पाठ नोड जिसका निकटतम  div  पूर्वज का  s  वर्ग है यह मेरे मूल उत्तर के समान है, सिवाय यह पूरी तरह से XPath में किया जाता है।  
  & lt; div class = "s" & gt; & Lt; p & gt; पाठ 1 & lt; / p & gt; & lt; div class = "i" & gt; & Lt; p & gt; पाठ 2 & lt; / p & gt; & Lt; / div & gt; & Lt; / div & gt; # = & Gt; ["Text1"] & lt; div वर्ग = "s" & gt; & Lt; p & gt; पाठ 1 & lt; / p & gt; & lt; div class = "i" & gt; & Lt; p & gt; पाठ 2 & lt; / p & gt; & Lt; div वर्ग = "s" & gt; & Lt; p & gt; पाठ 3 & lt; / p & gt; & lt; div class = "p" & gt; & Lt; p & gt; टेक्स्ट 4 & lt; / p & gt; & Lt; / div & gt; & Lt; / div & gt; & Lt; / div & gt; & Lt; / div & gt; # = & Gt; ["पाठ 1", "पाठ 3"] & lt; div class = "s" & gt; & lt; div वर्ग = 'पी' & gt; पाठ 1 & lt; / div & gt; पाठ 2 & lt; / div & gt; # = & Gt; ["पाठ 2"]   
  मूल उत्तर:  
  html.search ("// div [@ class = 's'] / / पाठ () ")। {| T | चुनें टी। डिस्टेस्टर्स ("डिवि")। पहले.ट्रैट ("क्लास") == "s"} मानचित्र (और: पाठ) .join.squeeze.strip # = & gt; "Text1"    यहां मूल विचार यह है कि हम सभी टेक्स्ट नोड पाते हैं जो  div.s  से उतरते हैं और निकटतम  div  प्रत्येक पाठ नोड के लिए पूर्वज, और केवल नोड्स को स्वीकार करते हैं जिनके पास  s  की एक कक्षा के साथ निकटतम डिवेल पूर्वज हैं। यह थोड़ा सीपीयू-गहन है, लेकिन यह कठोर आवश्यकताओं को पूरा करता है ।   

 




  



















Get link





Facebook





X





Pinterest





Email





Other Apps




Comments





Post a Comment



Popular posts from this blog




java - Why my included JSP file won't get processed correctly? -



-



August 15, 2015








    I am trying to create Java web framework (and learning), and on the basis of 'Code Generator In the process of making the material view of the database in the process of development, I stumbled into a difficulty, which I do not know how to solve it.   First of all, I want the following  index.jsp :    & lt; Body & gt; Use all pages to be created. & Lt;% @ include file = "header.jsp"% & gt; & Lt; Hours / & gt; & Lt;% @ included file = "body.jsp"%> & Lt; Hours / & gt; & Lt;% @ include file = "footer.jsp"% & gt; & Lt; / Body & gt;    And, in  body.jsp , I should have it like this:    & lt; Jsp: include page = "$ {Application_modul}"%>    Where  application_modul  is an attribute defined in its controller:    request.setAttribute ("application_modul", "USER_ACCOUNT \\ ' View_user_account.jsp ");    This file can be found properly, but the processed JSP does ...





Read more





php - PDO bindParam() fatal error -



-



September 15, 2015








I am trying to learn PDO, now I have written this small code, but it gives me a serious error:        Fatal error:  Call a member function at a non-object dot () ...    $ con = new mysqli ("127.0.0.1", "root", "", "csvdangercheck"); $ Query = $ con- & gt; ("Id,` var1`, `var2`) values (" id,: var1,: var2) ";); $ query- & gt; bypass pattern (': id', $ id); $ query- & gt; BindParam (': var1', $ VAL1); $ Query- & gt; bindParam (': var2', $ val2); $ Query-> Executed ();    I Trying to use  print_r ($ con-> errorInfo ())  but    Fatal error:  Undefined method mysqli :: errorInfo ( ) ... can someone tell me what am I missing?      Flaffeh said, you have PDO Missing Mysqy, try it:    $ con = new PDO ('mysql: host = 127.0.0.1; dbname = csvdangercheck', 'root', '');     





Read more





logging - How can I log both the Request.InputStream and
Response.OutputStream traffic in my ASP.NET MVC3 Application for
specific Actions? -



-



August 15, 2014








    For a specific set of tasks, I need to log in to the incoming request inputstream as well as outgoing response. is.   I am using an ActionFilterAttribute for this and overriding the OnActionExecuted and OnResultExecuted methods.   So I'm starting with this idea ...    Public category ActionLoggerAttribute: ActionFilterArrbit {Public Override Zero On-Offed Execution (ActionActioned Contact Filter Filter) {base.OnActionAxecuted (FilterContext); HttpRequestBase request = filterContext.HttpContext.Request; // TODO: Log the request. InputStream} Public Override Zero OnResultExecuted (ResultExecutedContext FilterContext) {base.OnResultExecuted (filterContext); HTTPRPSPointBase Response = FilterContacts Hpptex response; // TODO: Log on Response.OutputStream}}    Ideally I'll hook it up with enterprise library logging because I'm already using it for error logging.    Am I reaching the input and output stream at the right time?   Can I easily use the Enterprise Library to log i...





Read more

Search This Blog

linux

ruby - Parse HTML with Nokogiri and get text with the closest "" -

Comments

Post a Comment

Popular posts from this blog

java - Why my included JSP file won't get processed correctly? -

php - PDO bindParam() fatal error -

logging - How can I log both the Request.InputStream and Response.OutputStream traffic in my ASP.NET MVC3 Application for specific Actions? -