浏览代码

add pandas 2.2.2 documentation

Mustafa Cagri Ardic 1 年之前
父节点
当前提交
75a085b92d
共有 2 个文件被更改,包括 27 次插入1 次删除
  1. 4 1
      docs/file-scrapers.md
  2. 23 0
      lib/docs/scrapers/pandas.rb

+ 4 - 1
docs/file-scrapers.md

@@ -183,10 +183,13 @@ mv ./usr/share/doc/openjdk-16-jre-headless/api/ docs/openjdk~$VERSION
 
 ## Pandas
 
+From the home directory; `devdocs`, execute below:
+
 ```sh
-curl https://pandas.pydata.org/docs/pandas.zip | bsdtar --extract --file - --directory=docs/pandas~1
+curl https://pandas.pydata.org/docs/pandas.zip -o tmp.zip && unzip tmp.zip -d docs/pandas~2 && rm tmp.zip
 ```
 
+
 ## PHP
 Click the link under the "Many HTML files" column on https://www.php.net/download-docs.php, extract the tarball, change its name to `php` and put it in `docs/`.
 

+ 23 - 0
lib/docs/scrapers/pandas.rb

@@ -16,6 +16,29 @@ module Docs
       Licensed under the 3-clause BSD License.
     HTML
 
+    version '2' do
+      self.release = '2.2.2'
+      self.base_url = "https://pandas.pydata.org/pandas-docs/version/#{self.release}/"
+
+      html_filters.push 'pandas/clean_html', 'pandas/entries'
+
+      options[:container] = 'main section'
+
+      options[:skip_patterns] = [
+        /development/,
+        /getting_started/,
+        /whatsnew/
+      ]
+
+      options[:skip] = [
+        'panel.html',
+        'pandas.pdf',
+        'pandas.zip',
+        'ecosystem.html'
+      ]
+
+    end
+
     version '1' do
       self.release = '1.5.0'
       self.base_url = "https://pandas.pydata.org/pandas-docs/version/#{self.release}/"