=================== Webscraping and RPA =================== .. role:: raw-html(raw) :format: html .. _click: |click| Click ============== .. |click| image:: ../../png/click.png :width: 60 Description ----------- Click Node does a mouse (double)click on a given coordinates. Parameters ---------- Two parameters, a pair of coordinates **count in pixels from upper left corner**, are required: .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - X (required) - integer - A horizontal coordinate count in pixels from left edge of the display. * - Y (required) - integer - A vertical coordinate count in pixels from the upper edge of the display. * - Double click - bool (checkbox) - If checked (True): mouse will perform a double-click, else single-click .. _click_image: |click_image| Click Image ============================ .. |click_image| image:: ../../png/click_image.png :width: 60 Description ----------- Finds provided image on currently scraped page and clicks at point that has coordinates (image_pos_X + X, image_pos_Y + Y). In other words icon finds the coordinates of image on page and clicks at point (X, Y), relatively to found image. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Image path (required) - string - Path to image * - Click offset X (required) - int/float - Relative to image offset on axis X * - Click offset Y (required) - int/float - Relative to image offset on axis Y .. _wait: |wait| Wait ============ .. |wait| image:: ../../png/wait.png :width: 60 Description ----------- Wait Node creates a pauses between pipeline steps. It waits (and does nothing, obviously) for a specified amount of ms and then lets the next step in a pipeline to proceed. Parameters ---------- Two parameters are required: .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Miliseconds (required) - int/float - Waiting time interval in *miliseconds*, e.g. entry: 1000 :raw-html:`→` 1000 ms (1 s) waiting time before another step. * - Add random ms - int/float - Adds a random real picked from a uniform distribution defined on the interval (- entered value, + entered value). .. _write: |write| Write ============== .. |write| image:: ../../png/write.png :width: 60 Description ----------- Types the text given via the entry box as if it were typed on the regular keyboard. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Text (required) - string - The text to be typed by the write icon .. _use_key: |use_key| Use Key ================= .. |use_key| image:: ../../png/command.png :width: 60 Description ----------- Hits a single key or a combination of keys on a keyboard. At least one key must be entered and a combination (series) of up to three keys can be set up. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Key 1 (required) - comboentry value - The keyboard key to be clicked or, if other keys entered, the first key of a combination. Can be either selected from the combobox or written in manually. * - Key 2 - comboentry value - The second key of the combination. Can be either selected from the combobox or written in manually. * - Key 2 - comboentry value - The third key of the combination. Can be either selected from the combobox or written in manually. .. _open_browser: |open_browser| Open Browser ================================ .. |open_browser| image:: ../../png/open_browser.png :width: 60 Description ----------- Opens the system default web browser (for further usage such as :ref:`Load Website ` etc.). Allows to choose driver to use. For now available: Firefox (Geckodriver), Chrome. **Note**: To use a driver an appropriate web browser should be installed. Driver itself will be installed automatically. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Show browser - boolean - Show browser GUI * - Driver - driver (combobox) - Driver type (Firefox, Chrome) .. _load_website: |load_website| Load Website ============================ .. |load_website| image:: ../../png/load_website.png :width: 60 Description ----------- Loads website with a given URL. Parameters ---------- One parameter is required: .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - URL (required) - string - URL of the loaded website, e.g. https://forloop.ai/. * - Take screenshot - boolean - If screenshot is needed (e.g. for using in :ref:`Browser View `) .. _refresh_page_source: |refresh_page_source| Refresh Page Source ========================================= .. |refresh_page_source| image:: ../../png/refresh_page_source.png :width: 60 Description ----------- Refreshes source code of the loaded website. .. _click_xpath_icon: |click_xpath| Click XPath Element =================================== .. |click_xpath| image:: ../../png/click_xpath.png :width: 60 Description ----------- Clicks on the website element specified by its XPath. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - XPath (required) - string - The XPath of the element to be clicked upon, e.g. *//div[@class="media-select__input-content"]//button*. .. _scan_web_page_icon: |scan_web_page| Scan Web Page =================================== .. |scan_web_page| image:: ../../png/scan_web_page.png :width: 60 Description ----------- Searches for different elements on page and highlights them in :ref:`Browser View `. Available types of elements: * Tables * Bullet lists * Texts * Headlines * Links * Images * \+ elements with custom XPath Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Tables - boolean - Include tables in search * - Bullet lists - boolean - Include bullet lists in search * - Texts - boolean - Include texts in search * - Headlines - boolean - Include headlines lists in search * - Links - boolean - Include links in search * - Images - boolean - Include images in search * - Custom XPath (optional) - string - Include elements with custom XPath .. _extract_xpath_icon: |print_xpath| Extract XPath Element ===================================== .. |print_xpath| image:: ../../png/print_xpath.png :width: 60 Description ----------- Finds HTML element using XPath and exports it. Also allows to load XPath of element from :ref:`Browser View ` to the first parameter (*XPath*). Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - XPath (required) - string - XPath of element to be extracted * - Output variable (required) - string - Name of variable to be created inside platform * - Output filename (required) - string - Name of file to be created * - Write in file mode - string (combobox) - File mode to use: - w+ (rewrite previous file content), - a+ (append to previous file content) |print_xpath| Extract Multiple XPaths ===================================== Description ----------- Finds multiple HTML elements using XPath and exports them. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Extraction setup file path (required) - string - Path to .txt file with XPaths (1 per row) * - Output variable (required) - string - Name of variable to be created inside platform .. _extract_table_xpath_icon: |print_xpath| Extract Table XPath Element ========================================== Description ----------- Finds table using XPath and exports it. Also allows to load XPath of element from :ref:`Browser View ` to the first parameter (*XPath*). Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - XPath Rows (required) - string - XPath of table row * - XPath Columns (required) - string - XPath of table column * - Output variable (required) - string - Name of variable to be created inside platform .. _set_proxy: |print_xpath| Set Proxy ========================================== .. |set_proxy| image:: ../../png/set_proxy.png :width: 60 Description ----------- Set's a new proxy connection. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - IP (required) - string - IP of the proxy server. * - Port (required) - integer - Proxy server port. * - Username - string - Login username. * - Password - string - Login password. .. _click_name: |click_name| Click Name Element ================================= .. |click_name| image:: ../../png/click_name.png :width: 60 Description ----------- Clicks on the website element containing the given text. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Text (required) - string - The text to be searched on the website and clicked upon. .. _click_id: |click_id| Click Id Element ============================ .. |click_id| image:: ../../png/click_id.png :width: 60 Description ----------- Clicks on the website element specified by its ID. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Id - string - The ID of the element to be clicked upon. .. _get_current_url: |get_current_url| Get Current URL =================================== .. |get_current_url| image:: ../../png/get_current_url.png :width: 60 Description ----------- Saves the URL of web page, which is being currently scraped, to new variable inside platform. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Output variable (required) - string - Name of variable to be created inside platform .. _close_browser: |get_current_url| Close Browser =================================== .. |close_browser| image:: ../../png/close_browser.png :width: 60 Description ----------- Closes browser after all scraping processes are done. .. _wait_until_element_is_located: |wait_until_element_is_located| Wait Until Element Is Located =============================================================== .. |wait_until_element_is_located| image:: ../../png/wait_until_element_is_located.png :width: 60 Description ----------- Creates a pause until a certain element is located. Can be useful for instance when dealing with popup windows on page. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - XPath (required) - string - XPath of element that should appear on page .. _extract_page_source: |page_source| Extract Page Source =================================== .. |page_source| image:: ../../png/refresh_page_source.png :width: 60 Description ----------- Extracts the HTML source of currently scraped web page and saves it to new variable inside platform. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Output variable (required) - string - Name of variable to be created inside platform .. _scroll_web_page: |scroll_web_page| Scroll Web Page =================================== .. |scroll_web_page| image:: ../../png/scroll_web_page.png :width: 60 Description ----------- Scrolls web page by X pixels in directions Up/Down. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Scroll to - string (combobox) - The direction of scrolling (Up/Down) * - Scroll by (required) - int - Length of scrolling in pixels * - Scroll max - boolean - Scroll to the top / end of page .. _create_folder: |create_folder| Create Folder ============================= .. |create_folder| image:: ../../png/add_folder.png :width: 60 Description ----------- Creates a new folder with a given name at a given location. Parameters ---------- Two parameters, a name of the new folder and its location are required. .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Folder location (req.) - string - Path to a directory in which the new folder will be created, eg. */Users/MrForloop/new_dir_place*. * - Folder name (req.) - string - A name of the new folder, eg. *my_new_dir*. The final path to the new directory will be in the example case */Users/MrForloop/new_dir_place/my_new_dir*. .. _move_file: |move_file| Move File ===================== .. |move_file| image:: ../../png/move_file.png :width: 60 Description ----------- Moves a file from it's current directory to a different one. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - File (req.) - string - Path to a file to be moved, eg. */Users/awesome_file.flpl*. * - Move to (req.) - string - Path to a directory to which the file will be moved into, eg. */Users/MrForloop*. The final path to the moved file will be in the example case */Users/MrForloop/awesome_file.flpl*. .. _create_file_queue: |create_file_queue| Create File Queue ===================================== .. |create_file_queue| image:: ../../png/batch.png :width: 60 Description ----------- Creates a file queue folder whit a specified template suffix, i.e. *.txt*. Files in this queue can be further processed directly in Forloop. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Root Folder (req.) - string - Path to a root folder where the queue directory will be created, eg. */Users/MrForloop/Desktop/*. * - Queue name (req.) - string - A name of the to be created queue directory. * - Suffix - combobox option - Only files with this specific suffix in the queue folder will be processed via :ref:`Process Item In Queue `. .. _process_item_in_queue: |process_item_in_queue| Process Item In Queue ============================================= .. |process_item_in_queue| image:: ../../png/process_file.png :width: 60 Description ----------- Processes a next-in-the-line file in the file queue. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - New variable name (req.) - string - Name of the variable storing the data from the processed file. .. _delete_file: |delete_file| Delete File ========================= .. |delete_file| image:: ../../png/delete_file.png :width: 60 Description ----------- Deletes a specified file as if by using ``rm `` command on UNIX systems or ``del `` on Win/DOS). Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - File name (req.) - string - Path to the file, i.e. */Users/Delete/This/file.txt*. .. _download_image: |download_image| Download Image =================================== .. |download_image| image:: ../../png/download_image.png :width: 60 Description ----------- Downloads image from URL. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - Image URL (required) - string - URL with image * - Output filename (required) - string - Name of image to be saved .. _download_images_xpath: |download_images_xpath| Download Images XPath ============================================== .. |download_images_xpath| image:: ../../png/download_image.png :width: 60 Description ----------- Downloads image using XPath. Can be used for downloading multiple images at once. Parameters ---------- .. list-table:: :header-rows: 1 * - Parameter - Type - Description * - XPath (required) - string - XPath of image element * - Output filename (required) - string - Name of image to be saved. In case of downloading multiple images will be used as a directory name.