🧩 Go Script Demo
- GitHub Repository: https://github.com/CafeScraper/GoScirptDemo
Required Files (Located in the Project Root Directory)
| File Name | Description |
|---|---|
| main.go | Script source code |
| main | Script entry executable (execution entry point); must be named main |
| input_schema.json | UI input form configuration |
| README.md | Project documentation |
| sdk.go | Core SDK functionality (located in GoSdk directory) |
| sdk_pd.go | Data processing enhancement module (located in GoSdk directory) |
| sdk_grpc_pd.go | Network communication module (located in GoSdk directory) |
Go Scripts Must Be Built into an Executable Before Uploading
⭐ Core SDK Files
📁 File Overview
The following three SDK files are required and must be placed in the root directory of the script:

| File Name | Main Function |
|---|---|
| sdk.go | Core SDK functionality |
| sdk_pd.go | Data processing enhancement module |
| sdk_grpc_pd.go | Network communication module |
🔧 Core Feature Usage Guide
1. Environment Parameters – Retrieve Script Input Configuration
When the script starts, configuration parameters (such as target website URLs or search keywords) can be passed in externally. Retrieve them through the SDK at startup.
You can reuse the same script to crawl different websites or datasets simply by changing the input parameters, without modifying the code.
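A minimal sketch of the idea, assuming the input arrives as a JSON object. The `ScriptInput` struct, its field names (`url`, `keyword`, `max_items`), and the `parseInput` helper are hypothetical and are not part of the SDK; in a real script, obtain the parameters through the method provided in sdk.go.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// ScriptInput is a hypothetical shape for the script's input parameters.
type ScriptInput struct {
	URL      string `json:"url"`
	Keyword  string `json:"keyword"`
	MaxItems int    `json:"max_items"`
}

// parseInput decodes the raw JSON input, keeps a default for max_items
// when the field is absent, and rejects input without a target URL.
func parseInput(raw []byte) (ScriptInput, error) {
	in := ScriptInput{MaxItems: 10} // default used when max_items is not supplied
	if err := json.Unmarshal(raw, &in); err != nil {
		return ScriptInput{}, fmt.Errorf("decode input: %w", err)
	}
	if in.URL == "" {
		return ScriptInput{}, fmt.Errorf("input field %q is required", "url")
	}
	return in, nil
}

func main() {
	raw := []byte(`{"url": "https://example.com", "keyword": "golang"}`)
	in, err := parseInput(raw)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("crawl %s for %q, up to %d items\n", in.URL, in.Keyword, in.MaxItems)
}
```

Validating the input once at startup keeps the rest of the script free of repeated empty-value checks.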
2. Runtime Logs – Record Script Execution Process
During execution, logs of different levels can be recorded and displayed in the platform console for monitoring and debugging:
- debug: Detailed debugging information (recommended during development)
- info: Normal execution flow
- warn: Warnings that do not stop execution
- error: Critical errors that require attention
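The four levels can be illustrated with Go's standard `log/slog` package (Go 1.21 or later). This is a stand-in for illustration only: the SDK's own logging calls are not shown in this document, and `parseLevel` and `newLogger` are hypothetical helpers.

```go
package main

import (
	"fmt"
	"io"
	"log/slog"
	"os"
	"strings"
)

// parseLevel maps the four level names listed above onto slog levels.
func parseLevel(name string) (slog.Level, error) {
	switch strings.ToLower(name) {
	case "debug":
		return slog.LevelDebug, nil
	case "info":
		return slog.LevelInfo, nil
	case "warn":
		return slog.LevelWarn, nil
	case "error":
		return slog.LevelError, nil
	}
	return slog.LevelInfo, fmt.Errorf("unknown log level %q", name)
}

// newLogger writes records at or above minLevel to w and drops the rest.
func newLogger(w io.Writer, minLevel slog.Level) *slog.Logger {
	return slog.New(slog.NewTextHandler(w, &slog.HandlerOptions{Level: minLevel}))
}

func main() {
	level, err := parseLevel("debug") // switch to "info" once the script is stable
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	logger := newLogger(os.Stdout, level)
	logger.Debug("parsed input", "url", "https://example.com")
	logger.Info("page fetched", "status", 200)
	logger.Warn("title missing, using fallback", "page", 3)
	logger.Error("push failed", "reason", "timeout")
}
```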
3. Result Submission – Sending Data Back to the Platform
After data collection, results must be returned to the platform in two steps.
Step 1: Define Table Headers (Required)
Before pushing data, define the table structure (similar to defining column headers in Excel):
- Label: Column name displayed to users
- Key: Unique identifier used in code
- Format: Data type, supported values:
  - "text" – string
  - "integer" – integer
  - "boolean" – true/false
  - "array" – list
  - "object" – dictionary/object
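As a sketch of what a header definition might look like in code, the example below models Label, Key, and Format as a struct and checks them before use. The `Header` type and `validateHeaders` function are hypothetical; the SDK's actual header types and calls may differ.

```go
package main

import "fmt"

// Header is a hypothetical type mirroring the Label/Key/Format fields above.
type Header struct {
	Label  string // column name displayed to users
	Key    string // unique identifier used in code
	Format string // one of the supported format values
}

// supportedFormats lists the format values named in this document.
var supportedFormats = map[string]bool{
	"text": true, "integer": true, "boolean": true, "array": true, "object": true,
}

// validateHeaders rejects empty keys, duplicate keys, and unsupported formats.
func validateHeaders(headers []Header) error {
	seen := make(map[string]bool, len(headers))
	for _, h := range headers {
		if h.Key == "" {
			return fmt.Errorf("header %q has an empty key", h.Label)
		}
		if seen[h.Key] {
			return fmt.Errorf("duplicate header key %q", h.Key)
		}
		if !supportedFormats[h.Format] {
			return fmt.Errorf("header %q: unsupported format %q", h.Key, h.Format)
		}
		seen[h.Key] = true
	}
	return nil
}

func main() {
	headers := []Header{
		{Label: "URL", Key: "url", Format: "text"},
		{Label: "Likes", Key: "likes", Format: "integer"},
		{Label: "Tags", Key: "tags", Format: "array"},
	}
	if err := validateHeaders(headers); err != nil {
		fmt.Println("invalid headers:", err)
		return
	}
	fmt.Println("headers are valid:", len(headers))
}
```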
Step 2: Push Data Row by Row
After defining headers, push the collected data one record at a time:
- The order of setting headers and pushing data does not matter
- Keys in pushed data must exactly match the keys defined in headers
- Data must be pushed one record at a time
- Logging after each push is recommended for traceability
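The key-matching and error-checking rules can be sketched as follows. The `pushFunc` type stands in for the SDK's push call, whose real signature is not shown in this document, and `checkRow` and `pushAll` are hypothetical helpers. The sketch assumes "exactly match" means a row has no missing keys and no extra keys.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// pushFunc stands in for the SDK's push call (hypothetical signature).
type pushFunc func(row map[string]any) error

// checkRow verifies that row contains exactly the keys in headerKeys.
func checkRow(headerKeys []string, row map[string]any) error {
	want := make(map[string]bool, len(headerKeys))
	for _, k := range headerKeys {
		want[k] = true
	}
	var problems []string
	for k := range row {
		if !want[k] {
			problems = append(problems, "unexpected key "+k)
		}
	}
	for _, k := range headerKeys {
		if _, ok := row[k]; !ok {
			problems = append(problems, "missing key "+k)
		}
	}
	if len(problems) > 0 {
		sort.Strings(problems) // map iteration order is random; keep messages stable
		return fmt.Errorf("row does not match headers: %s", strings.Join(problems, "; "))
	}
	return nil
}

// pushAll validates and pushes rows one at a time. It stops at the first
// failure and returns how many rows were pushed successfully.
func pushAll(headerKeys []string, rows []map[string]any, push pushFunc) (int, error) {
	for i, row := range rows {
		if err := checkRow(headerKeys, row); err != nil {
			return i, fmt.Errorf("row %d: %w", i, err)
		}
		if err := push(row); err != nil {
			return i, fmt.Errorf("row %d: push failed: %w", i, err)
		}
	}
	return len(rows), nil
}

func main() {
	keys := []string{"url", "content"}
	rows := []map[string]any{
		{"url": "https://example.com/1", "content": "first"},
		{"url": "https://example.com/2", "content": "second"},
	}
	push := func(row map[string]any) error {
		fmt.Println("pushed:", row["url"]) // log after each push for traceability
		return nil
	}
	n, err := pushAll(keys, rows, push)
	fmt.Println("rows pushed:", n, "error:", err)
}
```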
⚠️ Common Issues & Notes
- File placement: Ensure all SDK files are located in the script directory
- Imports: Use SDK or CafeSDK directly to access SDK functionality
- Key consistency: Data keys must exactly match table header keys
- Error handling: Always check return values, especially when pushing data
⭐ Script Entry File (main.go)
💡 Example Code
Automated Data Collection Script: Workflow & Principles
1. Script Overview
This script is an automation tool that works like a digital employee. It automatically opens target web pages (such as social media sites), extracts the required information, and organizes the data into structured tables.
2. How Does It Work?
The entire process can be simplified into four main stages:
Step 1: Receive Instructions (Input Parameters)
Before execution, you provide instructions such as:
- Target page URL
- Number of records to collect
Step 2: Stealth Preparation (Proxy Network Configuration)
To reliably access overseas or restricted websites, the script automatically configures a secure proxy channel.
Step 3: Automated Execution (Business Logic Processing)
This is the core stage where the script:
- Visits target pages
- Extracts titles, content, images, and other required data
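Steps 2 and 3 can be approximated with the Go standard library alone. In this sketch the proxy address, the `newProxyClient`, `fetchTitle`, and `extractTitle` helpers, and the regex-based title extraction are all illustrative assumptions; the demo script may configure its proxy and parse pages differently.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
	"regexp"
	"strings"
	"time"
)

// newProxyClient returns an HTTP client that routes requests through proxyAddr.
func newProxyClient(proxyAddr string) (*http.Client, error) {
	proxyURL, err := url.Parse(proxyAddr)
	if err != nil {
		return nil, fmt.Errorf("parse proxy address: %w", err)
	}
	if proxyURL.Scheme == "" || proxyURL.Host == "" {
		return nil, fmt.Errorf("proxy address %q needs a scheme and a host", proxyAddr)
	}
	return &http.Client{
		Transport: &http.Transport{Proxy: http.ProxyURL(proxyURL)},
		Timeout:   30 * time.Second,
	}, nil
}

// titleRe matches the contents of the first <title> element, ignoring case.
var titleRe = regexp.MustCompile(`(?is)<title[^>]*>(.*?)</title>`)

// extractTitle returns the trimmed page title, or "" when none is found.
func extractTitle(page string) string {
	m := titleRe.FindStringSubmatch(page)
	if m == nil {
		return ""
	}
	return strings.TrimSpace(m[1])
}

// fetchTitle downloads pageURL through client and extracts its title.
func fetchTitle(client *http.Client, pageURL string) (string, error) {
	resp, err := client.Get(pageURL)
	if err != nil {
		return "", fmt.Errorf("fetch %s: %w", pageURL, err)
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return "", fmt.Errorf("fetch %s: unexpected status %s", pageURL, resp.Status)
	}
	body, err := io.ReadAll(io.LimitReader(resp.Body, 1<<20)) // read at most 1 MiB
	if err != nil {
		return "", fmt.Errorf("read %s: %w", pageURL, err)
	}
	return extractTitle(string(body)), nil
}

func main() {
	// Placeholder proxy address; no request is sent in this demo.
	client, err := newProxyClient("http://proxy.example.com:8080")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("client timeout:", client.Timeout)
	// With a reachable proxy: title, err := fetchTitle(client, "https://example.com")
	fmt.Println(extractTitle("<html><head><title> Demo Page </title></head></html>"))
}
```

A regular expression is enough for a single well-known tag; for extracting content and images from real pages, a proper HTML parser is usually the more robust choice.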
Step 4: Result Reporting (Data Push & Table Generation)
After collection:
- Raw data is converted into standardized formats
- Results are saved to the system
- Table headers (e.g., “URL”, “Content”) are automatically generated
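The four stages can be tied together in a small skeleton like the one below. The `Input`, `Record`, and `Stages` types and the `run` function are hypothetical names used only to show the control flow; the stage bodies are stubs, not the demo script's real logic.

```go
package main

import (
	"errors"
	"fmt"
)

// Input holds the Step 1 instructions (hypothetical fields).
type Input struct {
	URL   string
	Limit int // maximum number of records to report; 0 means no limit
}

// Record is one collected row, keyed by header key.
type Record map[string]any

// Stages bundles the Step 2 to Step 4 callbacks so each can be swapped or stubbed.
type Stages struct {
	Configure func(in Input) error             // Step 2: proxy setup
	Collect   func(in Input) ([]Record, error) // Step 3: visit pages, extract data
	Report    func(records []Record) error     // Step 4: set headers, push rows
}

// run executes the stages in order and stops at the first failure.
// It returns the number of records handed to Report.
func run(in Input, s Stages) (int, error) {
	if in.URL == "" {
		return 0, errors.New("input: target URL is required")
	}
	if err := s.Configure(in); err != nil {
		return 0, fmt.Errorf("configure proxy: %w", err)
	}
	records, err := s.Collect(in)
	if err != nil {
		return 0, fmt.Errorf("collect: %w", err)
	}
	if in.Limit > 0 && len(records) > in.Limit {
		records = records[:in.Limit] // honor the requested record count
	}
	if err := s.Report(records); err != nil {
		return 0, fmt.Errorf("report: %w", err)
	}
	return len(records), nil
}

func main() {
	stages := Stages{
		Configure: func(in Input) error { return nil }, // stub: nothing to configure
		Collect: func(in Input) ([]Record, error) {
			return []Record{
				{"url": in.URL, "content": "first post"},
				{"url": in.URL, "content": "second post"},
			}, nil
		},
		Report: func(records []Record) error {
			for _, r := range records {
				fmt.Println(r["url"], r["content"])
			}
			return nil
		},
	}
	n, err := run(Input{URL: "https://example.com", Limit: 1}, stages)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("records reported:", n)
}
```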