Author: Tiger, a member of [HKUST Dial](https://github.com/HKUSTDial)

Last update: September 09, 2025
## 🎯 Objectives

This workflow tracks daily updates on arXiv.org. Paper information is preprocessed and summarized by a series of modules, then posted to a Feishu group chat for reading. It is intended for the education and research community.

> 💰 Cost: less than 0.05 CNY per workflow execution.

## ✨ Key Features

- 📚 Automatically fetch the latest arXiv papers
- 🤖 AI-powered paper summarization and filtering
- 📱 Auto-send to a Feishu group chat
- ⏰ Automated scheduling with GitHub Actions
- 🛠️ Local debugging script support

## 📋 Prerequisites

Before getting started, please ensure you have prepared the following accounts and services:

1. **[Dify](https://dify.ai/) account** - Free registration for building AI workflows

The project provides an integrated scheduler that triggers Dify-side workflows on a schedule. Because the Dify workflow already handles Feishu messaging internally, the scheduler only executes the workflow and logs the results.
#### Quick Setup:

1. **Configure GitHub Secrets**:
   - Go to repository Settings > Secrets and variables > Actions > New repository secret
   - Add the secret `DIFY_TOKENS`: your Dify workflow API token (separate multiple tokens with `;`)

2. **Enable GitHub Actions**: Go to the repository's Actions tab and enable workflows

3. **Automatic Execution**: The scheduler runs automatically according to the timing rules defined in [dify-scheduler.yml](.github/workflows/dify-scheduler.yml). For cron syntax details, see [cron.help](https://cron.help/).
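
GitHub Actions evaluates `schedule` cron expressions in UTC, so a time expressed in Beijing Time (UTC+8) has to be shifted before it is written into the workflow file. The following is a minimal sketch of that conversion; the helper name `beijing_to_utc_cron` and the 06:30 run time are illustrative assumptions, not values read from `dify-scheduler.yml`.

```python
def beijing_to_utc_cron(hour: int, minute: int) -> str:
    """Return a daily UTC cron expression for a Beijing Time (UTC+8) clock time.

    Hypothetical helper for illustration; it is not part of the project.
    """
    if not (0 <= hour <= 23 and 0 <= minute <= 59):
        raise ValueError("hour must be 0-23 and minute must be 0-59")
    # Beijing Time has no daylight saving time, so the offset is a fixed 8 hours.
    utc_hour = (hour - 8) % 24
    return f"{minute} {utc_hour} * * *"


print(beijing_to_utc_cron(6, 30))  # 30 22 * * *
```

A 06:30 Beijing run therefore maps to 22:30 UTC on the previous calendar day, which makes no difference for a rule that fires every day.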
### Manual Execution:

- **GitHub Actions**: Go to the Actions tab > "Dify ArxivFlow Scheduler" > "Run workflow"

```bash
npm start
```
### 📱 Final Result

The scheduler will automatically:
- ❌ Report any errors to GitHub Actions logs
- 🔄 Support multiple workflows if needed
## 🔧 Environment Variables Configuration

### GitHub Actions Secrets (Required):

- `DIFY_TOKENS`: Your Dify workflow API token; separate the tokens of multiple workflows with `;`

### Optional Configuration:

- `DIFY_BASE_URL`: Dify API base URL (default: `https://api.dify.ai/v1`)
- `DIFY_INPUTS`: Workflow input variables in JSON format (default: `{}`)
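
To make the three variables concrete, here is a minimal Python sketch of how a scheduler might read them and prepare one request per token. It is an illustration under stated assumptions, not the project's implementation: the project's own scheduler is started with `npm start` and may behave differently. The function names `load_config` and `build_request`, the `app-...` tokens, and the `"scheduler"` user label are hypothetical, and the request shape assumes Dify's `POST /workflows/run` endpoint with a Bearer token.

```python
import json
import os
import urllib.request

DEFAULT_BASE_URL = "https://api.dify.ai/v1"


def load_config(env=os.environ):
    """Read DIFY_TOKENS, DIFY_BASE_URL and DIFY_INPUTS from an environment mapping."""
    raw_tokens = env.get("DIFY_TOKENS") or ""
    tokens = [t.strip() for t in raw_tokens.split(";") if t.strip()]
    if not tokens:
        raise ValueError("DIFY_TOKENS is required")
    # Unset GitHub secrets arrive as empty strings, so fall back on falsy values too.
    base_url = (env.get("DIFY_BASE_URL") or DEFAULT_BASE_URL).rstrip("/")
    inputs = json.loads(env.get("DIFY_INPUTS") or "{}")
    return tokens, base_url, inputs


def build_request(token, base_url, inputs):
    """Build (but do not send) a workflow-run request for one token."""
    body = json.dumps(
        {"inputs": inputs, "response_mode": "blocking", "user": "scheduler"}
    )
    return urllib.request.Request(
        f"{base_url}/workflows/run",
        data=body.encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder tokens for illustration; real tokens come from the repository secret.
tokens, base_url, inputs = load_config({"DIFY_TOKENS": "app-first;app-second"})
for token in tokens:
    request = build_request(token, base_url, inputs)
    print(request.get_method(), request.full_url)
```

Sending each request with `urllib.request.urlopen(request)` would trigger the corresponding workflow; the sketch stops before that step so it can run without network access or valid credentials.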
### Dify Workflow Internal Environment Variables:

- `FEISHU_DEV` / `FEISHU_PROD`: Feishu group bot webhooks for the testing and production environments
- `JINA`: API key for crawling arXiv search results
- `KEYWORDS`: Keywords for arXiv paper search, comma-separated
  - The number of keywords must match the number of scheduled runs defined by the timing rules in GitHub Actions
  - Example: for 4 pushes per day, `KEYWORDS` needs 4 keywords and the timing rules need 4 time points
- `PAPER_NUM_MAX`: Maximum number of papers per message (limited by Feishu message length)
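
The matching rule between `KEYWORDS` and the timing rules can be checked before deployment. The sketch below is illustrative only: the function names, the sample keywords, and the sample cron rules are hypothetical, and pairing keywords with runs in order is an assumption, since the text above only states that the counts must match.

```python
def parse_keywords(raw: str) -> list[str]:
    """Split the comma-separated KEYWORDS value, dropping blank entries."""
    return [k.strip() for k in raw.split(",") if k.strip()]


def check_schedule(keywords: list[str], cron_rules: list[str]) -> None:
    """Raise if the keyword count differs from the number of timing rules."""
    if len(keywords) != len(cron_rules):
        raise ValueError(
            f"{len(keywords)} keywords but {len(cron_rules)} timing rules"
        )


def cap_papers(papers: list[dict], paper_num_max: int) -> list[dict]:
    """Keep at most PAPER_NUM_MAX papers for a single Feishu message."""
    return papers[:paper_num_max]


# Illustrative values: four pushes per day need four keywords and four rules.
keywords = parse_keywords("LLM agents, text-to-SQL, data visualization, RAG")
cron_rules = ["30 22 * * *", "30 1 * * *", "30 4 * * *", "30 7 * * *"]
check_schedule(keywords, cron_rules)

# Assumption: the n-th scheduled run uses the n-th keyword.
for rule, keyword in zip(cron_rules, keywords):
    print(f"{rule} -> {keyword}")
```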
## 🛠️ Debugging Scripts

The `/scripts` folder contains scripts for local debugging and testing that simulate the processes used in the Dify workflow:

- **`jina_extract.py`**: Simulates Jina API calls and the paper information extraction logic
- **`sample.text`**: Sample data returned by the Jina API, for local testing
- **`extracted_papers.json`**: Example of structured paper data after extraction; it serves as input for the downstream LLM analysis in the workflow

These scripts help you test and debug the paper extraction logic without consuming API credits.

### Usage for Local Development:

```bash
cd scripts
python jina_extract.py
```
## 🤝 Acknowledgement

- Dify Official Guidance: [Link](https://docs.dify.ai/docs/workflow/overview)
- Feishu - How to use Bot in Group Chat: [Link (Chinese)](https://www.feishu.cn/hc/zh-CN/articles/360024984973-%E5%9C%A8%E7%BE%A4%E7%BB%84%E4%B8%AD%E4%BD%BF%E7%94%A8%E6%9C%BA%E5%99%A8%E4%BA%BA?from=in-im-bot)