User Manual - Portable PsyAgent

1. Project Overview

Portable PsyAgent is a portable psychological assessment agent system that supports multiple large model evaluators and local Ollama models. The system can perform multi-dimensional personality assessments on AI agents, generate detailed analysis reports, and support targeted enhanced stress testing.

2. Features

🧠 Multi-dimensional Personality Assessment

Supports Big Five personality trait analysis

🤖 Multi-evaluator Support

Supports OpenAI, Claude, Gemini, DeepSeek, GLM, Qwen, and local Ollama

🔧 Configuration-driven

Easily switch models and parameters through configuration files

📊 Detailed Analysis Reports

Generates comprehensive reports including motivation analysis, personality traits, and behavioral patterns

🛡️ Local Assessment

Supports fully localized Ollama model evaluation

🔍 Debug Logs

Complete conversation logs and debugging information

🚀 Batch Analysis

Automatically processes large numbers of assessment reports with intelligent batch processing and progress tracking

💪 Stress Testing

Supports emotional stress, cognitive traps, and context load stress testing

3. System Requirements

Python 3.8 or higher
Windows, Linux, or macOS operating system
At least 4GB RAM (8GB or higher recommended)
Disk space: At least 10GB available space

4. Installation Guide

1. Clone the Project

git clone https://github.com/ptreezh/AgentPsyAssessment

cd AgentPsyAssessment

2. Install Dependencies

# Install base dependencies

pip install -r requirements.txt

# Optional: Install Google Gemini support

pip install google-generativeai

3. Configure Environment Variables

Create a .env file or set environment variables:

# OpenAI

OPENAI_API_KEY=your_openai_key

# Anthropic Claude

ANTHROPIC_API_KEY=your_claude_key

# Google Gemini

GOOGLE_API_KEY=your_gemini_key

# Alibaba Cloud Qwen

DASHSCOPE_API_KEY=your_qwen_key

# DeepSeek

DEEPSEEK_API_KEY=your_deepseek_key

# GLM

GLM_API_KEY=your_glm_key

5. Quick Start

Using Ollama Local Models (Recommended)

Install Ollama

# Windows

# Download from https://ollama.ai/download

# Linux

curl -fsSL https://ollama.ai/install.sh | sh

# macOS

brew install ollama

Download Models

# Start Ollama service

ollama serve

# Download recommended models

ollama pull llama3:latest

ollama pull qwen3:8b

ollama pull mistral-nemo:latest

Basic Assessment

# Use default evaluator

python shared_analysis/analyze_results.py data/your_data.json

# Use specific evaluators

python shared_analysis/analyze_results.py data/your_data.json --evaluators gpt claude

# Use local Ollama evaluators

python shared_analysis/analyze_results.py data/your_data.json --evaluators ollama_llama3 ollama_qwen3

6. Detailed Usage

Project Structure

portable_psyagent/
├── llm_assessment/          # Assessment module
│   ├── run_assessment_unified.py  # Unified assessment entry
│   ├── roles/               # Role definition files
│   ├── test_files/          # Test question banks
│   ├── results/             # Assessment results
│   └── services/            # Core services
├── shared_analysis/         # Analysis module
│   ├── analyze_results.py   # Results analysis main program
│   ├── analyze_big5_results.py  # Big Five analysis
│   ├── analyze_motivation.py    # Motivation analysis
│   └── ollama_evaluator.py  # Ollama evaluator
├── interference_materials/  # Interference materials (stress testing)
├── config/                  # Configuration files
├── batch_analysis_output/   # Batch analysis output
└── docs/                    # Documentation

7. Assessment Module

Running Assessments

# Basic assessment

python llm_assessment/run_assessment_unified.py --model_name gemma3:latest --test_file big5 --role_name a1

# Assessment with stress testing

python llm_assessment/run_assessment_unified.py --model_name gemma3:latest --test_file big5 --role_name a1 --emotional-stress-level 3 --cognitive-trap-type p

# Set temperature parameter

python llm_assessment/run_assessment_unified.py --model_name gemma3:latest --test_file big5 --role_name a1 --tmpr 0.7

# Set context length

python llm_assessment/run_assessment_unified.py --model_name gemma3:latest --test_file big5 --role_name a1 --context-length-mode static --context-length-static 4

Assessment Parameters

--model_name: Model identifier (e.g., ollama/gemma3:latest)
--test_file: Test file name or path
--role_name: Role name
--emotional-stress-level: Emotional stress level (0-4)
--cognitive-trap-type: Cognitive trap type (p, c, s, r)
--tmpr: Model temperature setting
--context-length-mode: Context length mode (auto, static, dynamic, none)
--timeout: Model response timeout (seconds)

8. Analysis Module

Motivation Analysis

# Run motivation analysis (no API required)

python shared_analysis/analyze_motivation.py data/your_data.json --debug

Big Five Personality Analysis

# Basic Big Five analysis

python shared_analysis/analyze_big5_results.py data/your_data.json

Comprehensive Analysis

# Use default evaluator

python shared_analysis/analyze_results.py data/your_data.json

# Use specific evaluators

python shared_analysis/analyze_results.py data/your_data.json --evaluators gpt claude

# Use local Ollama evaluators

python shared_analysis/analyze_results.py data/your_data.json --evaluators ollama_llama3 ollama_qwen3

9. Stress Testing Module

Supported Stress Test Types

Emotional Stress Testing - Affects model performance through different levels of emotional stress
Cognitive Trap Testing - Introduces paradoxes, circularity, semantic fallacies, and procedural traps
Context Load Testing - Tests model processing capabilities by increasing context length

Stress Testing Parameters

-esL, --emotional-stress-level: Emotional stress level (0-4)
-ct, --cognitive-trap-type: Cognitive trap type
- p: Paradox trap
- c: Circularity trap
- s: Semantic fallacy trap
- r: Procedural trap
--context-length-mode: Context length mode
- auto: Automatic detection
- static: Fixed length
- dynamic: Dynamic ratio
- none: Disable context injection

10. Configuration Files

Ollama Configuration (`config/ollama_config.json`)

{
  "ollama": {
    "base_url": "http://localhost:11434",
    "timeout": 120,
    "models": {
      "llama3": {
        "name": "llama3:latest",
        "temperature": 0.1,
        "max_tokens": 1024,
        "description": "Meta Llama 3 - General-purpose large model"
      },
      "qwen3": {
        "name": "qwen3:8b",
        "temperature": 0.1,
        "max_tokens": 1024,
        "description": "Alibaba Cloud Qwen3 - 8B parameter version"
      },
      "mistral": {
        "name": "mistral-nemo:latest",
        "temperature": 0.1,
        "max_tokens": 1024,
        "description": "Mistral NeMo - High-performance reasoning model"
      }
    }
  },
  "evaluators": {
    "ollama_llama3": {
      "provider": "ollama",
      "model": "llama3",
      "description": "Llama3 local evaluator"
    },
    "ollama_qwen3": {
      "provider": "ollama",
      "model": "qwen3",
      "description": "Qwen3 local evaluator"
    },
    "ollama_mistral": {
      "provider": "ollama",
      "model": "mistral",
      "description": "Mistral NeMo local evaluator"
    }
  }
}

11. Batch Processing

Batch Analysis

# View file statistics

python ultimate_batch_analysis.py --stats

# Quick test (5 files)

python ultimate_batch_analysis.py --quick

# Analyze specific model (e.g., deepseek)

python ultimate_batch_analysis.py --filter deepseek

# Complete batch analysis (all 294 files)

python ultimate_batch_analysis.py

# One-click start for Windows users

start_batch_analysis.bat

Supported Assessment Data

The system supports automatic analysis of assessment reports in the results/results directory, containing test data from multiple model series.

Batch Analysis Features

🔄 Automatic Format Conversion - Supports original assessment data formats
📊 Intelligent Batch Processing - Supports resume and error recovery
⏱️ Progress Tracking - Real-time display of analysis progress and estimated time
📋 Detailed Reports - Generates JSON and Markdown format summaries
🎯 Flexible Filtering - Filters by model, sample size, and other conditions

12. Troubleshooting

Common Issues

Ollama Connection Failure
# Check Ollama service
ollama ps
curl http://localhost:11434/api/tags
Batch Analysis Interrupted
# Check output directory
ls -la batch_analysis_results/

# Re-run (will automatically skip completed files)
python ultimate_batch_analysis.py --filter deepseek
Insufficient Memory
# Reduce batch size
python ultimate_batch_analysis.py --sample 10
API Key Issues
# Check environment variables
echo $OPENAI_API_KEY
Missing Modules
# Install missing dependencies
pip install google-generativeai

Debug Mode

# Enable detailed debug output

python shared_analysis/analyze_results.py data.json --evaluators ollama_llama3

Check log files:

logs/evaluator_conversation_log.txt - Conversation logs
logs/debug_info.json - Debug information

13. API Reference

Assessment API

run_assessment_unified.py

Main parameters:

--model_name (required): Model identifier
--test_file (required): Test file
--role_name: Role name
--debug: Enable debug mode
--test_connection: Test model connectivity only

Stress testing parameters:

--emotional-stress-level: Emotional stress level (0-4)
--cognitive-trap-type: Cognitive trap type (p, c, s, r)
--tmpr: Model temperature setting
--context-length-mode: Context length mode
--timeout: Response timeout

Analysis API

analyze_results.py

Main functions:

Comprehensive analysis of assessment results
Generates Big Five and MBTI personality assessments
Supports multiple evaluators

analyze_motivation.py

Main functions:

Motivation analysis of assessment results
Generates motivation test reports
Supports Markdown format output

analyze_big5_results.py

Main functions:

Specialized analysis of Big Five personality traits
Generates detailed Big Five scoring reports

14. License

This project is for research and educational purposes only.

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program. If not, see <https://www.gnu.org/licenses/>.

Portable PsyAgent User Manual

Table of Contents

1. Project Overview

2. Features

3. System Requirements

4. Installation Guide

1. Clone the Project

2. Install Dependencies

3. Configure Environment Variables

5. Quick Start

Using Ollama Local Models (Recommended)

Install Ollama

Download Models

Basic Assessment

6. Detailed Usage

Project Structure

7. Assessment Module

Running Assessments

Assessment Parameters

8. Analysis Module

Motivation Analysis

Big Five Personality Analysis

Comprehensive Analysis

9. Stress Testing Module

Supported Stress Test Types

Stress Testing Parameters

10. Configuration Files

Ollama Configuration (config/ollama_config.json)

11. Batch Processing

Batch Analysis

Supported Assessment Data

Batch Analysis Features

12. Troubleshooting

Common Issues

Debug Mode

13. API Reference

Assessment API

run_assessment_unified.py

Analysis API

analyze_results.py

analyze_motivation.py

analyze_big5_results.py

14. License

Ready to Start Using Portable PsyAgent?

Ollama Configuration (`config/ollama_config.json`)