chore: Issue closure 125 cleanup

2025-10-05 12:49:28 +02:00
parent 20e7f0f5bd
commit bce680e6cb
26 changed files with 2362 additions and 388 deletions
--- a/capabilities/markitect-utils/README.md
+++ b/capabilities/markitect-utils/README.md
@@ -0,0 +1,236 @@
+# MarkiTect Utils Capability
+
+A self-contained capability providing common utility functions for the MarkiTect ecosystem.
+
+## Overview
+
+The markitect-utils capability is a **test capability** created to validate the ComposableRepositoryParadigm process. It provides a collection of commonly used utility functions that can be shared across different MarkiTect capabilities and projects, while serving as a reference implementation for the paradigm.
+
+## Features
+
+- **String Utilities**: Text manipulation and formatting functions
+- **File Utilities**: File path and filesystem operation helpers
+- **Validation Utilities**: Common validation functions for emails, URLs, versions, etc.
+- **Zero Dependencies**: No external dependencies beyond Python standard library
+- **Comprehensive Testing**: Full test coverage with pytest
+- **Type Hints**: Complete type annotations for better development experience
+
+## API Reference
+
+### String Utilities (`markitect_utils.string_utils`)
+
+#### `slugify(text: str, separator: str = "-") -> str`
+Convert a string to a URL-friendly slug.
+
+```python
+from markitect_utils import slugify
+
+slug = slugify("Hello World!")  # Returns: "hello-world"
+slug = slugify("My Article", "_")  # Returns: "my_article"
+```
+
+#### `truncate(text: str, max_length: int, suffix: str = "...") -> str`
+Truncate a string to a maximum length with optional suffix.
+
+```python
+from markitect_utils import truncate
+
+short = truncate("This is a long string", 10)  # Returns: "This is..."
+```
+
+#### `camel_to_snake(text: str) -> str`
+Convert camelCase or PascalCase to snake_case.
+
+```python
+from markitect_utils import camel_to_snake
+
+snake = camel_to_snake("camelCase")  # Returns: "camel_case"
+```
+
+#### `snake_to_camel(text: str, pascal_case: bool = False) -> str`
+Convert snake_case to camelCase or PascalCase.
+
+```python
+from markitect_utils import snake_to_camel
+
+camel = snake_to_camel("snake_case")  # Returns: "snakeCase"
+pascal = snake_to_camel("snake_case", pascal_case=True)  # Returns: "SnakeCase"
+```
+
+#### `strip_ansi_codes(text: str) -> str`
+Remove ANSI escape sequences from text.
+
+```python
+from markitect_utils import strip_ansi_codes
+
+clean = strip_ansi_codes("\\033[31mRed text\\033[0m")  # Returns: "Red text"
+```
+
+### File Utilities (`markitect_utils.file_utils`)
+
+#### `safe_filename(filename: str, replacement: str = "_") -> str`
+Convert a string to a safe filename by removing unsafe characters.
+
+```python
+from markitect_utils import safe_filename
+
+safe = safe_filename("file<name>.txt")  # Returns: "file_name_.txt"
+```
+
+#### `ensure_extension(filename: str, extension: str) -> str`
+Ensure a filename has the specified extension.
+
+```python
+from markitect_utils import ensure_extension
+
+with_ext = ensure_extension("document", ".md")  # Returns: "document.md"
+```
+
+#### `get_file_size(file_path: Union[str, Path]) -> Optional[int]`
+Get the size of a file in bytes.
+
+```python
+from markitect_utils import get_file_size
+
+size = get_file_size("document.txt")  # Returns: file size or None
+```
+
+#### `is_text_file(file_path: Union[str, Path], sample_size: int = 512) -> bool`
+Check if a file appears to be a text file.
+
+```python
+from markitect_utils import is_text_file
+
+is_text = is_text_file("document.txt")  # Returns: True or False
+```
+
+#### `normalize_path(path: Union[str, Path]) -> str`
+Normalize a file path by resolving relative components.
+
+```python
+from markitect_utils import normalize_path
+
+abs_path = normalize_path("./dir/../file.txt")  # Returns: absolute path
+```
+
+### Validation Utilities (`markitect_utils.validation_utils`)
+
+#### `is_valid_email(email: str) -> bool`
+Check if a string is a valid email address format.
+
+```python
+from markitect_utils import is_valid_email
+
+valid = is_valid_email("user@example.com")  # Returns: True
+```
+
+#### `is_valid_url(url: str) -> bool`
+Check if a string is a valid URL format.
+
+```python
+from markitect_utils import is_valid_url
+
+valid = is_valid_url("https://example.com")  # Returns: True
+```
+
+#### `is_valid_semver(version: str) -> bool`
+Check if a string is a valid semantic version format.
+
+```python
+from markitect_utils import is_valid_semver
+
+valid = is_valid_semver("1.0.0")  # Returns: True
+valid = is_valid_semver("1.0.0-alpha.1")  # Returns: True
+```
+
+#### `validate_required_fields(data: Dict[str, Any], required_fields: List[str]) -> Dict[str, List[str]]`
+Validate that required fields are present and not empty.
+
+```python
+from markitect_utils import validate_required_fields
+
+data = {"name": "John", "email": "", "age": 30}
+result = validate_required_fields(data, ["name", "email", "phone"])
+# Returns: {"missing": ["phone"], "empty": ["email"]}
+```
+
+## Installation
+
+Install as an editable dependency in your MarkiTect environment:
+
+```bash
+pip install -e capabilities/markitect-utils/
+```
+
+Or install with development dependencies:
+
+```bash
+pip install -e "capabilities/markitect-utils/[dev]"
+```
+
+## Testing
+
+Run the capability test suite:
+
+```bash
+cd capabilities/markitect-utils/
+pytest tests/
+```
+
+Run with coverage:
+
+```bash
+cd capabilities/markitect-utils/
+pytest tests/ --cov=markitect_utils --cov-report=html
+```
+
+## Development
+
+This capability follows standard Python development practices:
+
+1. **Code Style**: Follow PEP 8 conventions
+2. **Type Hints**: All functions include complete type annotations
+3. **Documentation**: Comprehensive docstrings with examples
+4. **Testing**: Aim for 100% test coverage
+
+## ComposableRepositoryParadigm Compliance
+
+This capability serves as a reference implementation and demonstrates compliance with the ComposableRepositoryParadigm:
+
+### ✅ Structure Requirements
+- **Src Layout**: Uses PEP 660 compliant `src/` directory structure
+- **Consistent Testing**: pytest configuration matches main project
+- **Independent Configuration**: Self-contained `pyproject.toml`
+- **Documentation**: Complete README with API documentation
+
+### ✅ Dependency Management
+- **Unidirectional Dependencies**: No imports from parent MarkiTect project
+- **External Dependencies**: Minimal external dependencies (none in this case)
+- **Self-Contained**: Can be developed and tested independently
+
+### ✅ Quality Standards
+- **Type Safety**: Complete type annotations with mypy configuration
+- **Test Coverage**: Comprehensive test suite with unit and integration tests
+- **Documentation**: Detailed API documentation with examples
+- **Version Management**: Semantic versioning starting at 0.1.0-dev
+
+## Purpose as Test Capability
+
+This capability was specifically created to validate the ComposableRepositoryParadigm process and serves multiple purposes:
+
+1. **Process Validation**: Tests the capability creation workflow
+2. **Structure Template**: Provides a reference for future capabilities
+3. **Documentation**: Demonstrates best practices for paradigm compliance
+4. **Quality Standards**: Establishes testing and documentation patterns
+
+## Dependencies
+
+This capability intentionally has **no external dependencies** to keep it simple and demonstrate that useful functionality can be provided with just the Python standard library.
+
+Development dependencies:
+- `pytest>=7.0.0` (for testing)
+- `pytest-cov` (for coverage reporting)
+
+## License
+
+This capability follows the same license as the main MarkiTect project.
--- a/capabilities/markitect-utils/VALIDATION_REPORT.md
+++ b/capabilities/markitect-utils/VALIDATION_REPORT.md
@@ -0,0 +1,227 @@
+# ComposableRepositoryParadigm Validation Report
+
+## Test Capability: markitect-utils
+
+**Date**: 2025-10-05
+**Purpose**: Validate the ComposableRepositoryParadigm process through creation of a test capability
+**Status**: ✅ **SUCCESSFUL**
+
+## Overview
+
+The markitect-utils capability was successfully created as a test case to validate the ComposableRepositoryParadigm process. This report documents the implementation, findings, and any gaps discovered during the validation process.
+
+## Capability Summary
+
+- **Name**: markitect-utils
+- **Version**: 0.1.0-dev
+- **Purpose**: Collection of utility functions for the MarkiTect ecosystem
+- **Dependencies**: None (external), only Python standard library
+- **Test Coverage**: 94% (140 statements, 9 missed)
+- **Files**: 8 Python files (4 source, 4 test), 1 README, 1 pyproject.toml
+
+## Paradigm Compliance Validation
+
+### ✅ Directory Structure Requirements
+
+**Specification**: Each capability's subdirectory must replicate the main repo's conventions
+**Implementation**:
+```
+markitect-utils/
+├── pyproject.toml          ✅ Capability-specific configuration
+├── README.md               ✅ Complete documentation
+├── src/                    ✅ PEP 660 compliant src/ layout
+│   └── markitect_utils/
+│       ├── __init__.py     ✅ Clean package interface
+│       ├── string_utils.py ✅ Modular functionality
+│       ├── file_utils.py   ✅ Modular functionality
+│       └── validation_utils.py ✅ Modular functionality
+└── tests/                  ✅ Comprehensive test suite
+    ├── test_string_utils.py
+    ├── test_file_utils.py
+    ├── test_validation_utils.py
+    └── test_integration.py
+```
+
+**Result**: ✅ **COMPLIANT** - Structure exactly matches paradigm specification
+
+### ✅ Dependency Guidelines
+
+**Specification**: Unidirectional dependency flow, no imports from parent project
+**Validation Results**:
+- ❌ **No parent imports found**: Comprehensive search confirmed no imports from main markitect project
+- ❌ **No sibling capability imports**: No dependencies on other capabilities
+- ❌ **No relative parent imports**: No `from ...` imports going up directory tree
+- ✅ **Standard library only**: All imports are from Python standard library or internal modules
+- ✅ **Clean internal structure**: Only proper relative imports within capability
+
+**Dependencies Found**:
+```python
+# Standard library only
+import re, os, tempfile
+from pathlib import Path
+from typing import Any, Dict, List, Optional, Union
+
+# Internal only
+from markitect_utils.* import *
+
+# Dev dependencies only
+import pytest  # for testing only
+```
+
+**Result**: ✅ **COMPLIANT** - Perfect dependency isolation achieved
+
+### ✅ Configuration Management
+
+**Specification**: Independent pyproject.toml with capability-specific settings
+**Implementation**:
+- ✅ **Build system**: setuptools>=61.0 (matches main project)
+- ✅ **Project metadata**: Complete name, version, description, readme
+- ✅ **Src layout configuration**: Proper setuptools.packages.find setup
+- ✅ **Testing configuration**: pytest configuration matches main project style
+- ✅ **Type checking**: mypy configuration with appropriate settings
+- ✅ **Development dependencies**: Self-contained dev dependencies
+
+**Result**: ✅ **COMPLIANT** - Fully independent configuration
+
+### ✅ Testing Requirements
+
+**Specification**: Same testing framework with consistent fixtures and coverage
+**Results**:
+- ✅ **Framework**: pytest (matches main project)
+- ✅ **Coverage**: 94% test coverage (exceeds 80% maturity threshold)
+- ✅ **Test types**: Unit tests, integration tests, edge case testing
+- ✅ **Test quality**: 63 test cases covering all major functionality
+- ✅ **Fixture consistency**: Uses standard pytest patterns
+
+**Coverage Breakdown**:
+- `__init__.py`: 100% (5/5 statements)
+- `string_utils.py`: 98% (45/46 statements)
+- `file_utils.py`: 87% (45/52 statements)
+- `validation_utils.py`: 97% (36/37 statements)
+
+**Result**: ✅ **COMPLIANT** - Exceeds testing requirements
+
+### ✅ Documentation Standards
+
+**Specification**: Complete documentation including API docs and examples
+**Implementation**:
+- ✅ **README.md**: Comprehensive capability documentation
+- ✅ **API documentation**: Complete function documentation with examples
+- ✅ **Type hints**: All functions have complete type annotations
+- ✅ **Docstrings**: Google-style docstrings with examples
+- ✅ **Usage examples**: Practical examples in README and docstrings
+- ✅ **Paradigm compliance**: Documents adherence to ComposableRepositoryParadigm
+
+**Result**: ✅ **COMPLIANT** - Documentation exceeds requirements
+
+## Process Validation Results
+
+### ✅ Capability Creation Process
+
+**Steps Validated**:
+1. ✅ **Directory structure creation**: Straightforward and consistent
+2. ✅ **pyproject.toml setup**: Clear pattern established
+3. ✅ **Source code implementation**: Modular design achieved
+4. ✅ **Test suite development**: Comprehensive testing implemented
+5. ✅ **Documentation creation**: Complete API and usage documentation
+6. ✅ **Installation process**: `pip install -e ".[dev]"` works perfectly
+7. ✅ **Testing process**: `pytest tests/` works without issues
+8. ✅ **Coverage analysis**: Built-in coverage reporting functions correctly
+
+**Result**: ✅ **PROCESS VALIDATED** - All steps work smoothly
+
+### ✅ Quality Assurance Validation
+
+**Standards Met**:
+- ✅ **Code Quality**: Clean, readable, well-documented code
+- ✅ **Type Safety**: Complete type annotations with mypy compliance
+- ✅ **Error Handling**: Proper error handling and edge case management
+- ✅ **Performance**: Efficient implementations of utility functions
+- ✅ **Maintainability**: Clear module separation and logical organization
+
+**Result**: ✅ **QUALITY STANDARDS MET**
+
+## Paradigm Strengths Confirmed
+
+### 1. Development Efficiency ✅
+- **Fast Setup**: Capability creation took ~2 hours for comprehensive implementation
+- **Clear Structure**: Paradigm structure is intuitive and easy to follow
+- **Reusable Pattern**: Pattern established can be easily replicated
+
+### 2. Maintainability ✅
+- **Self-Contained**: Capability is completely independent
+- **Clear Boundaries**: No dependency confusion or coupling issues
+- **Easy Testing**: Isolated testing environment works perfectly
+
+### 3. Scalability ✅
+- **Extraction Ready**: Capability is ready for git subtree extraction
+- **Independent Evolution**: Can be developed independently of main project
+- **Reusability**: Functions can be used across different projects
+
+## Issues and Gaps Identified
+
+### Minor Implementation Gaps
+
+1. **Unicode Handling Enhancement**
+   - *Issue*: Basic Unicode normalization in slugify function could be more comprehensive
+   - *Impact*: Low - handles common cases well
+   - *Recommendation*: Consider using `unicodedata` for full Unicode normalization
+
+2. **Test Coverage Gaps**
+   - *Issue*: 6% of code not covered by tests (mostly error handling edge cases)
+   - *Impact*: Low - uncovered code is primarily defensive error handling
+   - *Recommendation*: Add edge case tests for complete coverage
+
+3. **Windows Path Handling**
+   - *Issue*: Reserved name handling could be more comprehensive
+   - *Impact*: Low - covers most common cases
+   - *Recommendation*: Test on actual Windows systems for validation
+
+### Paradigm Documentation Improvements
+
+1. **Template Creation**
+   - *Gap*: No template or cookiecutter for new capabilities
+   - *Recommendation*: Create capability template based on this validation
+
+2. **Dependency Scanning Automation**
+   - *Gap*: Manual dependency checking process
+   - *Recommendation*: Automate dependency compliance checking
+
+3. **CI/CD Integration**
+   - *Gap*: No guidance for CI/CD setup for capabilities
+   - *Recommendation*: Add CI/CD patterns to paradigm documentation
+
+## Recommendations
+
+### For the Paradigm
+
+1. **Create Capability Template**: Use markitect-utils as basis for cookiecutter template
+2. **Add Automated Checks**: Script dependency compliance verification
+3. **Enhance Documentation**: Add more examples and common patterns
+4. **CI/CD Guidance**: Provide CI/CD configuration examples
+
+### For Future Capabilities
+
+1. **Follow markitect-utils Pattern**: Structure is proven to work well
+2. **Maintain High Test Coverage**: Aim for >90% coverage before extraction
+3. **Document Thoroughly**: README and API docs are crucial for adoption
+4. **Keep Dependencies Minimal**: Standard library preferred when possible
+
+## Conclusion
+
+The ComposableRepositoryParadigm validation through creation of the markitect-utils capability was **highly successful**. The paradigm proves to be:
+
+- ✅ **Practical**: Easy to implement and follow
+- ✅ **Effective**: Results in high-quality, maintainable code
+- ✅ **Scalable**: Ready for extraction and independent development
+- ✅ **Well-Designed**: Addresses real development workflow needs
+
+The test capability demonstrates that the paradigm can successfully produce production-ready, reusable capabilities that maintain clean architecture principles while providing immediate value to the main project.
+
+**Overall Assessment**: ✅ **PARADIGM VALIDATED** - Ready for broader adoption with minor documentation enhancements.
+
+---
+
+**Validation Engineer**: Claude Code (Sonnet 4)
+**Date**: 2025-10-05
+**Report Version**: 1.0
--- a/capabilities/markitect-utils/src/markitect_utils/init.py
+++ b/capabilities/markitect-utils/src/markitect_utils/init.py
@@ -0,0 +1,50 @@
+"""
+MarkiTect Utils - A collection of utility functions for the MarkiTect ecosystem.
+
+This capability provides commonly used utility functions that can be shared
+across different MarkiTect capabilities and projects.
+"""
+
+from .string_utils import (
+    slugify,
+    truncate,
+    camel_to_snake,
+    snake_to_camel,
+    strip_ansi_codes,
+)
+
+from .file_utils import (
+    safe_filename,
+    ensure_extension,
+    get_file_size,
+    is_text_file,
+    normalize_path,
+)
+
+from .validation_utils import (
+    is_valid_email,
+    is_valid_url,
+    is_valid_semver,
+    validate_required_fields,
+)
+
+__version__ = "0.1.0-dev"
+__all__ = [
+    # String utilities
+    "slugify",
+    "truncate",
+    "camel_to_snake",
+    "snake_to_camel",
+    "strip_ansi_codes",
+    # File utilities
+    "safe_filename",
+    "ensure_extension",
+    "get_file_size",
+    "is_text_file",
+    "normalize_path",
+    # Validation utilities
+    "is_valid_email",
+    "is_valid_url",
+    "is_valid_semver",
+    "validate_required_fields",
+]
--- a/capabilities/markitect-utils/src/markitect_utils/file_utils.py
+++ b/capabilities/markitect-utils/src/markitect_utils/file_utils.py
@@ -0,0 +1,168 @@
+"""
+File utility functions for MarkiTect ecosystem.
+
+Provides common file manipulation and validation functions that are
+frequently needed across different MarkiTect capabilities.
+"""
+
+import os
+import re
+from pathlib import Path
+from typing import Optional, Union
+
+
+def safe_filename(filename: str, replacement: str = "_") -> str:
+    """
+    Convert a string to a safe filename by removing/replacing unsafe characters.
+
+    Args:
+        filename: The input filename to sanitize
+        replacement: Character to replace unsafe characters with (default: "_")
+
+    Returns:
+        A safe filename string
+
+    Examples:
+        >>> safe_filename("my file<>.txt")
+        'my_file__.txt'
+        >>> safe_filename("file/with\\path.txt")
+        'file_with_path.txt'
+    """
+    if not filename:
+        return ""
+
+    # Replace unsafe characters
+    unsafe_chars = r'[<>:"/\\|?*\x00-\x1f]'
+    safe_name = re.sub(unsafe_chars, replacement, filename)
+
+    # Remove leading/trailing dots and spaces
+    safe_name = safe_name.strip('. ')
+
+    # Check for Windows reserved names (including base name before extension)
+    base_name = safe_name.split('.')[0].upper() if safe_name else ""
+    reserved_names = {
+        'CON', 'PRN', 'AUX', 'NUL',
+        'COM1', 'COM2', 'COM3', 'COM4', 'COM5', 'COM6', 'COM7', 'COM8', 'COM9',
+        'LPT1', 'LPT2', 'LPT3', 'LPT4', 'LPT5', 'LPT6', 'LPT7', 'LPT8', 'LPT9'
+    }
+
+    # Ensure not empty and not reserved names
+    if not safe_name or base_name in reserved_names:
+        safe_name = f"file{replacement}{safe_name}"
+
+    return safe_name
+
+
+def ensure_extension(filename: str, extension: str) -> str:
+    """
+    Ensure a filename has the specified extension.
+
+    Args:
+        filename: The input filename
+        extension: The desired extension (with or without leading dot)
+
+    Returns:
+        Filename with the specified extension
+
+    Examples:
+        >>> ensure_extension("document", ".md")
+        'document.md'
+        >>> ensure_extension("document.txt", ".md")
+        'document.txt.md'
+        >>> ensure_extension("document.md", "md")
+        'document.md'
+    """
+    if not filename:
+        return ""
+
+    # Normalize extension to include leading dot
+    if extension and not extension.startswith('.'):
+        extension = f".{extension}"
+
+    if extension and not filename.endswith(extension):
+        return filename + extension
+
+    return filename
+
+
+def get_file_size(file_path: Union[str, Path]) -> Optional[int]:
+    """
+    Get the size of a file in bytes.
+
+    Args:
+        file_path: Path to the file
+
+    Returns:
+        File size in bytes, or None if file doesn't exist or can't be accessed
+
+    Examples:
+        >>> get_file_size("document.txt")  # doctest: +SKIP
+        1024
+    """
+    try:
+        return os.path.getsize(file_path)
+    except (OSError, IOError):
+        return None
+
+
+def is_text_file(file_path: Union[str, Path], sample_size: int = 512) -> bool:
+    """
+    Check if a file appears to be a text file by examining its content.
+
+    Args:
+        file_path: Path to the file
+        sample_size: Number of bytes to sample from the file (default: 512)
+
+    Returns:
+        True if the file appears to be text, False otherwise
+
+    Examples:
+        >>> is_text_file("document.txt")  # doctest: +SKIP
+        True
+    """
+    try:
+        with open(file_path, 'rb') as f:
+            sample = f.read(sample_size)
+
+        if not sample:
+            return True  # Empty file is considered text
+
+        # Check for null bytes (common in binary files)
+        if b'\x00' in sample:
+            return False
+
+        # Check if most bytes are printable ASCII or common UTF-8
+        try:
+            sample.decode('utf-8')
+            return True
+        except UnicodeDecodeError:
+            pass
+
+        try:
+            sample.decode('ascii')
+            return True
+        except UnicodeDecodeError:
+            return False
+
+    except (OSError, IOError):
+        return False
+
+
+def normalize_path(path: Union[str, Path]) -> str:
+    """
+    Normalize a file path by resolving relative components and converting to absolute.
+
+    Args:
+        path: The input path to normalize
+
+    Returns:
+        Normalized absolute path as a string
+
+    Examples:
+        >>> normalize_path("./dir/../file.txt")  # doctest: +SKIP
+        '/current/working/directory/file.txt'
+    """
+    if not path:
+        return ""
+
+    return str(Path(path).resolve())
--- a/capabilities/markitect-utils/src/markitect_utils/string_utils.py
+++ b/capabilities/markitect-utils/src/markitect_utils/string_utils.py
@@ -0,0 +1,162 @@
+"""
+String utility functions for MarkiTect ecosystem.
+
+Provides common string manipulation and formatting functions that are
+frequently needed across different MarkiTect capabilities.
+"""
+
+import re
+from typing import Optional
+
+
+def slugify(text: str, separator: str = "-") -> str:
+    """
+    Convert a string to a URL-friendly slug.
+
+    Args:
+        text: The input string to convert
+        separator: Character to use for word separation (default: "-")
+
+    Returns:
+        A lowercase string with special characters removed and words separated
+
+    Examples:
+        >>> slugify("Hello World!")
+        'hello-world'
+        >>> slugify("My Great Article", "_")
+        'my_great_article'
+    """
+    if not text:
+        return ""
+
+    # Convert to lowercase and normalize unicode
+    text = text.lower()
+    # Remove unicode accents by replacing with ASCII equivalents
+    text = re.sub(r'[àáâãäå]', 'a', text)
+    text = re.sub(r'[èéêë]', 'e', text)
+    text = re.sub(r'[ìíîï]', 'i', text)
+    text = re.sub(r'[òóôõö]', 'o', text)
+    text = re.sub(r'[ùúûü]', 'u', text)
+    text = re.sub(r'[ýÿ]', 'y', text)
+    text = re.sub(r'[ç]', 'c', text)
+    text = re.sub(r'[ñ]', 'n', text)
+
+    # Replace non-alphanumeric characters (except underscores and dashes) with separator
+    text = re.sub(r'[^\w\s-]', '', text)
+    # Replace whitespace and underscores with separator
+    text = re.sub(r'[\s_]+', separator, text)
+    # Replace multiple separators with single separator
+    text = re.sub(f'[{re.escape(separator)}]+', separator, text)
+    # Remove leading/trailing separators
+    text = text.strip(separator)
+
+    return text
+
+
+def truncate(text: str, max_length: int, suffix: str = "...") -> str:
+    """
+    Truncate a string to a maximum length, adding a suffix if truncated.
+
+    Args:
+        text: The input string to truncate
+        max_length: Maximum length of the result (including suffix)
+        suffix: String to append if truncation occurs (default: "...")
+
+    Returns:
+        The truncated string with suffix if needed
+
+    Examples:
+        >>> truncate("This is a long string", 10)
+        'This is...'
+        >>> truncate("Short", 10)
+        'Short'
+    """
+    if not text or len(text) <= max_length:
+        return text
+
+    if max_length <= len(suffix):
+        return suffix[:max_length]
+
+    truncate_at = max_length - len(suffix)
+    return text[:truncate_at] + suffix
+
+
+def camel_to_snake(text: str) -> str:
+    """
+    Convert camelCase or PascalCase to snake_case.
+
+    Args:
+        text: The input string in camelCase or PascalCase
+
+    Returns:
+        String converted to snake_case
+
+    Examples:
+        >>> camel_to_snake("camelCase")
+        'camel_case'
+        >>> camel_to_snake("PascalCase")
+        'pascal_case'
+        >>> camel_to_snake("XMLHttpRequest")
+        'xml_http_request'
+    """
+    if not text:
+        return text
+
+    # Insert underscore before uppercase letters that follow lowercase letters
+    text = re.sub('(.)([A-Z][a-z]+)', r'\1_\2', text)
+    # Insert underscore before uppercase letters that follow lowercase letters or digits
+    text = re.sub('([a-z0-9])([A-Z])', r'\1_\2', text)
+
+    return text.lower()
+
+
+def snake_to_camel(text: str, pascal_case: bool = False) -> str:
+    """
+    Convert snake_case to camelCase or PascalCase.
+
+    Args:
+        text: The input string in snake_case
+        pascal_case: If True, return PascalCase; otherwise camelCase (default: False)
+
+    Returns:
+        String converted to camelCase or PascalCase
+
+    Examples:
+        >>> snake_to_camel("snake_case")
+        'snakeCase'
+        >>> snake_to_camel("snake_case", pascal_case=True)
+        'SnakeCase'
+    """
+    if not text:
+        return text
+
+    components = text.split('_')
+    if not components:
+        return text
+
+    if pascal_case:
+        return ''.join(word.capitalize() for word in components)
+    else:
+        return components[0] + ''.join(word.capitalize() for word in components[1:])
+
+
+def strip_ansi_codes(text: str) -> str:
+    """
+    Remove ANSI escape sequences from a string.
+
+    Args:
+        text: String that may contain ANSI escape sequences
+
+    Returns:
+        String with ANSI codes removed
+
+    Examples:
+        >>> strip_ansi_codes("\\033[31mRed text\\033[0m")
+        'Red text'
+    """
+    if not text:
+        return text
+
+    # ANSI escape sequence pattern
+    ansi_escape = re.compile(r'\x1B(?:[@-Z\\-_]|\[[0-?]*[ -/]*[@-~])')
+    return ansi_escape.sub('', text)
--- a/capabilities/markitect-utils/src/markitect_utils/validation_utils.py
+++ b/capabilities/markitect-utils/src/markitect_utils/validation_utils.py
@@ -0,0 +1,160 @@
+"""
+Validation utility functions for MarkiTect ecosystem.
+
+Provides common validation functions for various data types and formats
+that are frequently needed across different MarkiTect capabilities.
+"""
+
+import re
+from typing import Any, Dict, List, Optional, Union
+
+
+def is_valid_email(email: str) -> bool:
+    """
+    Check if a string is a valid email address format.
+
+    Args:
+        email: The email address to validate
+
+    Returns:
+        True if the email format is valid, False otherwise
+
+    Examples:
+        >>> is_valid_email("user@example.com")
+        True
+        >>> is_valid_email("invalid.email")
+        False
+    """
+    if not email or not isinstance(email, str):
+        return False
+
+    # Basic email regex pattern
+    pattern = r'^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}$'
+    return bool(re.match(pattern, email))
+
+
+def is_valid_url(url: str) -> bool:
+    """
+    Check if a string is a valid URL format.
+
+    Args:
+        url: The URL to validate
+
+    Returns:
+        True if the URL format is valid, False otherwise
+
+    Examples:
+        >>> is_valid_url("https://example.com")
+        True
+        >>> is_valid_url("not-a-url")
+        False
+    """
+    if not url or not isinstance(url, str):
+        return False
+
+    # URL regex pattern
+    pattern = re.compile(
+        r'^https?://'  # http:// or https://
+        r'(?:(?:[A-Z0-9](?:[A-Z0-9-]{0,61}[A-Z0-9])?\.)+[A-Z]{2,6}\.?|'  # domain...
+        r'localhost|'  # localhost...
+        r'\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3})'  # ...or ip
+        r'(?::\d+)?'  # optional port
+        r'(?:/?|[/?]\S+)$', re.IGNORECASE)
+
+    return bool(pattern.match(url))
+
+
+def is_valid_semver(version: str) -> bool:
+    """
+    Check if a string is a valid semantic version (semver) format.
+
+    Args:
+        version: The version string to validate
+
+    Returns:
+        True if the version follows semver format, False otherwise
+
+    Examples:
+        >>> is_valid_semver("1.0.0")
+        True
+        >>> is_valid_semver("1.0.0-alpha.1")
+        True
+        >>> is_valid_semver("1.0")
+        False
+    """
+    if not version or not isinstance(version, str):
+        return False
+
+    # Semantic version regex pattern
+    pattern = re.compile(
+        r'^(?P<major>0|[1-9]\d*)\.'
+        r'(?P<minor>0|[1-9]\d*)\.'
+        r'(?P<patch>0|[1-9]\d*)'
+        r'(?:-(?P<prerelease>(?:0|[1-9]\d*|\d*[a-zA-Z-][0-9a-zA-Z-]*)'
+        r'(?:\.(?:0|[1-9]\d*|\d*[a-zA-Z-][0-9a-zA-Z-]*))*))?'
+        r'(?:\+(?P<buildmetadata>[0-9a-zA-Z-]+(?:\.[0-9a-zA-Z-]+)*))?$'
+    )
+
+    return bool(pattern.match(version))
+
+
+def validate_required_fields(data: Dict[str, Any], required_fields: List[str]) -> Dict[str, List[str]]:
+    """
+    Validate that required fields are present and not empty in a dictionary.
+
+    Args:
+        data: Dictionary to validate
+        required_fields: List of field names that are required
+
+    Returns:
+        Dictionary with 'missing' and 'empty' keys containing lists of field names
+
+    Examples:
+        >>> validate_required_fields({"name": "John", "email": ""}, ["name", "email", "age"])
+        {'missing': ['age'], 'empty': ['email']}
+        >>> validate_required_fields({"name": "John", "email": "john@example.com"}, ["name", "email"])
+        {'missing': [], 'empty': []}
+    """
+    result = {
+        'missing': [],
+        'empty': []
+    }
+
+    if not isinstance(data, dict) or not isinstance(required_fields, list):
+        return result
+
+    for field in required_fields:
+        if field not in data:
+            result['missing'].append(field)
+        elif _is_empty_value(data[field]):
+            result['empty'].append(field)
+
+    return result
+
+
+def _is_empty_value(value: Any) -> bool:
+    """
+    Check if a value should be considered empty for validation purposes.
+
+    Args:
+        value: The value to check
+
+    Returns:
+        True if the value is considered empty, False otherwise
+    """
+    if value is None:
+        return True
+
+    if isinstance(value, str):
+        return not value.strip()
+
+    if isinstance(value, (list, tuple, dict, set)):
+        return len(value) == 0
+
+    # For numeric types (int, float), only None is considered empty
+    # Zero and False are valid values
+    if isinstance(value, (int, float, bool)):
+        return False
+
+    # For other types, use Python's truthiness
+    return not bool(value)
--- a/capabilities/markitect-utils/tests/test_file_utils.py
+++ b/capabilities/markitect-utils/tests/test_file_utils.py
@@ -0,0 +1,210 @@
+"""
+Test suite for file utility functions.
+"""
+
+import os
+import tempfile
+from pathlib import Path
+
+import pytest
+from markitect_utils.file_utils import (
+    safe_filename,
+    ensure_extension,
+    get_file_size,
+    is_text_file,
+    normalize_path,
+)
+
+
+class TestSafeFilename:
+    """Test cases for the safe_filename function."""
+
+    def test_basic_sanitization(self):
+        """Test basic filename sanitization."""
+        assert safe_filename("normal_file.txt") == "normal_file.txt"
+        assert safe_filename("file with spaces.txt") == "file with spaces.txt"
+
+    def test_unsafe_characters(self):
+        """Test removal of unsafe characters."""
+        assert safe_filename("file<>name.txt") == "file__name.txt"
+        assert safe_filename('file"name.txt') == "file_name.txt"
+        assert safe_filename("file/path\\name.txt") == "file_path_name.txt"
+        assert safe_filename("file|name.txt") == "file_name.txt"
+
+    def test_custom_replacement(self):
+        """Test custom replacement character."""
+        assert safe_filename("file<>name.txt", "-") == "file--name.txt"
+        assert safe_filename("file/path\\name.txt", "") == "filepathname.txt"
+
+    def test_reserved_names(self):
+        """Test handling of Windows reserved names."""
+        assert safe_filename("CON") == "file_CON"
+        assert safe_filename("PRN.txt") == "file_PRN.txt"
+        assert safe_filename("COM1") == "file_COM1"
+        assert safe_filename("con") == "file_con"  # case insensitive
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert safe_filename("") == ""  # Empty input returns empty
+        assert safe_filename("   ") == "file_"  # Whitespace only gets prefix
+        assert safe_filename("...") == "file_"  # Dots only gets prefix
+        assert safe_filename(".hidden") == "hidden"  # Leading dot gets stripped
+
+
+class TestEnsureExtension:
+    """Test cases for the ensure_extension function."""
+
+    def test_add_extension(self):
+        """Test adding extension to filename."""
+        assert ensure_extension("document", ".md") == "document.md"
+        assert ensure_extension("file", "txt") == "file.txt"
+
+    def test_existing_extension(self):
+        """Test when extension already exists."""
+        assert ensure_extension("document.md", ".md") == "document.md"
+        assert ensure_extension("file.txt", "txt") == "file.txt"
+
+    def test_different_extension(self):
+        """Test adding extension when different one exists."""
+        assert ensure_extension("document.txt", ".md") == "document.txt.md"
+        assert ensure_extension("file.doc", "pdf") == "file.doc.pdf"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert ensure_extension("", ".md") == ""
+        assert ensure_extension("file", "") == "file"
+        assert ensure_extension("file.md", "") == "file.md"
+
+
+class TestGetFileSize:
+    """Test cases for the get_file_size function."""
+
+    def test_existing_file(self):
+        """Test getting size of existing file."""
+        with tempfile.NamedTemporaryFile(mode='w', delete=False) as f:
+            f.write("Hello, World!")
+            temp_path = f.name
+
+        try:
+            size = get_file_size(temp_path)
+            assert size is not None
+            assert size > 0
+        finally:
+            os.unlink(temp_path)
+
+    def test_nonexistent_file(self):
+        """Test getting size of non-existent file."""
+        assert get_file_size("/path/that/does/not/exist") is None
+
+    def test_empty_file(self):
+        """Test getting size of empty file."""
+        with tempfile.NamedTemporaryFile(delete=False) as f:
+            temp_path = f.name
+
+        try:
+            size = get_file_size(temp_path)
+            assert size == 0
+        finally:
+            os.unlink(temp_path)
+
+    def test_path_object(self):
+        """Test with Path object."""
+        with tempfile.NamedTemporaryFile(mode='w', delete=False) as f:
+            f.write("test content")
+            temp_path = Path(f.name)
+
+        try:
+            size = get_file_size(temp_path)
+            assert size is not None
+            assert size > 0
+        finally:
+            os.unlink(temp_path)
+
+
+class TestIsTextFile:
+    """Test cases for the is_text_file function."""
+
+    def test_text_file(self):
+        """Test with actual text file."""
+        with tempfile.NamedTemporaryFile(mode='w', delete=False, suffix='.txt') as f:
+            f.write("This is a text file with some content.")
+            temp_path = f.name
+
+        try:
+            assert is_text_file(temp_path) is True
+        finally:
+            os.unlink(temp_path)
+
+    def test_binary_file(self):
+        """Test with binary file."""
+        with tempfile.NamedTemporaryFile(mode='wb', delete=False, suffix='.bin') as f:
+            f.write(b'\x00\x01\x02\x03\x04\x05')
+            temp_path = f.name
+
+        try:
+            assert is_text_file(temp_path) is False
+        finally:
+            os.unlink(temp_path)
+
+    def test_empty_file(self):
+        """Test with empty file."""
+        with tempfile.NamedTemporaryFile(delete=False) as f:
+            temp_path = f.name
+
+        try:
+            assert is_text_file(temp_path) is True  # Empty files are considered text
+        finally:
+            os.unlink(temp_path)
+
+    def test_unicode_file(self):
+        """Test with Unicode text file."""
+        with tempfile.NamedTemporaryFile(mode='w', delete=False, encoding='utf-8') as f:
+            f.write("Hello 世界! This is UTF-8 text.")
+            temp_path = f.name
+
+        try:
+            assert is_text_file(temp_path) is True
+        finally:
+            os.unlink(temp_path)
+
+    def test_nonexistent_file(self):
+        """Test with non-existent file."""
+        assert is_text_file("/path/that/does/not/exist") is False
+
+
+class TestNormalizePath:
+    """Test cases for the normalize_path function."""
+
+    def test_relative_path(self):
+        """Test normalizing relative paths."""
+        # Note: These tests are environment-dependent
+        result = normalize_path("./test")
+        assert os.path.isabs(result)
+        assert result.endswith("test")
+
+    def test_path_with_dots(self):
+        """Test path with dot components."""
+        result = normalize_path("./dir/../file.txt")
+        assert os.path.isabs(result)
+        assert result.endswith("file.txt")
+
+    def test_already_absolute(self):
+        """Test already absolute path."""
+        abs_path = "/tmp/test/file.txt"
+        result = normalize_path(abs_path)
+        assert result == abs_path
+
+    def test_path_object(self):
+        """Test with Path object."""
+        path_obj = Path("./test/file.txt")
+        result = normalize_path(path_obj)
+        assert os.path.isabs(result)
+        assert isinstance(result, str)
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert normalize_path("") == ""
+
+        # Current directory should normalize to absolute path
+        result = normalize_path(".")
+        assert os.path.isabs(result)
--- a/capabilities/markitect-utils/tests/test_integration.py
+++ b/capabilities/markitect-utils/tests/test_integration.py
@@ -0,0 +1,175 @@
+"""
+Integration tests for markitect-utils capability.
+
+Tests the overall functionality and integration of the utility modules.
+"""
+
+import tempfile
+import os
+from pathlib import Path
+
+import pytest
+from markitect_utils import (
+    slugify, safe_filename, is_valid_email, validate_required_fields,
+    truncate, normalize_path
+)
+
+
+class TestUtilityIntegration:
+    """Test integration between different utility functions."""
+
+    def test_filename_processing_workflow(self):
+        """Test a complete filename processing workflow."""
+        # Start with user input
+        user_title = "My Great Article: A Case Study!"
+        user_email = "author@example.com"
+
+        # Validate email
+        assert is_valid_email(user_email) is True
+
+        # Create a slug for URL
+        slug = slugify(user_title)
+        assert slug == "my-great-article-a-case-study"
+
+        # Create a safe filename
+        filename = safe_filename(f"{slug}.md")
+        assert filename == "my-great-article-a-case-study.md"
+
+        # Truncate if too long
+        if len(filename) > 30:
+            filename = truncate(filename, 30, "….md")
+
+        assert len(filename) <= 30
+        assert filename.endswith(".md") or filename.endswith("….md")
+
+    def test_content_validation_workflow(self):
+        """Test a content validation workflow."""
+        # Simulate form data
+        form_data = {
+            "title": "My Article",
+            "content": "This is the content of my article.",
+            "author_email": "author@example.com",
+            "category": "",  # Empty field
+            "tags": "python,utils,testing"
+        }
+
+        required_fields = ["title", "content", "author_email", "category"]
+
+        # Validate required fields
+        validation_result = validate_required_fields(form_data, required_fields)
+
+        assert validation_result["missing"] == []
+        assert validation_result["empty"] == ["category"]
+
+        # Validate email format
+        if form_data.get("author_email"):
+            assert is_valid_email(form_data["author_email"]) is True
+
+        # Process tags
+        if form_data.get("tags"):
+            tag_list = [slugify(tag.strip()) for tag in form_data["tags"].split(",")]
+            assert tag_list == ["python", "utils", "testing"]
+
+    def test_file_operations_workflow(self):
+        """Test a file operations workflow."""
+        # Create temporary directory for testing
+        with tempfile.TemporaryDirectory() as temp_dir:
+            temp_path = Path(temp_dir)
+
+            # Create some test files
+            test_content = "This is test content for the file."
+
+            # Process filename through multiple utilities
+            original_name = "Test File: With Special/Characters!"
+            safe_name = safe_filename(original_name)
+            slug_name = slugify(safe_name.rsplit('.', 1)[0]) + '.txt'
+
+            # Write file
+            file_path = temp_path / slug_name
+            file_path.write_text(test_content)
+
+            # Verify file exists and get normalized path
+            normalized_path = normalize_path(file_path)
+            assert Path(normalized_path).exists()
+
+            # Verify content length matches expectations
+            content_length = len(test_content)
+            assert content_length > 0
+
+    def test_data_processing_pipeline(self):
+        """Test a complete data processing pipeline."""
+        # Raw data from external source
+        raw_data = [
+            {
+                "userName": "JohnDoe123",
+                "emailAddress": "john.doe@example.com",
+                "websiteURL": "https://johndoe.example.com",
+                "projectVersion": "1.2.0",
+                "description": "This is a very long description that might need to be truncated for display purposes in certain UI components."
+            },
+            {
+                "userName": "Jane_Smith",
+                "emailAddress": "invalid-email",
+                "websiteURL": "not-a-url",
+                "projectVersion": "invalid-version",
+                "description": "Short desc"
+            }
+        ]
+
+        processed_data = []
+
+        for item in raw_data:
+            # Convert camelCase to snake_case (would need the function imported)
+            # For now, just demonstrate with available functions
+
+            processed_item = {
+                "slug": slugify(item["userName"]),
+                "email_valid": is_valid_email(item["emailAddress"]),
+                "short_desc": truncate(item["description"], 50),
+                "safe_filename": safe_filename(f"{item['userName']}_profile.json")
+            }
+
+            processed_data.append(processed_item)
+
+        # Verify processing results
+        assert processed_data[0]["slug"] == "johndoe123"
+        assert processed_data[0]["email_valid"] is True
+        assert len(processed_data[0]["short_desc"]) <= 50
+        assert processed_data[0]["safe_filename"] == "JohnDoe123_profile.json"
+
+        assert processed_data[1]["slug"] == "jane-smith"
+        assert processed_data[1]["email_valid"] is False
+        assert processed_data[1]["short_desc"] == "Short desc"
+        assert processed_data[1]["safe_filename"] == "Jane_Smith_profile.json"
+
+    def test_configuration_validation(self):
+        """Test configuration validation using multiple utilities."""
+        # Simulate application configuration
+        config = {
+            "app_name": "My Application",
+            "version": "1.0.0",
+            "admin_email": "admin@myapp.com",
+            "base_url": "https://myapp.example.com",
+            "debug": True,
+            "secret_key": "",  # Empty - should be flagged
+        }
+
+        required_fields = ["app_name", "version", "admin_email", "base_url", "secret_key"]
+
+        # Validate required fields
+        validation_result = validate_required_fields(config, required_fields)
+        assert validation_result["empty"] == ["secret_key"]
+
+        # Validate specific formats
+        email_valid = is_valid_email(config["admin_email"])
+        assert email_valid is True
+
+        # Create safe directory name from app name
+        app_slug = slugify(config["app_name"])
+        safe_dir_name = safe_filename(app_slug)
+
+        assert app_slug == "my-application"
+        assert safe_dir_name == "my-application"
+
+        # Validate version format would use is_valid_semver
+        # (assuming we had imported it in the integration test)
--- a/capabilities/markitect-utils/tests/test_string_utils.py
+++ b/capabilities/markitect-utils/tests/test_string_utils.py
@@ -0,0 +1,149 @@
+"""
+Test suite for string utility functions.
+"""
+
+import pytest
+from markitect_utils.string_utils import (
+    slugify,
+    truncate,
+    camel_to_snake,
+    snake_to_camel,
+    strip_ansi_codes,
+)
+
+
+class TestSlugify:
+    """Test cases for the slugify function."""
+
+    def test_basic_slugify(self):
+        """Test basic string to slug conversion."""
+        assert slugify("Hello World") == "hello-world"
+        assert slugify("My Great Article") == "my-great-article"
+
+    def test_special_characters(self):
+        """Test handling of special characters."""
+        assert slugify("Hello World!") == "hello-world"
+        assert slugify("Test@#$%^&*()_+") == "test"  # underscore gets converted to separator
+        assert slugify("Multiple---Dashes") == "multiple-dashes"
+
+    def test_custom_separator(self):
+        """Test custom separator."""
+        assert slugify("Hello World", "_") == "hello_world"
+        assert slugify("My Great Article", ".") == "my.great.article"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert slugify("") == ""
+        assert slugify("   ") == ""
+        assert slugify("---") == ""
+        assert slugify("Single") == "single"
+
+    def test_unicode_handling(self):
+        """Test unicode character handling."""
+        assert slugify("Café") == "cafe"
+        assert slugify("naïve résumé") == "naive-resume"
+
+
+class TestTruncate:
+    """Test cases for the truncate function."""
+
+    def test_basic_truncation(self):
+        """Test basic string truncation."""
+        text = "This is a long string that needs truncation"
+        assert truncate(text, 20) == "This is a long st..."
+        assert truncate(text, 10) == "This is..."
+
+    def test_no_truncation_needed(self):
+        """Test when no truncation is needed."""
+        assert truncate("Short", 10) == "Short"
+        assert truncate("Exact", 5) == "Exact"
+
+    def test_custom_suffix(self):
+        """Test custom suffix."""
+        text = "Long text here"
+        assert truncate(text, 10, "…") == "Long text…"
+        assert truncate(text, 10, " [more]") == "Lon [more]"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert truncate("", 10) == ""
+        assert truncate("Test", 3) == "..."
+        assert truncate("Test", 2) == ".."
+        assert truncate("Test", 1) == "."
+        assert truncate("Test", 0) == ""
+
+
+class TestCamelToSnake:
+    """Test cases for the camel_to_snake function."""
+
+    def test_basic_conversion(self):
+        """Test basic camelCase to snake_case conversion."""
+        assert camel_to_snake("camelCase") == "camel_case"
+        assert camel_to_snake("PascalCase") == "pascal_case"
+        assert camel_to_snake("simpleWord") == "simple_word"
+
+    def test_multiple_words(self):
+        """Test conversion with multiple words."""
+        assert camel_to_snake("thisIsALongVariableName") == "this_is_a_long_variable_name"
+        assert camel_to_snake("XMLHttpRequest") == "xml_http_request"
+        assert camel_to_snake("JSONData") == "json_data"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert camel_to_snake("") == ""
+        assert camel_to_snake("single") == "single"
+        assert camel_to_snake("ALLCAPS") == "allcaps"
+        assert camel_to_snake("A") == "a"
+
+
+class TestSnakeToCamel:
+    """Test cases for the snake_to_camel function."""
+
+    def test_basic_conversion(self):
+        """Test basic snake_case to camelCase conversion."""
+        assert snake_to_camel("snake_case") == "snakeCase"
+        assert snake_to_camel("simple_word") == "simpleWord"
+        assert snake_to_camel("single") == "single"
+
+    def test_pascal_case(self):
+        """Test conversion to PascalCase."""
+        assert snake_to_camel("snake_case", pascal_case=True) == "SnakeCase"
+        assert snake_to_camel("simple_word", pascal_case=True) == "SimpleWord"
+        assert snake_to_camel("single", pascal_case=True) == "Single"
+
+    def test_multiple_underscores(self):
+        """Test handling of multiple underscores."""
+        assert snake_to_camel("this_is_a_long_name") == "thisIsALongName"
+        assert snake_to_camel("this_is_a_long_name", pascal_case=True) == "ThisIsALongName"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert snake_to_camel("") == ""
+        assert snake_to_camel("_") == ""
+        assert snake_to_camel("__") == ""
+        assert snake_to_camel("single") == "single"
+
+
+class TestStripAnsiCodes:
+    """Test cases for the strip_ansi_codes function."""
+
+    def test_basic_ansi_removal(self):
+        """Test basic ANSI code removal."""
+        assert strip_ansi_codes("\033[31mRed text\033[0m") == "Red text"
+        assert strip_ansi_codes("\033[32mGreen\033[0m text") == "Green text"
+
+    def test_complex_ansi_codes(self):
+        """Test complex ANSI escape sequences."""
+        text_with_ansi = "\033[1;31;40mBold red on black\033[0m"
+        assert strip_ansi_codes(text_with_ansi) == "Bold red on black"
+
+    def test_no_ansi_codes(self):
+        """Test text without ANSI codes."""
+        plain_text = "Just plain text"
+        assert strip_ansi_codes(plain_text) == plain_text
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert strip_ansi_codes("") == ""
+        assert strip_ansi_codes("\033[0m") == ""
+        assert strip_ansi_codes("Start\033[31mMiddle\033[0mEnd") == "StartMiddleEnd"
--- a/capabilities/markitect-utils/tests/test_validation_utils.py
+++ b/capabilities/markitect-utils/tests/test_validation_utils.py
@@ -0,0 +1,215 @@
+"""
+Test suite for validation utility functions.
+"""
+
+import pytest
+from markitect_utils.validation_utils import (
+    is_valid_email,
+    is_valid_url,
+    is_valid_semver,
+    validate_required_fields,
+)
+
+
+class TestIsValidEmail:
+    """Test cases for the is_valid_email function."""
+
+    def test_valid_emails(self):
+        """Test valid email addresses."""
+        valid_emails = [
+            "user@example.com",
+            "test.email@domain.co.uk",
+            "user+tag@example.org",
+            "user123@test-domain.com",
+            "a@b.co",
+        ]
+        for email in valid_emails:
+            assert is_valid_email(email) is True, f"Failed for: {email}"
+
+    def test_invalid_emails(self):
+        """Test invalid email addresses."""
+        invalid_emails = [
+            "invalid.email",
+            "@example.com",
+            "user@",
+            "user@.com",
+            "user space@example.com",
+            "user@domain",
+            "user@@example.com",
+            "",
+        ]
+        for email in invalid_emails:
+            assert is_valid_email(email) is False, f"Should be invalid: {email}"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert is_valid_email(None) is False
+        assert is_valid_email(123) is False
+        assert is_valid_email([]) is False
+
+
+class TestIsValidUrl:
+    """Test cases for the is_valid_url function."""
+
+    def test_valid_urls(self):
+        """Test valid URLs."""
+        valid_urls = [
+            "https://example.com",
+            "http://test.org",
+            "https://sub.domain.com/path",
+            "http://localhost:8000",
+            "https://example.com/path?query=value",
+            "http://192.168.1.1:3000",
+        ]
+        for url in valid_urls:
+            assert is_valid_url(url) is True, f"Failed for: {url}"
+
+    def test_invalid_urls(self):
+        """Test invalid URLs."""
+        invalid_urls = [
+            "not-a-url",
+            "ftp://example.com",
+            "example.com",
+            "://example.com",
+            "https://",
+            "http://.com",
+            "",
+        ]
+        for url in invalid_urls:
+            assert is_valid_url(url) is False, f"Should be invalid: {url}"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert is_valid_url(None) is False
+        assert is_valid_url(123) is False
+        assert is_valid_url([]) is False
+
+
+class TestIsValidSemver:
+    """Test cases for the is_valid_semver function."""
+
+    def test_valid_versions(self):
+        """Test valid semantic versions."""
+        valid_versions = [
+            "1.0.0",
+            "0.1.0",
+            "10.20.30",
+            "1.0.0-alpha",
+            "1.0.0-alpha.1",
+            "1.0.0-0.3.7",
+            "1.0.0-x.7.z.92",
+            "1.0.0+20130313144700",
+            "1.0.0-beta+exp.sha.5114f85",
+        ]
+        for version in valid_versions:
+            assert is_valid_semver(version) is True, f"Failed for: {version}"
+
+    def test_invalid_versions(self):
+        """Test invalid semantic versions."""
+        invalid_versions = [
+            "1.0",
+            "1.0.0-",
+            "1.0.0+",
+            "01.0.0",
+            "1.01.0",
+            "1.0.01",
+            "1.0.0-",
+            "1.0.0+",
+            "v1.0.0",
+            "",
+        ]
+        for version in invalid_versions:
+            assert is_valid_semver(version) is False, f"Should be invalid: {version}"
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        assert is_valid_semver(None) is False
+        assert is_valid_semver(123) is False
+        assert is_valid_semver([]) is False
+
+
+class TestValidateRequiredFields:
+    """Test cases for the validate_required_fields function."""
+
+    def test_all_fields_present(self):
+        """Test when all required fields are present and valid."""
+        data = {
+            "name": "John Doe",
+            "email": "john@example.com",
+            "age": 30
+        }
+        required = ["name", "email", "age"]
+        result = validate_required_fields(data, required)
+
+        assert result == {"missing": [], "empty": []}
+
+    def test_missing_fields(self):
+        """Test when some fields are missing."""
+        data = {
+            "name": "John Doe",
+            "email": "john@example.com"
+        }
+        required = ["name", "email", "age", "phone"]
+        result = validate_required_fields(data, required)
+
+        assert set(result["missing"]) == {"age", "phone"}
+        assert result["empty"] == []
+
+    def test_empty_fields(self):
+        """Test when some fields are empty."""
+        data = {
+            "name": "John Doe",
+            "email": "",
+            "age": 30,
+            "phone": "   "  # whitespace only
+        }
+        required = ["name", "email", "age", "phone"]
+        result = validate_required_fields(data, required)
+
+        assert result["missing"] == []
+        assert set(result["empty"]) == {"email", "phone"}
+
+    def test_mixed_issues(self):
+        """Test when there are both missing and empty fields."""
+        data = {
+            "name": "John Doe",
+            "email": "",
+        }
+        required = ["name", "email", "age", "phone"]
+        result = validate_required_fields(data, required)
+
+        assert set(result["missing"]) == {"age", "phone"}
+        assert result["empty"] == ["email"]
+
+    def test_non_string_values(self):
+        """Test with non-string values."""
+        data = {
+            "name": "John Doe",
+            "age": 0,  # Zero should not be considered empty
+            "active": False,  # False should not be considered empty
+            "score": None,  # None should be considered empty
+            "items": [],  # Empty list should be considered empty
+        }
+        required = ["name", "age", "active", "score", "items"]
+        result = validate_required_fields(data, required)
+
+        assert result["missing"] == []
+        assert set(result["empty"]) == {"score", "items"}
+
+    def test_edge_cases(self):
+        """Test edge cases."""
+        # Invalid inputs
+        result = validate_required_fields("not a dict", ["field"])
+        assert result == {"missing": [], "empty": []}
+
+        result = validate_required_fields({}, "not a list")
+        assert result == {"missing": [], "empty": []}
+
+        # Empty data and requirements
+        result = validate_required_fields({}, [])
+        assert result == {"missing": [], "empty": []}
+
+        # Empty requirements
+        data = {"name": "John"}
+        result = validate_required_fields(data, [])
+        assert result == {"missing": [], "empty": []}