Google Style API Documentation

Documentation Style

This page demonstrates API documentation using Google-style docstrings, which provide a clean and readable format popular in many open-source projects.

Overview

Google-style docstrings use a simple, indented format that's easy to read and write. They're particularly well-suited for:

  • Open-source projects
  • Team collaborations
  • General-purpose libraries
  • Web APIs and services
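
For reference, a minimal function documented in this style looks like the following (a generic illustration, separate from the DataProcessor API below):

def scale(value: float, factor: float = 2.0) -> float:
    """Scale a numeric value by a factor.

    Args:
        value: The number to scale.
        factor: Multiplier applied to value (default: 2.0).

    Returns:
        The scaled value.
    """
    return value * factor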

DataProcessor Class

src.docstring_examples.google_style.DataProcessor

DataProcessor(name: str, validation_enabled: bool = True, max_transformations: int = 100)

Comprehensive data processor with loading, transformation, and export.

This class provides a complete data processing pipeline including data loading, transformation operations, validation, and export functionality. It supports various data formats and provides extensive configuration options.

The processor maintains internal state and provides detailed logging of all operations for debugging and monitoring purposes.

Attributes:

  data (Optional[Dict[str, Any]]): Currently loaded data (None if no data is loaded)
  transformations_applied (int): Number of transformations applied to the current data
  export_count (int): Number of times data has been exported
  validation_enabled (bool): Whether to validate data during operations

Example

Complete workflow example:

# Create processor with validation enabled
processor = DataProcessor("sales_data", validation_enabled=True)

# Load data (note: each load replaces any previously loaded data)
processor.load_data({"product": "Widget", "sales": 1000})
processor.load_from_file("additional_data.json")

# Apply transformations
processor.transform_data(
    lambda x: x * 1.1 if isinstance(x, (int, float)) else x
)
processor.apply_filter(lambda item: item.get("sales", 0) > 500)

# Export results
processor.export_data("processed_results.json")
Note

This processor is thread-safe for read operations but not for concurrent modifications. Use appropriate locking mechanisms if sharing across threads.
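
A coarse-grained locking sketch (threading.Lock usage is standard; the helper and its granularity are illustrative assumptions, not part of this API):

import threading

processor_lock = threading.Lock()

def locked_transform(processor, func):
    # Serialize mutating calls so concurrent threads don't interleave writes
    with processor_lock:
        return processor.transform_data(func)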

Initialize the data processor.

Parameters:

  name (str, required): Descriptive name for this processor instance
  validation_enabled (bool, default True): Whether to enable data validation
  max_transformations (int, default 100): Maximum number of transformations allowed

Raises:

  ValueError: If name is empty or max_transformations is negative

Example
# Basic processor
processor = DataProcessor("basic_processor")

# Advanced processor with custom settings
processor = DataProcessor(
    name="advanced_processor",
    validation_enabled=True,
    max_transformations=50
)
Source code in src/docstring_examples/google_style.py
def __init__(
    self, name: str, validation_enabled: bool = True, max_transformations: int = 100
) -> None:
    """Initialize the data processor.

    Args:
        name: Descriptive name for this processor instance
        validation_enabled: Whether to enable data validation (default: True)
        max_transformations: Maximum number of transformations allowed
            (default: 100)

    Raises:
        ValueError: If name is empty or max_transformations is negative

    Example:
        ```python
        # Basic processor
        processor = DataProcessor("basic_processor")

        # Advanced processor with custom settings
        processor = DataProcessor(
            name="advanced_processor",
            validation_enabled=True,
            max_transformations=50
        )
        ```
    """
    super().__init__(name)

    if max_transformations < 0:
        raise ValueError("max_transformations must be non-negative")

    self.data: Optional[Dict[str, Any]] = None
    self.transformations_applied = 0
    self.export_count = 0
    self.validation_enabled = validation_enabled
    self.max_transformations = max_transformations

    logger.info(
        f"DataProcessor '{name}' initialized with "
        f"validation={'on' if validation_enabled else 'off'}"
    )

Attributes

status property

status: str

Get the current processor status.

Returns:

  str: String indicating the current status ("active" or "inactive")
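
Example

A quick usage sketch (processor is any DataProcessor instance):

if processor.status == "active":
    processor.load_data({"id": 1})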

Functions

load_data

load_data(data: Union[Dict[str, Any], List[Dict[str, Any]]]) -> None

Load data into the processor.

This method accepts data in dictionary or list format and stores it internally for subsequent processing operations. The data is validated if validation is enabled.

Parameters:

  data (Union[Dict[str, Any], List[Dict[str, Any]]], required): Data to load,
    either a single dictionary or a list of dictionaries

Raises:

  ProcessingError: If data validation fails or the processor is inactive
  TypeError: If data is not in the expected format

Example
# Load single record
processor.load_data({"id": 1, "name": "Alice", "score": 95})

# Load multiple records
processor.load_data([
    {"id": 1, "name": "Alice", "score": 95},
    {"id": 2, "name": "Bob", "score": 87}
])
Source code in src/docstring_examples/google_style.py
def load_data(self, data: Union[Dict[str, Any], List[Dict[str, Any]]]) -> None:
    """Load data into the processor.

    This method accepts data in dictionary or list format and stores it
    internally for subsequent processing operations. The data is validated
    if validation is enabled.

    Args:
        data: Data to load - either a single dictionary or list of dictionaries

    Raises:
        ProcessingError: If data validation fails or processor is inactive
        TypeError: If data is not in expected format

    Example:
        ```python
        # Load single record
        processor.load_data({"id": 1, "name": "Alice", "score": 95})

        # Load multiple records
        processor.load_data([
            {"id": 1, "name": "Alice", "score": 95},
            {"id": 2, "name": "Bob", "score": 87}
        ])
        ```
    """
    if not self.is_active:
        raise ProcessingError("Cannot load data: processor is inactive")

    if not isinstance(data, (dict, list)):
        raise TypeError("Data must be a dictionary or list of dictionaries")

    if self.validation_enabled:
        self._validate_data(data)

    if isinstance(data, dict):
        self.data = {"records": [data]}
    else:
        self.data = {"records": data}

    logger.info(
        f"Loaded {len(self.data['records'])} record(s) into processor '{self.name}'"
    )

load_from_file

load_from_file(file_path: Union[str, Path]) -> None

Load data from a JSON file.

Reads data from the specified file path and loads it into the processor. Supports both string paths and Path objects.

Parameters:

  file_path (Union[str, Path], required): Path to the JSON file to load

Raises:

  ProcessingError: If the file does not exist, cannot be read, or contains
    invalid JSON. As the source below shows, underlying errors such as
    FileNotFoundError, PermissionError, and json.JSONDecodeError are caught
    and wrapped in ProcessingError.

Example
# Load from string path
processor.load_from_file("data/input.json")

# Load from Path object
from pathlib import Path
processor.load_from_file(Path("data") / "input.json")
Source code in src/docstring_examples/google_style.py
def load_from_file(self, file_path: Union[str, Path]) -> None:
    """Load data from a JSON file.

    Reads data from the specified file path and loads it into the processor.
    Supports both string paths and Path objects.

    Args:
        file_path: Path to the JSON file to load

    Raises:
        ProcessingError: If the file does not exist, cannot be read, or
            contains invalid JSON (underlying OS and JSON errors are
            wrapped in ProcessingError)

    Example:
        ```python
        # Load from string path
        processor.load_from_file("data/input.json")

        # Load from Path object
        from pathlib import Path
        processor.load_from_file(Path("data") / "input.json")
        ```
    """
    if not self.is_active:
        raise ProcessingError("Cannot load from file: processor is inactive")

    file_path = Path(file_path)

    try:
        with file_path.open("r", encoding="utf-8") as f:
            data = json.load(f)
        self.load_data(data)
        logger.info(f"Successfully loaded data from {file_path}")

    except FileNotFoundError:
        raise ProcessingError(f"File not found: {file_path}")
    except json.JSONDecodeError as e:
        raise ProcessingError(f"Invalid JSON in file {file_path}: {e}")
    except Exception as e:
        raise ProcessingError(
            f"Error loading file {file_path}: {e}", original_error=e
        )

transform_data

transform_data(transformation_func: Callable[[Any], Any]) -> ProcessingResult

Apply a transformation function to all data values.

Applies the provided transformation function to each value in the loaded data. The transformation preserves the data structure while modifying individual values.

Parameters:

  transformation_func (Callable[[Any], Any], required): Function to apply to
    each data value. Should accept any value and return the transformed value.

Returns:

  ProcessingResult: Dictionary containing transformation results with keys:
    - 'records_processed': Number of records processed
    - 'transformations_applied': Total transformations applied to this dataset
    - 'success': Whether the transformation completed successfully

Raises:

  ProcessingError: If no data is loaded, the processor is inactive, or the
    maximum number of transformations is exceeded
  ValueError: If transformation_func is not callable

Example
# Convert all strings to uppercase
result = processor.transform_data(
    lambda x: x.upper() if isinstance(x, str) else x
)

# Apply mathematical transformation to numbers
result = processor.transform_data(
    lambda x: x * 1.1 if isinstance(x, (int, float)) else x
)

# Complex transformation with type checking
def complex_transform(value):
    if isinstance(value, str):
        return value.strip().title()
    elif isinstance(value, (int, float)):
        return round(value * 1.05, 2)
    return value

result = processor.transform_data(complex_transform)
Source code in src/docstring_examples/google_style.py
def transform_data(
    self, transformation_func: Callable[[Any], Any]
) -> ProcessingResult:
    """Apply a transformation function to all data values.

    Applies the provided transformation function to each value in the loaded data.
    The transformation preserves the data structure while modifying
    individual values.

    Args:
        transformation_func: Function to apply to each data value.
            Should accept any value and return the transformed value.

    Returns:
        Dictionary containing transformation results with keys:
            - 'records_processed': Number of records processed
            - 'transformations_applied': Total transformations applied
                to this dataset
            - 'success': Whether the transformation completed successfully

    Raises:
        ProcessingError: If no data is loaded, processor is inactive,
            or max transformations exceeded
        ValueError: If transformation_func is not callable

    Example:
        ```python
        # Convert all strings to uppercase
        result = processor.transform_data(
            lambda x: x.upper() if isinstance(x, str) else x
        )

        # Apply mathematical transformation to numbers
        result = processor.transform_data(
            lambda x: x * 1.1 if isinstance(x, (int, float)) else x
        )

        # Complex transformation with type checking
        def complex_transform(value):
            if isinstance(value, str):
                return value.strip().title()
            elif isinstance(value, (int, float)):
                return round(value * 1.05, 2)
            return value

        result = processor.transform_data(complex_transform)
        ```
    """
    if not self.is_active:
        raise ProcessingError("Cannot transform data: processor is inactive")

    if self.data is None:
        raise ProcessingError("No data loaded for transformation")

    if not callable(transformation_func):
        raise ValueError("transformation_func must be callable")

    if self.transformations_applied >= self.max_transformations:
        raise ProcessingError(
            f"Maximum transformations ({self.max_transformations}) exceeded"
        )

    try:
        records_processed = 0
        for record in self.data["records"]:
            for key, value in record.items():
                record[key] = transformation_func(value)
            records_processed += 1

        self.transformations_applied += 1

        result = {
            "records_processed": records_processed,
            "transformations_applied": self.transformations_applied,
            "success": True,
        }

        logger.info(
            f"Applied transformation to {records_processed} records in "
            f"processor '{self.name}'"
        )
        return result

    except Exception as e:
        raise ProcessingError(f"Transformation failed: {e}", original_error=e)

apply_filter

apply_filter(filter_func: Callable[[Dict[str, Any]], bool]) -> ProcessingResult

Filter data records based on a predicate function.

Removes records that don't match the filter criteria. The filter function should return True for records to keep and False for records to remove.

Parameters:

  filter_func (Callable[[Dict[str, Any]], bool], required): Predicate function
    that accepts a record dictionary and returns True to keep the record,
    False to remove it

Returns:

  ProcessingResult: Dictionary containing filter results with keys:
    - 'records_before': Number of records before filtering
    - 'records_after': Number of records after filtering
    - 'records_removed': Number of records removed
    - 'success': Whether the filter operation completed successfully

Raises:

  ProcessingError: If no data is loaded or the processor is inactive
  ValueError: If filter_func is not callable

Example
# Keep only records with score > 80
result = processor.apply_filter(lambda record: record.get('score', 0) > 80)

# Keep records with specific status
result = processor.apply_filter(
    lambda record: record.get('status') == 'active'
)

# Complex filter with multiple conditions
def complex_filter(record):
    return (record.get('score', 0) > 70 and
            record.get('active', False) and
            len(record.get('name', '')) > 0)

result = processor.apply_filter(complex_filter)
Source code in src/docstring_examples/google_style.py
def apply_filter(
    self, filter_func: Callable[[Dict[str, Any]], bool]
) -> ProcessingResult:
    """Filter data records based on a predicate function.

    Removes records that don't match the filter criteria. The filter function
    should return True for records to keep and False for records to remove.

    Args:
        filter_func: Predicate function that accepts a record dictionary
            and returns True to keep the record, False to remove it

    Returns:
        Dictionary containing filter results with keys:
            - 'records_before': Number of records before filtering
            - 'records_after': Number of records after filtering
            - 'records_removed': Number of records removed
            - 'success': Whether the filter operation completed successfully

    Raises:
        ProcessingError: If no data is loaded or processor is inactive
        ValueError: If filter_func is not callable

    Example:
        ```python
        # Keep only records with score > 80
        result = processor.apply_filter(lambda record: record.get('score', 0) > 80)

        # Keep records with specific status
        result = processor.apply_filter(
            lambda record: record.get('status') == 'active'
        )

        # Complex filter with multiple conditions
        def complex_filter(record):
            return (record.get('score', 0) > 70 and
                    record.get('active', False) and
                    len(record.get('name', '')) > 0)

        result = processor.apply_filter(complex_filter)
        ```
    """
    if not self.is_active:
        raise ProcessingError("Cannot apply filter: processor is inactive")

    if self.data is None:
        raise ProcessingError("No data loaded for filtering")

    if not callable(filter_func):
        raise ValueError("filter_func must be callable")

    try:
        records_before = len(self.data["records"])

        filtered_records = []
        for record in self.data["records"]:
            if filter_func(record):
                filtered_records.append(record)

        self.data["records"] = filtered_records
        records_after = len(filtered_records)
        records_removed = records_before - records_after

        result = {
            "records_before": records_before,
            "records_after": records_after,
            "records_removed": records_removed,
            "success": True,
        }

        logger.info(
            f"Filter applied: {records_removed} records removed, "
            f"{records_after} remaining"
        )
        return result

    except Exception as e:
        raise ProcessingError(f"Filter operation failed: {e}", original_error=e)

export_data

export_data(file_path: Union[str, Path], format: str = 'json') -> None

Export processed data to a file.

Saves the current processed data to the specified file path in the requested format. Currently supports JSON format with plans for additional formats in future versions.

Parameters:

  file_path (Union[str, Path], required): Output file path for the exported data
  format (str, default "json"): Export format; only "json" is currently supported

Raises:

  ProcessingError: If there is no data to export, the processor is inactive,
    or the write fails. As the source below shows, underlying errors such as
    PermissionError are caught and wrapped in ProcessingError.
  ValueError: If format is not supported

Example
# Basic JSON export
processor.export_data("output.json")

# Export with explicit format
processor.export_data("output.json", format="json")

# Export to Path object
from pathlib import Path
output_path = Path("exports") / "processed_data.json"
processor.export_data(output_path)
Source code in src/docstring_examples/google_style.py
def export_data(self, file_path: Union[str, Path], format: str = "json") -> None:
    """Export processed data to a file.

    Saves the current processed data to the specified file path in the
    requested format. Currently supports JSON format with plans for
    additional formats in future versions.

    Args:
        file_path: Output file path for the exported data
        format: Export format ("json" currently supported, default: "json")

    Raises:
        ProcessingError: If no data to export, processor inactive, or the
            write fails (underlying errors such as PermissionError are
            wrapped in ProcessingError)
        ValueError: If format is not supported

    Example:
        ```python
        # Basic JSON export
        processor.export_data("output.json")

        # Export with explicit format
        processor.export_data("output.json", format="json")

        # Export to Path object
        from pathlib import Path
        output_path = Path("exports") / "processed_data.json"
        processor.export_data(output_path)
        ```
    """
    if not self.is_active:
        raise ProcessingError("Cannot export data: processor is inactive")

    if self.data is None:
        raise ProcessingError("No data to export")

    if format.lower() != "json":
        raise ValueError(f"Unsupported export format: {format}")

    file_path = Path(file_path)

    try:
        # Ensure parent directory exists
        file_path.parent.mkdir(parents=True, exist_ok=True)

        with file_path.open("w", encoding="utf-8") as f:
            json.dump(self.data, f, indent=2, ensure_ascii=False)

        self.export_count += 1
        logger.info(f"Exported data to {file_path} (export #{self.export_count})")

    except PermissionError:
        raise ProcessingError(f"Permission denied writing to {file_path}")
    except Exception as e:
        raise ProcessingError(f"Export failed: {e}", original_error=e)

get_statistics

get_statistics() -> Dict[str, Any]

Get comprehensive statistics about the processor and its data.

Returns detailed information about the current state of the processor, including data counts, transformation history, and processing metrics.

Returns:

  Dict[str, Any]: Dictionary containing statistics with keys:
    - 'processor_name': Name of this processor instance
    - 'processor_status': Current status (active/inactive)
    - 'data_loaded': Whether data is currently loaded
    - 'record_count': Number of records currently loaded
    - 'transformations_applied': Number of transformations applied
    - 'export_count': Number of times data has been exported
    - 'validation_enabled': Whether validation is enabled
    - 'created_at': When the processor was created
    - 'uptime_seconds': How long the processor has existed

Example
stats = processor.get_statistics()
print(f"Processor: {stats['processor_name']}")
print(f"Records: {stats['record_count']}")
print(f"Transformations: {stats['transformations_applied']}")
Source code in src/docstring_examples/google_style.py
def get_statistics(self) -> Dict[str, Any]:
    """Get comprehensive statistics about the processor and its data.

    Returns detailed information about the current state of the processor,
    including data counts, transformation history, and processing metrics.

    Returns:
        Dictionary containing statistics with keys:
            - 'processor_name': Name of this processor instance
            - 'processor_status': Current status (active/inactive)
            - 'data_loaded': Whether data is currently loaded
            - 'record_count': Number of records currently loaded
            - 'transformations_applied': Number of transformations applied
            - 'export_count': Number of times data has been exported
            - 'validation_enabled': Whether validation is enabled
            - 'created_at': When the processor was created
            - 'uptime_seconds': How long the processor has existed

    Example:
        ```python
        stats = processor.get_statistics()
        print(f"Processor: {stats['processor_name']}")
        print(f"Records: {stats['record_count']}")
        print(f"Transformations: {stats['transformations_applied']}")
        ```
    """
    current_time = datetime.now()
    uptime = (current_time - self.created_at).total_seconds()

    stats = {
        "processor_name": self.name,
        "processor_status": self.status,
        "data_loaded": self.data is not None,
        "record_count": len(self.data["records"]) if self.data else 0,
        "transformations_applied": self.transformations_applied,
        "export_count": self.export_count,
        "validation_enabled": self.validation_enabled,
        "created_at": self.created_at.isoformat(),
        "uptime_seconds": round(uptime, 2),
    }

    return stats

process

process(data: Any) -> Any

Process data using the internal pipeline.

Implementation of the abstract process method from BaseProcessor. This method provides a simplified interface for basic data processing.

Parameters:

  data (Any, required): Data to process

Returns:

  Any: Processed data

Raises:

  ProcessingError: If processing fails
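
Example

A minimal usage sketch (the input record is illustrative; per the source below, a single dict is wrapped in a 'records' list):

result = processor.process({"id": 1, "name": "Alice"})
print(result)  # {'records': [{'id': 1, 'name': 'Alice'}]}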

Source code in src/docstring_examples/google_style.py
def process(self, data: Any) -> Any:
    """Process data using the internal pipeline.

    Implementation of the abstract process method from BaseProcessor.
    This method provides a simplified interface for basic data processing.

    Args:
        data: Data to process

    Returns:
        Processed data

    Raises:
        ProcessingError: If processing fails
    """
    try:
        self.load_data(data)
        return self.data
    except Exception as e:
        raise ProcessingError(f"Processing failed: {e}", original_error=e)

deactivate

deactivate() -> None

Deactivate the processor.

Once deactivated, the processor should not perform any operations until reactivated.
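
Example

A short sketch (status values follow the status property above):

processor.deactivate()
print(processor.status)  # "inactive"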

Source code in src/docstring_examples/google_style.py
def deactivate(self) -> None:
    """Deactivate the processor.

    Once deactivated, the processor should not perform any operations
    until reactivated.
    """
    self.is_active = False
    logger.info(f"Processor '{self.name}' deactivated")

ProcessingError Exception

src.docstring_examples.google_style.ProcessingError

ProcessingError(message: str, error_code: Optional[str] = None, original_error: Optional[Exception] = None)

Bases: Exception

Custom exception for data processing errors.

This exception is raised when data processing operations fail due to invalid data, configuration errors, or runtime issues.

Attributes:

  message (str): Error message describing the failure
  error_code (Optional[str]): Optional error code for categorization
  original_error (Optional[Exception]): Original exception that caused this error

Initialize ProcessingError.

Parameters:

  message (str, required): Descriptive error message
  error_code (Optional[str], default None): Optional categorization code
  original_error (Optional[Exception], default None): The original exception if
    this is a wrapper
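
Example

A handling sketch (the file name is illustrative; attribute access follows the signature above):

try:
    processor.load_from_file("missing.json")
except ProcessingError as e:
    print(e.message)
    if e.original_error is not None:
        print(f"caused by: {e.original_error!r}")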

Module-Level Functions

Functions Coming Soon

Module-level function documentation will be added when the source code is available.

Example Usage

from docstring_examples.google_style import DataProcessor

# Create a processor instance
processor = DataProcessor(
    name="sales_analytics",
    validation_enabled=True,
    max_transformations=10
)

# Load and process data (sample records are illustrative)
sales_data = [
    {"region": "north", "product": "Widget", "sales": 1500},
    {"region": "south", "product": "Gadget", "sales": 800},
]
processor.load_data(sales_data)
processor.transform_data(lambda x: x.upper() if isinstance(x, str) else x)
processor.apply_filter(lambda record: record.get('sales', 0) > 1000)

# Export results
processor.export_data('output.json')

Style Benefits

Readability

  • Clean, minimal syntax
  • Natural indentation
  • Easy to scan

Tooling Support

  • Excellent IDE support
  • Works with most documentation generators
  • Compatible with type hints

Best Practices

  • Keep descriptions concise
  • Use consistent formatting
  • Include type information
  • Provide clear examples
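
Applied together, these practices produce docstrings like the following (a generic illustration, not part of the library above):

from typing import Any, Dict, List

def top_scores(records: List[Dict[str, Any]], limit: int = 3) -> List[int]:
    """Return the highest scores from a list of records.

    Args:
        records: Records containing a numeric 'score' field.
        limit: Maximum number of scores to return (default: 3).

    Returns:
        The top scores in descending order.

    Example:
        >>> top_scores([{"score": 90}, {"score": 75}], limit=1)
        [90]
    """
    scores = sorted((r.get("score", 0) for r in records), reverse=True)
    return scores[:limit]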