Data Warehouse Log Tables Schema

This guide provides the schema definitions for the four log tables in our data warehouse:

  • logs.events
  • logs.production
  • logs.unextracted_events
  • logs.unextracted_production

logs.production

The logs.production table contains the following fields:

  • cloudwatch_timestamp
  • message
  • uuid
  • method
  • path
  • format
  • controller
  • action
  • status
  • duration
  • git_sha
  • git_branch
  • timestamp
  • pid
  • user_agent
  • ip
  • host
  • trace_id

logs.events

The logs.events table contains the following fields:

  • cloudwatch_timestamp
  • message
  • id
  • name
  • time
  • visitor_id
  • visit_id
  • log_filename
  • new_event
  • path
  • user_id
  • locale
  • user_ip
  • hostname
  • pid
  • service_provider
  • trace_id
  • git_sha
  • git_branch
  • user_agent
  • browser_name
  • browser_version
  • browser_platform_name
  • browser_platform_version
  • browser_device_name
  • browser_mobile
  • browser_bot
  • success

logs.unextracted_events

The logs.unextracted_events table contains the following fields:

  • cloudwatch_timestamp
  • message

logs.unextracted_production

The logs.unextracted_production table contains the following fields:

  • cloudwatch_timestamp
  • message
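
For scripts that need the field lists above in machine-readable form, the schemas can be written down as a small lookup. This is a minimal sketch: the field names are copied from the lists above, while the `LOG_TABLE_FIELDS` name and the `shared_fields` helper are hypothetical and not part of the warehouse itself. Column types are not listed in this guide, so none are assumed here.

```python
# Field names per table, copied from the lists in this guide.
# LOG_TABLE_FIELDS and shared_fields are illustrative names only.
LOG_TABLE_FIELDS = {
    "logs.production": [
        "cloudwatch_timestamp", "message", "uuid", "method", "path",
        "format", "controller", "action", "status", "duration",
        "git_sha", "git_branch", "timestamp", "pid", "user_agent",
        "ip", "host", "trace_id",
    ],
    "logs.events": [
        "cloudwatch_timestamp", "message", "id", "name", "time",
        "visitor_id", "visit_id", "log_filename", "new_event", "path",
        "user_id", "locale", "user_ip", "hostname", "pid",
        "service_provider", "trace_id", "git_sha", "git_branch",
        "user_agent", "browser_name", "browser_version",
        "browser_platform_name", "browser_platform_version",
        "browser_device_name", "browser_mobile", "browser_bot", "success",
    ],
    "logs.unextracted_events": ["cloudwatch_timestamp", "message"],
    "logs.unextracted_production": ["cloudwatch_timestamp", "message"],
}


def shared_fields(table_a, table_b):
    """Return the fields of table_a that also appear in table_b, in table_a order."""
    other = set(LOG_TABLE_FIELDS[table_b])
    return [field for field in LOG_TABLE_FIELDS[table_a] if field in other]


if __name__ == "__main__":
    # Fields present in both the production and events tables.
    print(shared_fields("logs.production", "logs.events"))
```

Every table carries cloudwatch_timestamp and message; the two unextracted tables carry only those two fields.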

NOTE: At present, only valid JSON is allowed to land in the data warehouse tables. For example, production log lines that contain a Ruby hash instead of JSON are ignored.

Below is an example of a log line that will NOT be ingested:

2024-06-10T17:10:15.234Z;"{:name=>""unused_identity_config_keys"", :keys=>[:ab_testing_idv_ten_digit_otp_enabled, :ab_testing_idv_ten_digit_otp_percent, :acuant_timeout, :disallow_all_web_crawlers, :doc_auth_exit_question_section_enabled, :doc_auth_selfie_capture_enabled, :platform_authentication_enabled, :phone_recaptcha_mock_validator]}"
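
To check ahead of time whether a log line would pass the valid-JSON rule, something like the following can be used. This is a sketch under stated assumptions, not the ingestion pipeline's actual check: it assumes lines follow the `timestamp;"message"` layout of the example above, with double quotes inside the message escaped by doubling, and it treats "valid JSON" as anything Python's `json.loads` accepts. The function names `split_log_line` and `is_valid_json` are hypothetical.

```python
import csv
import json


def split_log_line(line):
    """Split a 'timestamp;"message"' line into (timestamp, message).

    Assumes a semicolon-delimited line whose message field is quoted,
    with embedded double quotes doubled, as in the example above.
    """
    row = next(csv.reader([line], delimiter=";", quotechar='"'))
    timestamp, message = row
    return timestamp, message


def is_valid_json(message):
    """Return True if the message parses as JSON, False otherwise."""
    try:
        json.loads(message)
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return False
    return True


if __name__ == "__main__":
    # A shortened version of the Ruby-hash line shown above.
    ruby_line = (
        '2024-06-10T17:10:15.234Z;'
        '"{:name=>""unused_identity_config_keys"", :keys=>[:acuant_timeout]}"'
    )
    # A made-up JSON line in the same layout, for comparison.
    json_line = '2024-06-10T17:10:15.234Z;"{""name"": ""example""}"'

    for line in (ruby_line, json_line):
        timestamp, message = split_log_line(line)
        print(timestamp, is_valid_json(message))
```

The Ruby-hash message fails the check because `:name=>` is not JSON syntax, while the JSON message in the same layout passes.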