Detecting Security Threats and Investigating Incidents with AWS CloudTrail (2026 Edition): CIS Benchmark Monitoring, GuardDuty/Security Hub Integration, and Forensic Investigation in Practice

2 a.m. Suppose you're jolted awake by an alert that "an unfamiliar IAM user has been created in the production account." What's the first thing you should look at?

There's one answer. Is CloudTrail still alive. Because the standard first thing an attacker does is "stop the trail and erase their footprints." The logs after a trail is stopped don't capture the crucial act of compromise. Both detection and investigation start from here first.

I designed and led the reliability layer of a serverless (Lambda + DynamoDB) payment platform, and maintained 0 double charges during production operation. The core of that discipline is "designing from the start a state where correctness can be proven non-repudiably even after the fact." In a system where money moves, "probably fine" doesn't fly. Leaving in a tamper-proof form who, when, with which permission, from where, did what, and being able to pull it up in seconds when needed — that foundation is security operations centered on CloudTrail.

This article is a guide for assembling CloudTrail, not as "a place to put audit logs," but as a real-combat playbook for threat detection, fraud detection, and incident investigation (forensics). It handles end to end: the detection code, CIS-benchmark-compliant monitoring, integration with GuardDuty/Security Hub/Detective, Athena/Lake investigation queries, and evidence preservation.

This article's premise and division of labor: the basic setup of CloudTrail (multi-Region trail, encryption, enabling integrity validation, the mechanism of EventBridge/CloudWatch integration itself) is handled by the CloudTrail complete guide (the pillar article). On that foundation, this article concentrates on the deep dive into security operations and forensics. The specs, conditions, and finding names are all checked against the AWS official documentation (as of June 2026). Because pricing and the status of managed features are revised quickly, always confirm the latest official information before going to production. The account ID (111122223333), bucket names, and Region are illustrative.

0. Mental model: CloudTrail is the foundation of "detection" and "investigation," and the "evidence"

In the context of security operations, let me fix CloudTrail's role split into 3. This is the map of the whole article.

Detection = notice what's happening now. Real-time-ness is the lifeline. EventBridge / CloudWatch metric filters / GuardDuty.

Investigation = reconstruct what happened in the past with cross-cutting queries. Comprehensiveness is the lifeline. Athena / CloudTrail Lake.

Evidence = prove non-repudiably that "this credential really hit this API." Integrity is the lifeline. Log integrity validation / S3 Object Lock.

Aspect	What realizes it	The property it values	Time axis
Detection	EventBridge → Lambda/SNS, CloudWatch metric filters + alarms, GuardDuty	Latency (seconds to minutes)	Now
Investigation	Athena (logs on S3), CloudTrail Lake (Trino SQL)	Comprehensiveness, cross-cutting	The past (>90 days)
Evidence	Log integrity validation (SHA-256 / digest chain), S3 Object Lock / MFA Delete	Integrity, non-repudiation	After the fact

A CloudTrail event has "core fields" used repeatedly in both detection and investigation. Put these in your body and you can decipher all the later queries and filters (from the official userIdentity / event-record spec).

Field	Meaning	Where it's used in detection / investigation
`userIdentity.type`	The type of the calling principal	`Root` / `IAMUser` / `AssumedRole` / `Role` / `FederatedUser` / `AWSService` / `IdentityCenterUser`
`userIdentity.arn`	The principal's ARN	Identifying the compromised principal, the tracking key for all its actions
`userIdentity.sessionContext.attributes.mfaAuthenticated`	Whether the session was MFA-authenticated	Be wary of `false` sign-ins
`eventName`	The API called	The substance of the operation, like `StopLogging`, `CreateAccessKey`
`eventSource`	The service	`cloudtrail.amazonaws.com`, `iam.amazonaws.com`, etc.
`sourceIPAddress`	The caller's IP	A suspicious IP, Tor, access from abroad
`errorCode`	The error	A surge of `AccessDenied` / `UnauthorizedOperation` = a sign of reconnaissance
`awsRegion`	The Region	An operation in an unused Region = a sign of compromise
`readOnly`	Whether read-only	Prioritize monitoring write-system (destructive) operations

Note: the correct values of userIdentity.type include IdentityCenterUser, but a value IAMIdentityCenter doesn't exist. Don't confuse them when writing filters or queries.

1. The top-priority detection: an attacker comes to "stop the trail" first

Back to the opening scenario. The single most valuable thing in incident detection is, plainly, the immediate detection of "a tampering operation against CloudTrail itself." Because if this gets stopped, the premise of all subsequent detection and investigation collapses.

The eventSource to monitor is cloudtrail.amazonaws.com, and the central eventNames are the next 4.

eventName	The attacker's intent
`StopLogging`	Stop the trail and halt log recording (erasing footprints)
`DeleteTrail`	Delete the trail itself
`UpdateTrail`	Tamper with the trail's settings (un-set multi-Region, change the log destination, etc.)
`PutEventSelectors`	Narrow the events to record, excluding their own operations from recording

"Look back at the logs later" is too slow for this. Because you need to detect in near real time, in seconds, the first choice is EventBridge → Lambda/SNS, not a CloudWatch metric filter (which has a buffer of several minutes). EventBridge can receive CloudTrail's API calls as detail-type: "AWS API Call via CloudTrail".

The EventBridge rule (Terraform)

# 証跡改ざん操作をニアリアルタイムで捕捉する EventBridge ルール
resource "aws_cloudwatch_event_rule" "cloudtrail_tampering" {
  name        = "detect-cloudtrail-tampering"
  description = "Detect attempts to stop, delete, or weaken CloudTrail logging"

  event_pattern = jsonencode({
    "detail-type" : ["AWS API Call via CloudTrail"],
    "detail" : {
      "eventSource" : ["cloudtrail.amazonaws.com"],
      "eventName" : ["StopLogging", "DeleteTrail", "UpdateTrail", "PutEventSelectors"]
    }
  })
}

resource "aws_cloudwatch_event_target" "tampering_to_sns" {
  rule      = aws_cloudwatch_event_rule.cloudtrail_tampering.name
  target_id = "notify-secops"
  arn       = aws_sns_topic.security_alerts.arn
}

# Lambda でトリアージ情報を整形してから通知したい場合は、こちらをターゲットに
resource "aws_cloudwatch_event_target" "tampering_to_lambda" {
  rule      = aws_cloudwatch_event_rule.cloudtrail_tampering.name
  target_id = "triage-lambda"
  arn       = aws_lambda_function.tampering_triage.arn
}

The triage Lambda (TypeScript)

Rather than just firing a notification, shape it into a form where "who, from where, which trail" they tried to stop is instantly readable. An alert has meaning only when it arrives in a "readable and judgable" state.

import type { EventBridgeEvent } from "aws-lambda";
import { SNSClient, PublishCommand } from "@aws-sdk/client-sns";

const sns = new SNSClient({});
const TOPIC_ARN = process.env.SNS_TOPIC_ARN!; // 環境変数のみ。ハードコード禁止

// CloudTrail 経由の API コール詳細（必要フィールドのみ）
interface CloudTrailDetail {
  eventName: string;
  eventSource: string;
  sourceIPAddress?: string;
  awsRegion?: string;
  userIdentity?: {
    type?: string;
    arn?: string;
    sessionContext?: { attributes?: { mfaAuthenticated?: string } };
  };
  requestParameters?: Record<string, unknown>;
}

export const handler = async (
  event: EventBridgeEvent<"AWS API Call via CloudTrail", CloudTrailDetail>,
): Promise<void> => {
  const d = event.detail;
  const id = d.userIdentity;

  const message = [
    "[CRITICAL] CloudTrail tampering detected",
    `eventName : ${d.eventName}`,
    `principal : ${id?.arn ?? "unknown"} (type=${id?.type ?? "unknown"})`,
    `mfa       : ${id?.sessionContext?.attributes?.mfaAuthenticated ?? "n/a"}`,
    `sourceIP  : ${d.sourceIPAddress ?? "unknown"}`,
    `region    : ${d.awsRegion ?? "unknown"}`,
    `time      : ${event.time}`,
    `params    : ${JSON.stringify(d.requestParameters ?? {})}`,
  ].join("\n");

  await sns.send(
    new PublishCommand({
      TopicArn: TOPIC_ARN,
      Subject: `CloudTrail tampering: ${d.eventName}`,
      Message: message,
    }),
  );
};

A design pressure point: consider prevention of "can't stop the trail" as a set too. Deny cloudtrail:StopLogging / cloudtrail:DeleteTrail with an SCP at the organization level, and a compromised principal in a member account can't stop the trail. The design of defense-in-depth with SCPs and multi-account auditing is detailed in the organization-trail / multi-account audit guide (not chased here). Detection (this chapter) and prevention (SCPs) are two wheels of a cart.

2. Detecting attack signs: a collection of CIS-benchmark-compliant alerts

The net to stretch next after trail tampering is monitoring alerts for "typical behaviors of compromise." Here, one most important honesty point.

The well-known monitoring alerts for "root usage," "unauthorized APIs," "sign-in without MFA," etc. are NOT the ones on the 'examples' page of CloudTrail's user guide. The metric filters CloudTrail officially lists as "examples" are actually only three.

The "CloudWatch alarm examples" page of CloudTrail's official documentation provides only these 3 examples.

Monitoring target	The eventName the metric filter picks up	Metric name
Security-group change	The SG-operation group from `AuthorizeSecurityGroupIngress` to `DeleteSecurityGroup`	`SecurityGroupEventCount`
Console sign-in failure	`ConsoleLogin` and `errorMessage = "Failed authentication"`	`ConsoleSigninFailureCount`
IAM policy change	The IAM-policy-operation group from `PutUserPolicy` to `DetachGroupPolicy`	`IAMPolicyEventCount`

Note that the filter pattern for console sign-in failure is, per the official docs, the following (usable as-is).

{ ($.eventName = ConsoleLogin) && ($.errorMessage = "Failed authentication") }

So, where do the standard alert groups widely spread as "CloudTrail monitoring best practices" — root usage, unauthorized APIs, sign-in without MFA, CMK disabling… — come from? The source is the CIS AWS Foundations Benchmark (and AWS Security Hub's CIS standard controls). To be precise, Security Hub's CloudWatch.1 – CloudWatch.14 controls correspond to each CIS recommendation (v1.2.0's 3.x / v1.4.0's 4.x).

Why being able to correctly cite the source is important. When asked in an audit or review "what is this monitoring compliant with," answering "it's CloudTrail's official examples" is inaccurate and damages trust. Being able to say "it's compliant with §3/§4 of the CIS AWS Foundations Benchmark and Security Hub's CloudWatch controls" is accountability for the design itself.

A list of CIS-compliant monitoring alerts (Security Hub CloudWatch controls)

A correspondence table based on the official Security Hub user guide. Each control is implemented with the same mechanism of "flow CloudTrail → CloudWatch Logs and place a metric filter + alarm."

Security Hub control	Monitoring target	Corresponding CIS
CloudWatch.1	Use of the root user	CIS v1.2.0/3.3, v1.4.0/4.3
CloudWatch.2	Unauthorized API calls (`AccessDenied`/`UnauthorizedOperation`)	CIS v1.2.0/3.1
CloudWatch.3	Management-console sign-in without MFA	CIS v1.2.0/3.2
CloudWatch.4	IAM policy change	CIS v1.2.0/3.4, v1.4.0/4.4
CloudWatch.5	CloudTrail configuration change	CIS v1.2.0/3.5, v1.4.0/4.5
CloudWatch.6	Console authentication failure	CIS v1.2.0/3.6, v1.4.0/4.6
CloudWatch.7	Disabling / scheduled deletion of a CMK (customer-managed key)	CIS v1.2.0/3.7, v1.4.0/4.7
CloudWatch.8	S3 bucket-policy change	CIS v1.2.0/3.8, v1.4.0/4.8
CloudWatch.9	AWS Config configuration change	CIS v1.2.0/3.9, v1.4.0/4.9
CloudWatch.10	Security-group change	CIS v1.2.0/3.10, v1.4.0/4.10
CloudWatch.11	NACL (network ACL) change	CIS v1.2.0/3.11, v1.4.0/4.11
CloudWatch.12	Network-gateway change	CIS v1.2.0/3.12, v1.4.0/4.12
CloudWatch.13	Route-table change	CIS v1.2.0/3.13, v1.4.0/4.13
CloudWatch.14	VPC change	CIS v1.2.0/3.14, v1.4.0/4.14

Supplement (from the official note): these Security Hub controls become FAILED unless they exactly match "the precise metric filter the CIS prescribes." Add your own fields or phrases and it fails. On the other hand, only the remediation procedure examples for IAM-policy change (CloudWatch.4) and route-table change (CloudWatch.13) present a recommendation AWS intentionally changed from the CIS pattern (to "target only IAM/EC2 API calls"). For compliance purposes, use the CIS pattern strictly; for operational-noise-reduction purposes, use AWS's remediation examples — distinguish them.

Implementation example 1: detecting unauthorized APIs (AccessDenied surge) — CIS 3.1 / CloudWatch.2

An attacker who grabbed a compromised credential first probes "what can I do with this credential." In that process, AccessDenied / UnauthorizedOperation surges in a short time. This is an extremely useful signal of reconnaissance.

# CIS AWS Foundations Benchmark 3.1 / Security Hub CloudWatch.2 準拠
# 未認可API呼び出しの検知（CISが規定する正確なパターン）
resource "aws_cloudwatch_log_metric_filter" "unauthorized_api_calls" {
  name           = "CIS-UnauthorizedAPICalls"
  log_group_name = var.cloudtrail_log_group_name # CloudTrailの配信先ロググループ

  pattern = "{ ($.errorCode = \"*UnauthorizedOperation\") || ($.errorCode = \"AccessDenied*\") }"

  metric_transformation {
    name      = "UnauthorizedAPICalls"
    namespace = "CISBenchmark"
    value     = "1"
  }
}

resource "aws_cloudwatch_metric_alarm" "unauthorized_api_calls" {
  alarm_name          = "CIS-3.1-UnauthorizedAPICalls"
  comparison_operator = "GreaterThanOrEqualToThreshold"
  evaluation_periods  = 1
  metric_name         = "UnauthorizedAPICalls"
  namespace           = "CISBenchmark"
  period              = 300
  statistic           = "Sum"
  threshold           = 1
  alarm_description   = "CIS 3.1: Unauthorized API calls detected"
  alarm_actions       = [aws_sns_topic.security_alerts.arn]
  treat_missing_data  = "notBreaching"
}

Implementation example 2: detecting root-user usage — CIS 1.7/4.3 / CloudWatch.1

Because root has all permissions, its use in daily operation should in principle be zero. Pick up ones where userIdentity.type is Root and that aren't calls by an AWS service itself (invokedBy).

# CIS / Security Hub CloudWatch.1 準拠：root ユーザー利用の検知
resource "aws_cloudwatch_log_metric_filter" "root_usage" {
  name           = "CIS-RootAccountUsage"
  log_group_name = var.cloudtrail_log_group_name

  pattern = "{ $.userIdentity.type = \"Root\" && $.userIdentity.invokedBy NOT EXISTS && $.eventType != \"AwsServiceEvent\" }"

  metric_transformation {
    name      = "RootAccountUsage"
    namespace = "CISBenchmark"
    value     = "1"
  }
}

resource "aws_cloudwatch_metric_alarm" "root_usage" {
  alarm_name          = "CIS-RootAccountUsage"
  comparison_operator = "GreaterThanOrEqualToThreshold"
  evaluation_periods  = 1
  metric_name         = "RootAccountUsage"
  namespace           = "CISBenchmark"
  period              = 300
  statistic           = "Sum"
  threshold           = 1
  alarm_description   = "CIS: Root account used"
  alarm_actions       = [aws_sns_topic.security_alerts.arn]
  treat_missing_data  = "notBreaching"
}

Noise countermeasure: always set alarm_actions (SNS, etc.) on these alarms. Security Hub's CloudWatch.15 "alarms should have an action set" / CloudWatch.17 "alarm actions should be enabled" (these are NIST-family controls) hit exactly the typical hole of "created an alarm but there's no notification target / it's disabled." If the last meter of detection — the notification — isn't connected, the detector is the same as nonexistent.

3. Riding on managed detection: GuardDuty / Security Hub / Detective

A metric filter can pick up only "a pre-decided pattern." To catch unknown, anomalous behavior, the proper approach is to ride on managed detection services taking CloudTrail as input. Grasp the 3-way role split.

Service	Role	How it uses CloudTrail
Amazon GuardDuty	Threat detection (ML/anomaly detection/threat intel)	Takes CloudTrail's management events and S3 data events as a foundational data source, and detects anomalous APIs and known malicious patterns with ML
AWS Security Hub	CSPM (posture management) / compliance	Scores accounts against standards like CIS and AWS FSBP. Aggregates findings from GuardDuty and others
Amazon Detective	Investigation (investigation graph)	Builds an investigation graph from CloudTrail and others, visualizing the background of findings

GuardDuty: representative findings derived from CloudTrail (the official IAM finding types)

GuardDuty, with CloudTrail's management events as the main data source, generates findings about IAM entities / access keys (the Resource Type is AccessKey). Let me list, classified, the representative findings taking CloudTrail as input that are documented officially.

The ML-anomaly-detection family (*:IAMUser/AnomalousBehavior) — the data source is CloudTrail management events

Finding type	Meaning (which stage of the attack)	Default severity
`CredentialAccess:IAMUser/AnomalousBehavior`	Credential collection (`GetPasswordData`/`GetSecretValue` etc. in an anomalous form)	Medium
`DefenseEvasion:IAMUser/AnomalousBehavior`	Defense evasion (stopping/deleting like `StopLogging`/`DeleteFlowLogs`)	Medium
`Discovery:IAMUser/AnomalousBehavior`	Reconnaissance (enumeration like `DescribeInstances`/`ListAccessKeys`)	Low
`Exfiltration:IAMUser/AnomalousBehavior`	Data exfiltration (control-plane operations like `CreateSnapshot`/`PutBucketReplication`)	High
`Impact:IAMUser/AnomalousBehavior`	Destruction/tampering (`DeleteSecurityGroup`/`PutBucketPolicy` etc.)	High
`Persistence:IAMUser/AnomalousBehavior`	Persistence (`CreateAccessKey`/`ImportKeyPair` etc.)	Medium
`PrivilegeEscalation:IAMUser/AnomalousBehavior`	Privilege escalation (`AddUserToGroup`/`PutUserPolicy` etc.)	Medium
`InitialAccess:IAMUser/AnomalousBehavior`	Initial access (`GetAuthorizationToken` etc.)	Medium

The signature / threat-intel family

Finding type	Meaning	Default severity
`Policy:IAMUser/RootCredentialUsage`	An API was called with root sign-in credentials	Low
`Stealth:IAMUser/CloudTrailLoggingDisabled`	CloudTrail logging was disabled	Low
`Stealth:IAMUser/PasswordPolicyChange`	The account's password policy was weakened	Low to High
`Recon:IAMUser/MaliciousIPCaller`	A resource-enumeration API was called from a known malicious IP	Medium
`Recon:IAMUser/TorIPCaller`	A resource-enumeration API was called from a Tor exit node	Medium
`UnauthorizedAccess:IAMUser/MaliciousIPCaller`	An API was called from a known malicious IP	Medium
`UnauthorizedAccess:IAMUser/InstanceCredentialExfiltration.OutsideAWS`	EC2's temporary credentials were used from an IP outside AWS (credential theft)	High
`UnauthorizedAccess:IAMUser/ConsoleLoginSuccess.B`	The same IAM user signed in successfully multiple times from around the world in a short time	Medium

The difference from CredentialAccess:IAMUser/CompromisedCredentials: this isn't ML anomaly detection but a High finding indicating that an access key judged "already leaked" by Amazon's threat intelligence was used. The data source is documented officially as "a feature included in Foundational Data Source Protection," and its generation basis differs from the AnomalousBehavior family above (ML of CloudTrail management events). Don't confuse them.

GuardDuty findings flow to EventBridge (detail-type: "GuardDuty Finding"). Using this, you can assemble "if a finding of severity 7 or higher appears, immediately SNS / an auto-containment Lambda" in the same way as Chapter 1. The defense-in-depth, multi-account design of GuardDuty alone (delegated administrator, EventBridge automated response, Terraform) is split into the GuardDuty deep-dive guide.

# GuardDuty の重大度の高い Finding を SecOps に即通知する
resource "aws_cloudwatch_event_rule" "guardduty_high_severity" {
  name = "guardduty-high-severity-findings"
  event_pattern = jsonencode({
    "source" : ["aws.guardduty"],
    "detail-type" : ["GuardDuty Finding"],
    # GuardDuty の severity は数値。7.0 以上 = High 以上を対象にする
    "detail" : { "severity" : [{ "numeric" : [">=", 7.0] }] }
  })
}

Security Hub: visualize "the holes in the detection net" with the CIS score

If GuardDuty is "dynamic detection," Security Hub is "static scoring of the posture." Enable the CIS / AWS FSBP standards including CloudWatch.1–17 from the previous chapter, and the holes in detection themselves — "no root-monitoring alert," "no monitoring of CloudTrail configuration changes" — are listed as FAILED. It's a device that prevents "you think you built a detector but didn't."

4. The incident-investigation playbook: forensics with Athena / Lake

After detection comes investigation. If detection is a "point," investigation is "lines and a plane" — reconstruct all of the compromised principal's actions chronologically and confirm the scope of the damage.

For a cross-cutting query over a period exceeding 90 days, or over a large volume of logs, Athena (SQL directly on CloudTrail logs on S3) is realistic. Because scan volume = billing, the iron rule is to narrow the date and Region with partition projection. CloudTrail Lake can query an immutable event data store directly with Trino SQL, but it ended new-customer onboarding as of May 31, 2026 (existing customers can continue). For a new build, Athena + S3 is the standard answer.

The Athena queries below assume that an external table for CloudTrail logs (with partition projection) has already been created. How to create the table definition (ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe' and the partition-projection properties) is left to the pillar article.

The IR playbook: detection → triage → scope → containment → evidence preservation

1. 検知    : EventBridge / GuardDuty / CISアラームで気づく
2. トリアージ: 「本物か・誤検知か」「侵害プリンシパルは誰か(ARN)」を確定
3. スコープ : Athena/Lake で侵害ARNの全行動を時系列再構成、被害範囲を確定
4. 封じ込め : 鍵の無効化/失効、ポリシーのデタッチ、セッションの取り消し
5. 証拠保全 : 整合性検証(validate-logs)、ログの隔離コピー、否認不能化（第5章）

Let me list the queries you repeatedly hit in the field, used in step 3, "scoping."

Query 1: reconstruct "all actions" of the compromised principal chronologically

What, when, and from where the ARN identified in triage did. This is the starting point of all investigation.

-- 侵害が疑われるプリンシパルの全 API コールを時系列で並べる
SELECT
  eventtime,
  eventsource,
  eventname,
  sourceipaddress,
  awsregion,
  errorcode,
  readonly
FROM cloudtrail_logs
WHERE useridentity.arn = 'arn:aws:sts::111122223333:assumed-role/AppRole/suspicious-session'
  AND timestamp BETWEEN '2026/06/26' AND '2026/06/27' -- パーティション射影で絞る
ORDER BY eventtime ASC;

Query 2: signs of persistence — new access-key creation

An attacker creates access keys to solidify their foothold (corresponds to GuardDuty's Persistence:IAMUser/AnomalousBehavior).

-- 調査期間内に作成されたアクセスキー（誰が・誰に対して・どこから）
SELECT
  eventtime,
  useridentity.arn          AS actor,
  responseelements,         -- 作成されたキーIDやユーザー名が入る
  sourceipaddress
FROM cloudtrail_logs
WHERE eventname = 'CreateAccessKey'
  AND timestamp BETWEEN '2026/06/20' AND '2026/06/27'
ORDER BY eventtime ASC;

Query 3: signs of privilege escalation — IAM grant-system operations

-- 権限昇格に使われがちな IAM 操作を洗い出す
SELECT eventtime, useridentity.arn AS actor, eventname, requestparameters, sourceipaddress
FROM cloudtrail_logs
WHERE eventsource = 'iam.amazonaws.com'
  AND eventname IN (
    'AttachUserPolicy', 'AttachRolePolicy', 'AttachGroupPolicy',
    'PutUserPolicy', 'PutRolePolicy', 'PutGroupPolicy',
    'AddUserToGroup', 'CreatePolicyVersion', 'UpdateAssumeRolePolicy'
  )
  AND timestamp BETWEEN '2026/06/20' AND '2026/06/27'
ORDER BY eventtime ASC;

Back up, in investigation, the control-plane operations Exfiltration:IAMUser/AnomalousBehavior captures.

-- スナップショットの外部共有・S3公開化など、持ち出しに繋がる制御操作
SELECT eventtime, useridentity.arn AS actor, eventname, requestparameters, sourceipaddress
FROM cloudtrail_logs
WHERE eventname IN (
    'ModifySnapshotAttribute',     -- スナップショットを他アカウントへ共有
    'CreateSnapshot',
    'PutBucketPolicy',             -- バケットポリシーの書き換え
    'PutBucketAcl',
    'PutBucketReplication',        -- 別バケットへの複製設定
    'DeleteBucketPublicAccessBlock'-- パブリックアクセスブロック解除
  )
  AND timestamp BETWEEN '2026/06/20' AND '2026/06/27'
ORDER BY eventtime ASC;

Query 5: tracking source IPs / sessions — grasping lateral movement

Is the compromise confined to "one ARN," or is it moving laterally? With a suspicious IP as the axis, surface all the principals that operated from that IP.

-- 特定の不審 IP から行動したすべてのプリンシパルと操作
SELECT
  useridentity.arn AS actor,
  useridentity.type AS actor_type,
  eventname,
  COUNT(*) AS calls,
  MIN(eventtime) AS first_seen,
  MAX(eventtime) AS last_seen
FROM cloudtrail_logs
WHERE sourceipaddress = '203.0.113.45'
  AND timestamp BETWEEN '2026/06/20' AND '2026/06/27'
GROUP BY useridentity.arn, useridentity.type, eventname
ORDER BY calls DESC;

Query 6: surfacing MFA-less sessions

-- MFA 認証されていないセッションでの書き込み系操作（侵害セッションの疑い）
SELECT eventtime, useridentity.arn AS actor, eventname, sourceipaddress
FROM cloudtrail_logs
WHERE useridentity.sessioncontext.attributes.mfaauthenticated = 'false'
  AND readonly = false
  AND timestamp BETWEEN '2026/06/26' AND '2026/06/27'
ORDER BY eventtime ASC;

A cost pitfall: always attach the WHERE timestamp BETWEEN ... partition-projection condition every time. Forget this and it full-scans years' worth, all-Regions logs, and billing spikes. Because Athena bills by the GB scanned, make "narrow by date first" a habit even in forensics, until it's second nature.

5. Evidence preservation and non-repudiation: prove "this credential really did it"

Once the scope is fixed in investigation, the last step is evidence preservation. Neglect this and the hard-won investigation result loses its power in an audit, legal proceedings, or internal explanation, because "it can't be proven untampered."

Log integrity validation: tamper detection and non-repudiation

CloudTrail's log-file integrity validation is built with hashing by SHA-256 and digital signing by SHA-256 with RSA. Every hour, a digest file containing references and hashes to the group of log files delivered in that hour is created, and each digest forms a chain by including the signature of the previous digest. With this, you can detect tampering, deletion, and forgery of the logs, and further can actively assert even "that not a single log was delivered in a certain period."

The official wording succinctly expresses this feature's value.

A validated log file lets you assert positively that "a particular user credential performed particular API activity" — this is the core of non-repudiation in security / forensic investigation.

Validation can be run with a single AWS CLI command.

# S3 上の指定期間のログを digest チェーンに照らして検証する
aws cloudtrail validate-logs \
  --trail-arn arn:aws:cloudtrail:ap-northeast-1:111122223333:trail/org-audit-trail \
  --start-time 2026-06-26T00:00:00Z \
  --end-time   2026-06-27T00:00:00Z

If you want to move the logs to another location (an isolation bucket, etc.) and validate there, because the CLI validation presupposes the original location where the logs were delivered, implement custom validation (the official docs have the procedure). If you make an isolated copy of the evidence, weave this point into the design.

Storage design for non-repudiation

Even if the validation algorithm is robust, it's meaningless if the logs themselves are deleted later. You need storage design that ensures their value as evidence.

Countermeasure	What it prevents	Note
S3 Object Lock (WORM)	Overwriting/deleting logs and digests (even with root permission)	In compliance mode, no one can delete during the retention period
S3 MFA Delete	Illegal deletion of digest files	Officially stated as a digest-protection measure. Requires MFA for deletion
An isolated account for the log archive	Log destruction from a compromised account	Put the log S3 in a separate account (Log Archive) and allow only write from the production account
SSE-KMS + a strict key policy	Encryption, and invalidating logs by deleting the key	Monitor CMK disabling with CIS's CloudWatch.7 (Chapter 2)

The design philosophy: ideally, separate "the account that generates logs" and "the account that stores logs." Even if the production account is completely compromised, the WORM-ized logs in the Log Archive account remain intact and function as non-repudiable evidence. "Creating from the start a state where correctness can be proven after the fact" on a payment platform is exactly this stance.

6. Summary: a detection → investigation → preservation cheat sheet

Let me fold into one sheet, at the end, the key points of elevating CloudTrail from "a log storehouse" to "the foundation of security operations."

Phase	What to do	Tool used	Source / basis
Top-priority detection	Detect trail tampering (`StopLogging`/`DeleteTrail`/`UpdateTrail`/`PutEventSelectors`) in seconds	EventBridge → Lambda/SNS	CloudTrail EventBridge integration
Sign detection	root usage, unauthorized APIs, sign-in without MFA, IAM escalation, CloudTrail config change, etc.	CloudWatch metric filters + alarms	CIS AWS Foundations Benchmark / Security Hub CloudWatch.1–14
Managed detection	Unknown anomalous APIs with ML/threat intel	GuardDuty (CloudTrail is a foundational data source)	GuardDuty IAM findings (official)
Posture scoring	Visualize detection holes (monitoring gaps)	Security Hub (CIS/FSBP)	Security Hub standards
Investigation	Reconstruct all of the compromised ARN's actions chronologically, fix the scope	Athena (partition projection) / CloudTrail Lake	—
Evidence preservation	Tamper detection, non-repudiation	`validate-logs`, S3 Object Lock/MFA Delete, an isolated account	Log integrity validation (SHA-256 / digest chain)

The line you must not cross is this.

A detector becomes a detector only when the notification is connected (CloudWatch.15/17). Evidence becomes evidence only when the logs become tamper- and delete-proof (integrity validation + WORM). "An alarm exists but there's no notification target," "logs exist but can be deleted" are the same as nonexistent.

And let me repeat the most important honesty point. Standard alerts like root usage and unauthorized-API monitoring come from the CIS AWS Foundations Benchmark (Security Hub's controls), not CloudTrail's official 'examples.' Being able to correctly cite the source is accountability itself in an audit or review.

Security operations isn't "set it and forget it"; its value is decided by whether you can fully design the loop of detection → investigation → preservation into a state anyone can run. In the "money moves" domain of payments, I led the design of reliability / auditing that underpins 0 double charges. Dropping "a state where correctness can be proven non-repudiably after the fact" into code from the start — I can apply that experience to the security monitoring / incident-response design of your AWS environment.

If you're troubled by designing a threat-detection / forensics / IR posture centered on CloudTrail, feel free to consult me from contact. From surfacing the current detection holes, let's nail it down together.

Related deep-dive articles: for CloudTrail's basic design see the CloudTrail complete guide, for GuardDuty's multi-layer detection the GuardDuty threat-detection guide, for network-layer defense the AWS WAF defense-in-depth guide, and for data-store permission design the DynamoDB security guide.

Detecting Security Threats and Investigating Incidents with AWS CloudTrail (2026 Edition): CIS Benchmark Monitoring, GuardDuty/Security Hub Integration, and Forensic Investigation in Practice

0. Mental model: CloudTrail is the foundation of "detection" and "investigation," and the "evidence"

1. The top-priority detection: an attacker comes to "stop the trail" first

The EventBridge rule (Terraform)

The triage Lambda (TypeScript)

2. Detecting attack signs: a collection of CIS-benchmark-compliant alerts

A list of CIS-compliant monitoring alerts (Security Hub CloudWatch controls)

Implementation example 1: detecting unauthorized APIs (AccessDenied surge) — CIS 3.1 / CloudWatch.2

Implementation example 2: detecting root-user usage — CIS 1.7/4.3 / CloudWatch.1

3. Riding on managed detection: GuardDuty / Security Hub / Detective

GuardDuty: representative findings derived from CloudTrail (the official IAM finding types)

Security Hub: visualize "the holes in the detection net" with the CIS score

4. The incident-investigation playbook: forensics with Athena / Lake

The IR playbook: detection → triage → scope → containment → evidence preservation

Query 1: reconstruct "all actions" of the compromised principal chronologically

Query 2: signs of persistence — new access-key creation

Query 3: signs of privilege escalation — IAM grant-system operations

Query 5: tracking source IPs / sessions — grasping lateral movement

Query 6: surfacing MFA-less sessions

5. Evidence preservation and non-repudiation: prove "this credential really did it"

Log integrity validation: tamper detection and non-repudiation

Storage design for non-repudiation

6. Summary: a detection → investigation → preservation cheat sheet

The Complete AWS CloudTrail Guide (2026 Edition): Designing API Activity Auditing, Trails, CloudTrail Lake, Athena Analysis, and Real-Time Detection at Production Quality

Preparing for Compliance Audits with AWS CloudTrail (2026 Edition): Leaving the Audit Trail for PCI DSS, SOC 2, ISO 27001, and HIPAA as Tamper-Proof Evidence

AWS CloudTrail Lake Practical Guide (2026 Edition): Analyzing Events with Trino SQL and How to Choose Between It and Athena+S3 — The Realistic Answer After New-Customer Onboarding Ended

Building a Company-Wide CloudTrail Audit Platform with AWS Organizations (2026 Edition): Aggregating Every Account's Trail with Organization Trails, Delegated Administrators, a Log Archive Account, SCPs, and Control Tower

Also worth reading

A thorough explanation of GuardDuty Extended Threat Detection and attack-sequence findings: reading and responding to weak-signal correlation, the 24-hour window, and Critical multi-stage attacks

Echo file-upload production design: receiving safely with multipart, S3 streaming, presigned URLs, and validation

GuardDuty × Amazon Detective: The 'Next Step' After Detection—A Workflow to Investigate Root Cause and Blast Radius

0. Mental model: CloudTrail is the foundation of "detection" and "investigation," and the "evidence"

1. The top-priority detection: an attacker comes to "stop the trail" first

The EventBridge rule (Terraform)

The triage Lambda (TypeScript)

2. Detecting attack signs: a collection of CIS-benchmark-compliant alerts

A list of CIS-compliant monitoring alerts (Security Hub CloudWatch controls)

Implementation example 1: detecting unauthorized APIs (AccessDenied surge) — CIS 3.1 / CloudWatch.2

Implementation example 2: detecting root-user usage — CIS 1.7/4.3 / CloudWatch.1

3. Riding on managed detection: GuardDuty / Security Hub / Detective

GuardDuty: representative findings derived from CloudTrail (the official IAM finding types)

Security Hub: visualize "the holes in the detection net" with the CIS score

4. The incident-investigation playbook: forensics with Athena / Lake

The IR playbook: detection → triage → scope → containment → evidence preservation

Query 1: reconstruct "all actions" of the compromised principal chronologically

Query 2: signs of persistence — new access-key creation

Query 3: signs of privilege escalation — IAM grant-system operations

Query 4: signs of data exfiltration — snapshot sharing, bucket publication

Query 5: tracking source IPs / sessions — grasping lateral movement

Query 6: surfacing MFA-less sessions

5. Evidence preservation and non-repudiation: prove "this credential really did it"

Log integrity validation: tamper detection and non-repudiation

Storage design for non-repudiation

6. Summary: a detection → investigation → preservation cheat sheet

Related articles

The Complete AWS CloudTrail Guide (2026 Edition): Designing API Activity Auditing, Trails, CloudTrail Lake, Athena Analysis, and Real-Time Detection at Production Quality

Preparing for Compliance Audits with AWS CloudTrail (2026 Edition): Leaving the Audit Trail for PCI DSS, SOC 2, ISO 27001, and HIPAA as Tamper-Proof Evidence

AWS CloudTrail Lake Practical Guide (2026 Edition): Analyzing Events with Trino SQL and How to Choose Between It and Athena+S3 — The Realistic Answer After New-Customer Onboarding Ended

Building a Company-Wide CloudTrail Audit Platform with AWS Organizations (2026 Edition): Aggregating Every Account's Trail with Organization Trails, Delegated Administrators, a Log Archive Account, SCPs, and Control Tower

Also worth reading

A thorough explanation of GuardDuty Extended Threat Detection and attack-sequence findings: reading and responding to weak-signal correlation, the 24-hour window, and Critical multi-stage attacks

Echo file-upload production design: receiving safely with multipart, S3 streaming, presigned URLs, and validation

GuardDuty × Amazon Detective: The 'Next Step' After Detection—A Workflow to Investigate Root Cause and Blast Radius