AWS - S3

February 19, 2022

Server side encryption using data keys fully managed by the customer outside of AWS
S3 does not store encryption key
HTTPS must be used
Key must be provided in HTTP headers for every request made

Store objects using write-once-read-many (WORM)
Prevents objects from being overwritten or deleted for a fixed amount of time or indefinitely
Helps meet regulatory requirements
Provides two ways to manage retention
- Retention period: Specifies a fixed period of time during which an object is locked
- Legal hold - Same as retention period, but has no expiration. In place until removed
Only works on versioned buckets
A new version of object can still be created
Object can gave both retention period and legal hold, one but not the other or none.

JSON based policies
- Resources: buckets and objects
- Actions: Set of APIs to ALLOW or DENY
- Principal: The account or users to apply the policy to
Use S3 bucket policy for:
- Grant public access to bucket
- Force objects to be encrypted
- Grant access to another account (cross account)
Block public access to buckets and objects granted through new ACLS, any ACLS, new public bucket or access point
Block public and cross-account access to buckets & objects through any public bucket or access point policies
These settings were created to prevent company data leaks
If you know the bucket should never be public leave these settings on. They can be set at the account level

Transition actions
- Defines when objects are transitioned to another storage class
Expiration actions
- Configure objects to delete after some time
- Can be used to delete old versions of files (if versioning enabled)
Rules can be created for a certain prefix
Rules can be created for object logs

S3 autoscales to high request rates
Your app can achieve at least 3500 PUT/COPY/POST/DELETE and 5500 GET/HEAD requests per second per prefix
No limits to no. of prefixes in a bucket
If you spread reads across prefixes evenly, you can achieve 22000 requests per second for GET and HEAD requests
Multi-part upload:
- recommended for files > 100 MB
- must use for files > 5GB
- can help parallellize uploads
- file divided into parts and uploaded
S3 Transfer Acceleration
- Increase transfer speed by transferring file to an AWS edge location which will forward the data to the S3 bucket in target region
- Compatible with multi-part upload

User based
- IAM policies: which API calls should be allowed for a specific user
Resource based
- Bucket policies: bucket wide rules from the S3 console allows cross account
- Object Access Control List (ACL) - finer grain
- Bucket Access Control List (ACL) - less common
An IAM principal can access object if:
- the user permissions allow it OR the resource policy allows it AND no explicit DENY
Networking
- Supports VPC endpoints
Logging & Audit
- S3 Access logs can be stored in another bucket
- API calls logged in CloudTrail

If a client does a corss origin request on our S3 bucket we need to enable the correct CORS headers
Can allow for a specific origin or all origins

After a successful write of a new object or an overwrite or delete any subsequent request immediately recieves the latest version of the object (read after write consistency)
Any subsequent list request immediately reflects changes (list consistency)

Must enable versioning in source and destination
Cross region replication (CRR)
Same Region replication (SRR)
Buckets can be in different accounts
Copying is asynchronous
Needs proper IAM permissions
CRR uses cases:
- compliance, lower latency access, replication across accounts
SRR use cases:
- log aggregation, live replication between test and production accounts
Only new objects can be replicated
For DELETE operations:
- Can replicate delete markers from source to target
- Deletions with a version ID are not replicated
No chaining of replication
- eg. If bucket 1 replicates to bucket 2, which repilcates to bucket 3, objects in 1 are not replicated to 3.

Amazon Glacier - 3 retrieval options
- Expedited (1-5 mins)
- Standard (3-5 hrs)
- Bulk (5 - 12 hrs)
- Minimum storage duration of 90 days
Amazon Glacier Deep Archive - for long term storage - cheaper
- Standard (12 hrs)
- Bulk (48 hrs)
- Min. storage duration of 180 days