How can I tell how many objects I ve stored in an S3 bucket

Question

Unless I m missing something  it seems that none of the APIs I ve looked at will tell you how many objects are in an  lt S3 bucket gt   lt folder gt   Is there any way to get a count

User · Answer

The easiest way is to use the developer console, for example, if you are on chrome, choose Developer Tools, and you can see following, you can either find and count or do some match, like 280-279 + 1 = 2

...

User · Answer

One of the simplest ways to count number of objects in s3 is  Step 1  Select root folder Step 2  Click on Actions - gt  Delete  obviously  be careful - don t delete it  Step 3  Wait for a few mins aws will show you number of objects and its total size

User · Answer

If you re looking for specific files  let s say  jpg images  you can do the following   aws s3 ls s3   your bucket   grep jpg   wc -l

User · Answer

You can just execute this cli command to get the total file count in the bucket or a specific folder  Scan whole bucket  aws s3api list-objects-v2 --bucket testbucket   grep  Key    wc -l aws s3api list-objects-v2 --bucket BUCKET NAME   grep  Key    wc -l   you can use this command to get in details  aws s3api list-objects-v2 --bucket BUCKET NAME   Scan a specific folder  aws s3api list-objects-v2 --bucket testbucket --prefix testfolder --start-after testfolder    grep  Key    wc -l  aws s3api list-objects-v2 --bucket BUCKET NAME --prefix FOLDER NAME --start-after FOLDER NAME    grep  Key    wc -l

User · Answer

There is no way  unless you    list them all in batches of 1000  which can be slow and suck bandwidth - amazon seems to never compress the XML responses   or log into your account on S3  and go Account - Usage  It seems the billing dept knows exactly how many objects you have stored    Simply downloading the list of all your objects will actually take some time and cost some money if you have 50 million objects stored   Also see this thread about StorageObjectCount - which is in the usage data   An S3 API to get at least the basics  even if it was hours old  would be great

User · Answer

In s3cmd  simply run the following command  on a Ubuntu system    s3cmd ls -r s3   mybucket   wc -l

User · Answer

3Hub is discontinued  There s a better solution  you can use Transmit  Mac only   then you just connect to your bucket and choose Show Item Count from the View menu

User · Answer

You can use AWS cloudwatch metrics for s3 to see exact count for each bucket

User · Answer

Following is how you can do it using java client    lt dependency gt       lt groupId gt com amazonaws lt  groupId gt       lt artifactId gt aws-java-sdk-s3 lt  artifactId gt       lt version gt 1 11 519 lt  version gt   lt  dependency gt    import com amazonaws ClientConfiguration  import com amazonaws Protocol  import com amazonaws auth AWSStaticCredentialsProvider  import com amazonaws auth BasicAWSCredentials  import com amazonaws services s3 AmazonS3  import com amazonaws services s3 AmazonS3ClientBuilder  import com amazonaws services s3 model ObjectListing   public class AmazonS3Service        private static final String S3 ACCESS KEY ID    ACCESS KEY       private static final String S3 SECRET KEY    SECRET KEY       private static final String S3 ENDPOINT    S3 URL        private AmazonS3 amazonS3       public AmazonS3Service             ClientConfiguration clientConfiguration   new ClientConfiguration            clientConfiguration setProtocol Protocol HTTPS           clientConfiguration setSignerOverride  S3SignerType            BasicAWSCredentials credentials   new BasicAWSCredentials S3 ACCESS KEY ID  S3 SECRET KEY           AWSStaticCredentialsProvider credentialsProvider   new AWSStaticCredentialsProvider credentials           AmazonS3ClientBuilder EndpointConfiguration endpointConfiguration   new AmazonS3ClientBuilder EndpointConfiguration S3 ENDPOINT  null           amazonS3   AmazonS3ClientBuilder standard   withCredentials credentialsProvider  withClientConfiguration clientConfiguration                   withPathStyleAccessEnabled true  withEndpointConfiguration endpointConfiguration  build               public int countObjects String bucketName            int count   0          ObjectListing objectListing   amazonS3 listObjects bucketName           int currentBatchCount   objectListing getObjectSummaries   size            while  currentBatchCount    0                count    currentBatchCount              objectListing   amazonS3 listNextBatchOfObjects objectListing               currentBatchCount   objectListing getObjectSummaries   size                      return count

User · Answer

The api will return the list in increments of 1000   Check the IsTruncated property to see if there are still more  If there are  you need to make another call and pass the last key that you got as the Marker property on the next call  You would then continue to loop like this until IsTruncated is false   See this Amazon doc for more info  Iterating Through Multi-Page Results

User · Answer

From the command line in AWS CLI  use ls plus --summarize  It will give you the list of all of your items and the total number of documents in a particular bucket  I have not tried this with buckets containing sub-buckets   aws s3 ls  s3   MyBucket  --summarize   It make take a bit long  it took listing my 16 K documents about 4 minutes   but it s faster than counting 1K at a time

User · Answer

You can potentially use Amazon S3 inventory that will give you list of objects in a csv file

User · Answer

If you use the s3cmd command-line tool  you can get a recursive listing of a particular bucket  outputting it to a text file    s3cmd ls -r s3   logs mybucket subfolder   gt  listing txt   Then in linux you can run a wc -l on the file to count the lines  1 line per object    wc -l listing txt

User · Answer

2020 10 22 With AWS Console Use AWS Cloudwatch s metrics With AWS CLI Number of objects  or  aws s3api list-objects --bucket  lt BUCKET NAME gt  --prefix  quot  lt FOLDER NAME gt  quot    wc -l  or  aws s3 ls s3    lt BUCKET NAME gt   lt FOLDER NAME gt   --recursive --summarize --human-readable   grep  quot Total Objects quot   or with s4cmd  s4cmd ls -r s3    lt BUCKET NAME gt   lt FOLDER NAME gt     wc -l  Objects size  aws s3api list-objects --bucket  lt BUCKET NAME gt  --output json --query  quot  sum Contents   Size   length Contents     quot    awk   NR  2  print  0 next   NR  2  print  0 1024 1024 1024 quot  GB quot     or  aws s3 ls s3    lt BUCKET NAME gt   lt FOLDER NAME gt   --recursive --summarize --human-readable   grep  quot Total Size quot   or with s4cmd  s4cmd du s3    lt BUCKET NAME gt   or with CloudWatch metrics  aws cloudwatch get-metric-statistics --metric-name BucketSizeBytes --namespace AWS S3 --start-time 2020-10-20T16 00 00Z --end-time 2020-10-22T17 00 00Z --period 3600 --statistics Average --unit Bytes --dimensions Name BucketName Value  lt BUCKET NAME gt  Name StorageType Value StandardStorage --output json   grep  quot Average quot

User · Answer

If you are using AWS CLI on Windows  you can use the Measure-Object from PowerShell to get the total counts of files  just like wc -l on  nix   PS C   gt  aws s3 ls s3   mybucket  --recursive   Measure-Object  Count      25 Average    Sum        Maximum    Minimum    Property     Hope it helps

User · Answer

Here s the boto3 version of the python script embedded above   import sys import boto3  s3   boto3 resource  s3   s3bucket   s3 Bucket sys argv 1   size   0 totalCount   0  for key in s3bucket objects all        totalCount    1     size    key size  print  total size    print    3f GB     size 1 0 1024 1024 1024   print  total count    print totalCount

User · Answer

You can download and install s3 browser from http   s3browser com   When you select a bucket in the center right corner you can see the number of files in the bucket  But  the size it shows is incorrect in the current version    Gubs

User · Answer

Using AWS CLI  aws s3 ls s3   mybucket  --recursive   wc -l    or   aws cloudwatch get-metric-statistics     --namespace AWS S3 --metric-name NumberOfObjects     --dimensions Name BucketName Value BUCKETNAME                 Name StorageType Value AllStorageTypes     --start-time 2016-11-05T00 00 --end-time 2016-11-05T00 10     --period 60 --statistic Average   Note  The above cloudwatch command seems to work for some while not for others  Discussed here  https   forums aws amazon com thread jspa threadID 217050  Using AWS Web Console  You can look at cloudwatch s metric section to get approx number of objects stored    I have approx 50 Million products and it took more than an hour to count using aws s3 ls

User · Answer

Old thread  but still relevant as I was looking for the answer until I just figured this out   I wanted a file count using a GUI-based tool  i e  no code    I happen to already use a tool called 3Hub for drag  amp  drop transfers to and from S3   I wanted to know how many files I had in a particular bucket  I don t think billing breaks it down by buckets      So  using 3Hub   - list the contents of the bucket  looks basically like a finder or explorer window  - go to the bottom of the list  click  show all  - select all  ctrl a  - choose copy URLs from right-click menu - paste the list into a text file  I use TextWrangler for Mac   - look at the line count     I had 20521 files in the bucket and did the file count in less than a minute

User · Answer

You can easily get the total count and the history if you go to the s3 console  Management  tab and then click on  Metrics     Screen shot of the tab

User · Answer

Can also be done with gsutil du  Yes  a Google Cloud tool   gsutil du s3   mybucket    wc -l

User · Answer

There is a --summarize switch which includes bucket summary information   i e  number of objects  total size    Here s the correct answer using AWS cli   aws s3 ls s3   bucketName path  --recursive --summarize   grep  Total Objects    Total Objects  194273   See the documentation

User · Answer

As of November 18  2020 there is now an easier way to get this information without taxing your API requests  AWS S3 Storage Lens The default  built-in  free dashboard allows you to see the count for all buckets  or individual buckets under the  quot Buckets quot  tab  There are many drop downs to filter and sort almost any reasonable metric you would look for

User · Answer

I used the python script from scalablelogic com  adding in the count logging   Worked great      usr local bin python  import sys  from boto s3 connection import S3Connection  s3bucket   S3Connection   get bucket sys argv 1   size   0 totalCount   0  for key in s3bucket list        totalCount    1     size    key size  print  total size   print    3f GB     size 1 0 1024 1024 1024  print  total count   print totalCount

User · Answer

None of the APIs will give you a count because there really isn t any Amazon specific API to do that  You have to just run a list-contents and count the number of results that are returned

User · Answer

aws s3 ls s3   bucket-name folder-prefix-if-any --recursive   wc -l

User · Answer

There is an easy solution with the S3 API now  available in the AWS cli    aws s3api list-objects --bucket BUCKETNAME --output json --query   length Contents        or for a specific folder   aws s3api list-objects --bucket BUCKETNAME --prefix  folder subfolder   --output json --query   length Contents

User · Answer

Go to AWS Billing  then reports  then AWS Usage reports  Select Amazon Simple Storage Service  then Operation StandardStorage  Then you can download a CSV file that includes a UsageType of StorageObjectCount that lists the item count for each bucket

User · Answer

Although this is an old question  and feedback was provided in 2015  right now it s much simpler  as S3 Web Console has enabled a  Get Size  option     Which provides the following

[file] How can I tell how many objects I've stored in an S3 bucket?

Examples related to file

Examples related to count

Examples related to amazon-s3

Examples related to amazon-web-services