[arvados] Triggering Keep garbage collection

George Chlipala gchlip2 at uic.edu
Thu Mar 2 18:50:44 EST 2017


I tried running the keep-balance and it does not seem to delete the actual
blocks.  I am using the superuser token created during the installation,
i.e. via sudo -u apache RAILS_ENV=production bundle exec
./script/create_superuser_token.rb

When I set keep-balance.yml to the following:

Client:
  APIHost: "<REMOVED>"
  AuthToken: "<REMOVED>"
  Insecure: false
RunPeriod: 1h
KeepServiceTypes:
    - disk
CollectionBatchSize: 100000
CollectionBuffers: 1000

I recieve the following from keep-balance

[root at arvados-dev ~]# keep-balance -once -commit-trash
2017/03/02 17:38:17 Run: start
2017/03/02 17:38:17 skipping cridv-bi6l4-jq4hc7zhh8ipkvt with service type
"proxy"
2017/03/02 17:38:17 clearing existing trash lists, in case the new
rendezvous order differs from previous run
2017/03/02 17:38:17 cridv-bi6l4-pb945ioz1yrqkq9 (172.17.1.117:25107, disk):
send trash list: start
2017/03/02 17:38:17 cridv-bi6l4-42p7ovknp27lsj1 (172.17.1.121:25107, disk):
send trash list: start
2017/03/02 17:38:17 cridv-bi6l4-42p7ovknp27lsj1 (172.17.1.121:25107, disk):
send trash list: took 2.153563ms
2017/03/02 17:38:17 cridv-bi6l4-pb945ioz1yrqkq9 (172.17.1.117:25107, disk):
send trash list: took 2.200265ms
2017/03/02 17:38:17 cridv-bi6l4-42p7ovknp27lsj1 (172.17.1.121:25107, disk):
send trash list: request failed: {http  <nil> 172.17.1.121:25107 /trash
 %!s(bool=false)  }: 401 Unauthorized
2017/03/02 17:38:17 cridv-bi6l4-pb945ioz1yrqkq9 (172.17.1.117:25107, disk):
send trash list: request failed: {http  <nil> 172.17.1.117:25107 /trash
 %!s(bool=false)  }: 401 Unauthorized
2017/03/02 17:38:17 Run: took 123.919044ms
2017/03/02 17:38:17 cridv-bi6l4-pb945ioz1yrqkq9 (172.17.1.117:25107, disk):
send trash list: request failed: {http  <nil> 172.17.1.117:25107 /trash
 %!s(bool=false)  }: 401 Unauthorized

When I set the keep-balance.yml to the following:

Client:
  APIHost: "<REMOVED>"
  AuthToken: "<REMOVED>"
  Insecure: false
RunPeriod: 1h
KeepServiceTypes: null
CollectionBatchSize: 100000
CollectionBuffers: 1000

I receive the following from keep-balance

[root at arvados-dev ~]# keep-balance -once -commit-trash
2017/03/02 17:40:54 Run: start
2017/03/02 17:40:54 skipping cridv-bi6l4-42p7ovknp27lsj1 with service type
"disk"
2017/03/02 17:40:54 skipping cridv-bi6l4-jq4hc7zhh8ipkvt with service type
"proxy"
2017/03/02 17:40:54 skipping cridv-bi6l4-pb945ioz1yrqkq9 with service type
"disk"
2017/03/02 17:40:54 GetCurrentState: start
2017/03/02 17:40:54 collections: 0/21
2017/03/02 17:40:54 collections: 21/21
2017/03/02 17:40:54 collections: 21/21
2017/03/02 17:40:54 GetCurrentState: took 208.558523ms
2017/03/02 17:40:54 ComputeChangeSets: start
2017/03/02 17:40:54 ComputeChangeSets: took 515.354µs
2017/03/02 17:40:54 ===
2017/03/02 17:40:54 1118 replicas (559 blocks, 70393357406 bytes) lost
(0=have<want)
2017/03/02 17:40:54 0 replicas (0 blocks, 0 bytes) underreplicated
(0<have<want)
2017/03/02 17:40:54 0 replicas (0 blocks, 0 bytes) just right (have=want)
2017/03/02 17:40:54 0 replicas (0 blocks, 0 bytes) overreplicated
(have>want>0)
2017/03/02 17:40:54 0 replicas (0 blocks, 0 bytes) unreferenced
(have>want=0, new)
2017/03/02 17:40:54 0 replicas (0 blocks, 0 bytes) garbage (have>want=0,
old)
2017/03/02 17:40:54 ===
2017/03/02 17:40:54 1118 replicas (559 blocks, 70393357406 bytes) total
commitment (excluding unreferenced)
2017/03/02 17:40:54 0 replicas (0 blocks, 0 bytes) total usage
2017/03/02 17:40:54 ===
2017/03/02 17:40:54 ===
2017/03/02 17:40:54 Replication level distribution (counting N replicas on
a single server as N):
2017/03/02 17:40:54  0: 559
##########################################################
2017/03/02 17:40:54  1:   0
2017/03/02 17:40:54 ===
2017/03/02 17:40:54 Run: took 332.677865ms


Any help would be greatly appreciated.

Thanks!




George Chlipala, Ph.D.
Senior Research Specialist
Research Resources Center
University of Illinois at Chicago

phone: 312-413-1700
email: gchlip2 at uic.edu

On Mon, Feb 13, 2017 at 4:08 PM, George Chlipala <gchlip2 at uic.edu> wrote:

> Peter -
>
> Thanks for the information!
>
> Is there a recommended user to use for the API token?
>
>
>
> George Chlipala, Ph.D.
> Senior Research Specialist
> Research Resources Center
> University of Illinois at Chicago
>
> phone: 312-413-1700 <(312)%20413-1700>
> email: gchlip2 at uic.edu
>
> On Mon, Feb 13, 2017 at 1:45 PM, Peter Amstutz <
> peter.amstutz at curoverse.com> wrote:
>
>> Hi George,
>>
>> The garbage collection service is called "keep-balance" and is located
>> at arvados/services/keep-balance.  It is responsible for both deleting
>> unreferenced blocks and rebalancing blocks among storage servers.
>>
>> Unfortunately it doesn't seems to be properly documented.  It replaces
>> an older implementation called "data manager".
>>
>> Thanks,
>> Peter
>>
>>
>> On Mon, Feb 13, 2017 at 2:35 PM, George Chlipala <gchlip2 at uic.edu> wrote:
>> > Is there any information about tuning/configuring garbage collection
>> for the
>> > keep data manager?  Or is garbage collection not yet implemented?
>> >
>> >
>> > George Chlipala, Ph.D.
>> > Senior Research Specialist
>> > Research Resources Center
>> > University of Illinois at Chicago
>> >
>> > phone: 312-413-1700
>> > email: gchlip2 at uic.edu
>> >
>> > On Thu, Jan 26, 2017 at 11:57 AM, George Chlipala <gchlip2 at uic.edu>
>> wrote:
>> >>
>> >> Is a way to trigger the keep data manager to perform garbage
>> collection or
>> >> if it is possible to know/modify the garbage collection cycle?
>> >>
>> >> We were performing a deletion test from within arvados to see if the
>> keep
>> >> would delete blocks that are no longer used.  We had uploaded new data
>> and
>> >> saw that the disk usage for the keep store had increased by 8GB.
>> However,
>> >> when we deleted the collection from the arvados workbench, we did not
>> see
>> >> the disk usage decrease by the same amount.  Between the adding and
>> deletion
>> >> we did not upload any additional data that might reference the new
>> blocks as
>> >> well, so I would expect that the keep store should release that space.
>> >>
>> >> Any help in this matter would be greatly appreciated.
>> >>
>> >> Thanks!
>> >>
>> >> George Chlipala, Ph.D.
>> >> Senior Research Specialist
>> >> Research Resources Center
>> >> University of Illinois at Chicago
>> >>
>> >> phone: 312-413-1700
>> >> email: gchlip2 at uic.edu
>> >
>> >
>> >
>> > _______________________________________________
>> > arvados mailing list
>> > arvados at arvados.org
>> > http://lists.arvados.org/mailman/listinfo/arvados
>> >
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.arvados.org/pipermail/arvados/attachments/20170302/46e9f49d/attachment.html>


More information about the arvados mailing list