Add IPFS as a remote cache #3510

Draft · felipecruz91 wants to merge 1 commit into master
Conversation

@felipecruz91 commented Jan 16, 2023

Hi,

I'd like to open this PR to introduce IPFS as an additional remote cache backend for BuildKit. It contains a first implementation that exports and imports the build cache to/from IPFS and replicates it among all the peers.

Motivation

The main motivation is to explore new ways to achieve faster build times by leveraging IPFS as a remote, distributed cache of image blobs shared among multiple peers of a cluster.

Use-case

In a company, software developers working on the same project often build an application that has likely already been built by a teammate. Exporting the BuildKit cache to a remote registry is a convenient solution; however, downloading the cache can be time-consuming, especially when building large images.

By distributing the cache with IPFS across many cluster peers, the blocks that compose the blobs can be downloaded in parallel from multiple peers instead of from a single place (a remote registry).

Try me

  1. Set up an IPFS cluster if you don't have one. See this example.

  2. Create a builder from felipecruz/buildkit:ipfs-cluster:

     docker buildx create --name buildkitd-builder --driver docker-container --driver-opt image=felipecruz/buildkit:ipfs-cluster --use

  3. On one host, build an image and export its cache to IPFS:

     docker buildx build \
       --cache-to type=ipfs,cluster_api=192.168.65.2:9094,daemon_api=192.168.65.2:5001,mode=max \
       -t my-image .

  4. On another host, import the cache:

     docker buildx build \
       --cache-from type=ipfs,cluster_api=192.168.65.2:9094,daemon_api=192.168.65.2:5001,mode=max \
       -t my-image .

[image attachment]

/cc @tonistiigi @crazy-max @AkihiroSuda

@AkihiroSuda requested a review from @ktock · January 16, 2023 21:33
@AkihiroSuda (Member):
Can we just exec the Kubo binary to reduce the Go dependencies?
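
For reference, a minimal sketch of what an exec-based approach could look like, assuming the ipfs (Kubo) CLI is available in PATH; the ipfsAdd/ipfsCat helpers are hypothetical names, not code from this PR:

package ipfsutil

import (
	"bytes"
	"context"
	"io"
	"os/exec"
	"strings"
)

// ipfsAdd pipes the blob from r into `ipfs add -Q` and returns the resulting CID.
// -Q (quieter) makes the CLI print only the final CID.
func ipfsAdd(ctx context.Context, r io.Reader) (string, error) {
	var out bytes.Buffer
	cmd := exec.CommandContext(ctx, "ipfs", "add", "-Q")
	cmd.Stdin = r
	cmd.Stdout = &out
	if err := cmd.Run(); err != nil {
		return "", err
	}
	return strings.TrimSpace(out.String()), nil
}

// ipfsCat streams the blob identified by cid into w via `ipfs cat`.
func ipfsCat(ctx context.Context, cid string, w io.Writer) error {
	cmd := exec.CommandContext(ctx, "ipfs", "cat", cid)
	cmd.Stdout = w
	return cmd.Run()
}

This keeps the vendored dependency surface at zero, at the cost of shipping (or requiring) the Kubo binary alongside buildkitd.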

@tonistiigi (Member) commented Jan 16, 2023

> Can we just exec the Kubo binary to reduce the Go dependencies?

Could we check what the difference in binary and BuildKit image size would be for both cases? Also, maybe there is a smaller tool or a minimal build that we could include in the image.

@AkihiroSuda (Member):

> Can we just exec the Kubo binary to reduce the Go dependencies?

> Could we check what the difference in binary and BuildKit image size would be for both cases?

Not really for binary footprint, rather for avoiding the vendor hell.

@tonistiigi (Member):

> Not really for binary footprint, rather for avoiding the vendor hell.

Yes, but if including Kubo increases the binary (image) size too much compared to vendoring, then we might still prefer vendoring. We need to know the numbers (and whether there are alternatives).

@AkihiroSuda (Member):

> Not really for binary footprint, rather for avoiding the vendor hell.

> Yes, but if including Kubo increases the binary (image) size too much compared to vendoring, then we might still prefer vendoring. We need to know the numbers (and whether there are alternatives).

Another alternative would be to reimplement the IPFS API client with the stdlib net/http.
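
As a rough illustration of that alternative (a sketch, not part of this PR), fetching a blob from the Kubo HTTP RPC API needs nothing beyond net/http; daemonAPI stands for the daemon_api value such as 192.168.65.2:5001:

package ipfsutil

import (
	"context"
	"fmt"
	"io"
	"net/http"
	"net/url"
)

// catBlob streams the content behind cid to w by calling POST /api/v0/cat
// on the IPFS daemon API (the same endpoint the Go client libraries wrap).
func catBlob(ctx context.Context, daemonAPI, cid string, w io.Writer) error {
	u := fmt.Sprintf("http://%s/api/v0/cat?arg=%s", daemonAPI, url.QueryEscape(cid))
	req, err := http.NewRequestWithContext(ctx, http.MethodPost, u, nil)
	if err != nil {
		return err
	}
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("ipfs cat %s: unexpected status %s", cid, resp.Status)
	}
	_, err = io.Copy(w, resp.Body)
	return err
}

Uploading via /api/v0/add would additionally need a multipart/form-data body, but still only the standard library.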


Review thread on the following diff:

if !exists {
	layerDone := progress.OneOff(ctx, fmt.Sprintf("writing layer %s", l.Blob))
	dt, err := content.ReadBlob(ctx, dgstPair.Provider, dgstPair.Descriptor)

Collaborator:

Can we use io.Copy here instead of fully reading the layer to slice?
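
For illustration, a sketch of what the streaming variant could look like, using containerd's content.NewReader over a ReaderAt; w stands for whatever writer the layer is being exported to and is not part of the original snippet (io and content imports assumed):

// Stream the layer instead of buffering it fully in memory.
ra, err := dgstPair.Provider.ReaderAt(ctx, dgstPair.Descriptor)
if err != nil {
	return layerDone(err)
}
defer ra.Close()
// content.NewReader wraps the ReaderAt as a sequential io.Reader.
if _, err := io.Copy(w, content.NewReader(ra)); err != nil {
	return layerDone(err)
}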

Comment on lines +298 to +310
go func() {
	for {
		j, more := <-out
		if more {
			logrus.Debugf("added item: %+v", j)
			cid = j.Cid
		} else {
			logrus.Debugf("added all items")
			done <- true
			return
		}
	}
}()

Collaborator:

This should be cancellable via ctx?
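
One way to wire that up (a sketch only): select on ctx.Done() next to the channel receive so the goroutine exits when the build is cancelled; done would also need to be buffered (or guarded the same way) so the final send cannot block.

go func() {
	for {
		select {
		case <-ctx.Done():
			// Build was cancelled; stop collecting results.
			return
		case j, more := <-out:
			if !more {
				logrus.Debugf("added all items")
				done <- true
				return
			}
			logrus.Debugf("added item: %+v", j)
			cid = j.Cid
		}
	}
}()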

Review thread on the following diff:

}

logrus.Debugf("unpinning previous pin: %s\n", prevPinCID.Cid)
_, err = clusterClient.Unpin(ctx, *prevPinCID)

Collaborator:

Is it enough to unpin only one CID? Isn't it possible that multiple CIDs were previously associated with that pin name?
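
A hypothetical sketch of unpinning every previously associated CID; listPinsByName and pinName are made-up placeholders for whatever lookup the cluster API provides, and prevPins is assumed to have the same element type as *prevPinCID above:

// Unpin all CIDs previously associated with the pin name, not just the latest one.
prevPins, err := listPinsByName(ctx, clusterClient, pinName) // hypothetical helper
if err != nil {
	return err
}
for _, prev := range prevPins {
	logrus.Debugf("unpinning previous pin: %s", prev.Cid)
	if _, err := clusterClient.Unpin(ctx, prev); err != nil {
		return err
	}
}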

@felipecruz91 (Author) commented Jan 18, 2023

Find below the size comparison:

TL;DR

  • With vendoring, the buildkitd binary size increases from 50M to 66M (+32%).
  • If we were to ship both the ipfs (62.8M) and ipfs-cluster-ctl (32.5M) binaries as part of the moby/buildkit:latest image, the image size would grow from 168MB to 263.3MB (+56.7%).

See below for details of the binaries used in the comparison:


Details

Binary size without the IPFS implementation.

Branch: master
Commit: 983480b80ad82f98959b49b89bb2af0f84df72d9

Original:

make binaries
...

ls -lh ./bin/
...
-rwxr-xr-x@ 1 felipecruz  staff    50M 18 Jan 16:14 buildkitd

Binary size using IPFS Go libraries (vendoring)

Branch: feature/ipfs-cache
Commit: 492fc6d8b0485d98791c8d605b4cc8ea5a9181b7

make binaries
...

ls -lh ./bin/
...
-rwxr-xr-x@ 1 felipecruz  staff    66M 18 Jan 16:01 buildkitd

@AkihiroSuda (Member):

> If we were to ship both the ipfs (62.8M) and ipfs-cluster-ctl (32.5M) binaries as part of the moby/buildkit:latest image, the image size would grow from 168MB to 263.3MB (+56.7%).

Maybe these binaries should only be present in a separate image like moby/buildkit:vX.Y.Z-ipfs?

That might also be helpful for some enterprise companies that have "no P2P" policies.

@tonistiigi (Member):

> Maybe these binaries should only be present in a separate image like moby/buildkit:vX.Y.Z-ipfs?

If we do that, then it probably makes sense to also move the s3/azure backends to that image (cc @bpaquet), and to include things like Nydus, which is somewhat supported today but not in the release image, if it is ready (cc @hsiangkao).

@hsiangkao commented Jan 20, 2023

> Maybe these binaries should only be present in a separate image like moby/buildkit:vX.Y.Z-ipfs?

> If we do that, then it probably makes sense to also move the s3/azure backends to that image (cc @bpaquet), and to include things like Nydus, which is somewhat supported today but not in the release image, if it is ready (cc @hsiangkao).

Hi! Actually, I'm not responsible for the main Nydus implementation (part of my main job is in-kernel EROFS), so I'm CCing the proper Nydus people here. cc @imeoer @jiangliu @changweige

@imeoer (Contributor) commented Jan 20, 2023

@tonistiigi @AkihiroSuda If image size is a concern, Nydus is also ready to have its related binaries built into a separate image like moby/buildkit:vX.Y.Z-nydus.
