
Suggestion: More granular approach #12

Closed
mokagio opened this issue Mar 9, 2016 · 14 comments


mokagio commented Mar 9, 2016

Hey @guidomb, first of all nice gem! It was pretty easy to set it up and it worked straightaway. Great job.

I have a suggestion on how the tool could perform more efficiently when used across multiple projects.

What do you think of the idea of zipping and uploading the dependencies one by one, rather than the whole Carthage/Build folder?

Here's a scenario in which I think it would be useful:

Project #1 Cartfile.resolved:

github "foo/cool-library" "1.0.0"
github "bar/useful-library" "1.0.0"

Project #2 Cartfile.resolved:

github "foo/cool-library" "1.0.0"
github "baz/utils" "1.0.0"

Project #3 Cartfile.resolved:

github "bar/useful-library" "1.0.0"
github "baz/utils" "1.0.0"

Assuming that we were to set up the projects in order, 1 then 2 then 3, this is what would happen:

Project #1 Setup

> No cache available
> carthage bootstrap
> Setup cache

Project #2 Setup

> No cache available
> carthage bootstrap
> Setup cache

Project #3 Setup

> No cache available
> carthage bootstrap
> Setup cache

Each of the three projects had to perform carthage bootstrap and upload its cache.

If we were instead to have a cached version of each build dependency, this is what would happen:

Project #1 Setup

> No cached version of "foo/cool-library" available
> No cached version of "bar/useful-library" available
> carthage bootstrap
> Setup cached version of "foo/cool-library"
> Setup cached version of "bar/useful-library"

Project #2 Setup

> Found a cached version of "foo/cool-library".
> Downloading and unzipping "foo/cool-library" from cache.
> No cached version of "baz/utils" available
> carthage bootstrap
> Setup cached version of "baz/utils"

Project #3 Setup

> Found a cached version of "bar/useful-library".
> Downloading and unzipping "bar/useful-library" from cache.
> Found a cached version of "baz/utils".
> Downloading and unzipping "baz/utils" from cache.

In this scenario only Project 1 had to go through the whole carthage bootstrap process, and Project 3 was simply able to download cached builds.

This means that "at scale" each project using carthage_cache would tap into the cache of the others.

There is a gotcha: given the current behaviour of Carthage, if one of the dependencies is not cached, carthage bootstrap would have to be run anyway. I'm confident we can find a way to work around this issue, or maybe turn a blind eye to it for the moment.
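
To make this concrete, here is a rough sketch of what the per-dependency restore could look like on the client (the cache object and its methods are hypothetical, invented just to illustrate the flow; this is not carthage_cache's current API):

def restore_granular_cache(resolved_dependencies, cache)
  # Split the resolved dependencies into those we can restore and those we can't.
  cached, missing = resolved_dependencies.partition { |dep| cache.archive_exists?(dep) }

  # Download and unzip each cached dependency straight into Carthage/Build.
  cached.each { |dep| cache.download_and_unzip(dep) }

  return if missing.empty?

  # Fall back to a full bootstrap for anything that wasn't cached, then
  # publish archives for it so the next project can reuse them.
  system("carthage", "bootstrap") or raise "carthage bootstrap failed"
  missing.each { |dep| cache.create_and_upload_archive(dep) }
end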

What's your take?

Cheers 🍻


mokagio commented Mar 9, 2016

@guidomb I forgot to say, if you like the idea I'm happy to help with the implementation 😄


guidomb commented Mar 9, 2016

I like the idea. For the gotcha you mentioned, the easiest solution I can think of is keeping the current behavior (uploading a zip of the whole Carthage/Build directory) and also uploading a zip for each dependency, as you suggested. CarthageCache would then first try to find a cache for the whole Cartfile.resolved; if it doesn't find one, it would try to find a cache for each dependency. For the ones it can't find (let's say dependencies A, B and C) it could run carthage bootstrap A B C and only download the missing ones.

This has the problem of uploading more files. The other option would be to just upload the Carthage/Build zip and then have an AWS Lambda function that unzips it and checks whether it needs to generate a zip for each individual dependency.

The third option would be to upload every dependency cache individually and create a small web service that, given a Cartfile.resolved, assembles the appropriate Carthage/Build cache and, in case of missing dependencies, tells you which ones don't have a cache so the client can build them. This service could also cache the assembled Carthage/Build zip in S3.
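
To make the first option concrete, the client-side lookup order might look roughly like this (again a sketch with invented method names, not the gem's real interface):

def restore(cartfile_resolved, cache)
  # Fast path: a single archive keyed on the whole Cartfile.resolved, as today.
  return cache.download_whole_build(cartfile_resolved) if cache.whole_build_cached?(cartfile_resolved)

  # Slow path: restore dependency by dependency, collecting whatever is missing.
  missing = cartfile_resolved.dependencies.reject do |dep|
    cache.archive_exists?(dep) && cache.download_and_unzip(dep)
  end

  missing # e.g. pass these to carthage bootstrap, or just report them
end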

What do you think?


mokagio commented Mar 9, 2016

Hey @guidomb, I'm glad you like the idea 😄

I like the idea of doing the minimum amount of work on the client and leveraging a Lambda function to do the more advanced stuff on the bucket.

Having said that, I think a good first step would be to implement your first suggestion: uploading both the single dependencies and the whole build folder.

I think we could put it in place relatively quickly, and we'd have a working version that we could then improve with the Lambda. We would need to put some of that logic in place anyway, e.g. unzipping single dependencies, for it to work.

How does that sound? I'll start working on some related tasks straightaway.


guidomb commented Mar 11, 2016

If you want to start working on this I'd be more than happy. I would like to have solid code coverage to make sure we don't break anything (I'm going to try to work on this over the weekend). If you go for the client-based approach, I think the user should be able to opt out of this feature to avoid the extra transfer. Maybe the best approach would be to turn it off by default and add a command flag and/or an attribute in the .carthage-cache.yml file to turn it on.

I'm going to do some research on the AWS approach because I've wanted to use Lambda functions in a real-world scenario and haven't had the chance yet.

I think we could create a folder for each user / organization slug and then a zip file with the name of the dependency and the commit or tag as it appears in the Cartfile.resolved. For example, let's say we have the following Cartfile.resolved:

github "Foo/LibA" "0.5.0"
github "Foo/LibB" "f7c8c3c939a75349ea912bd48ad09b3a64585e59"
github "Bar/LibC" "master"

then in the S3 bucket we would have the following folder structure

 |
 |----> Foo
 |           |
 |           |------> LibA-0.5.0.zip
 |           |
 |           |------> LibB-f7c8c3c939a75349ea912bd48ad09b3a64585e59.zip
 |
 |----> Bar
             |
             |-------> LibC-master.zip

The approach could be to first try to find a zip file for all the dependencies, as it works now, and if there is no archive then try to fetch each dependency individually. Finally, if there are still missing dependencies we could either execute carthage bootstrap MissingLibA MissingLibB or just print the list and let the caller decide what to do next.
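
A small sketch of how that key scheme could be derived from a Cartfile.resolved line (a hypothetical helper, shown only to illustrate the naming convention):

def archive_key_for(resolved_line)
  # e.g. 'github "Foo/LibA" "0.5.0"' or 'git "git@github.com:Bar/LibC.git" "master"'
  origin, version = resolved_line.scan(/"([^"]+)"/).flatten
  slug = origin[%r{([^/:]+/[^/]+?)(\.git)?\z}, 1] # "Foo/LibA" from either form
  owner, name = slug.split("/")
  "#{owner}/#{name}-#{version}.zip"
end

archive_key_for('github "Foo/LibA" "0.5.0"')                  # => "Foo/LibA-0.5.0.zip"
archive_key_for('git "git@github.com:Bar/LibC.git" "master"') # => "Bar/LibC-master.zip"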

What do you think?


mokagio commented Mar 11, 2016

Good stuff!

I would like to have solid code coverage to make sure we don't break anything (I'm going to try to work on this over the weekend).

I'm a big TDD fan myself, wouldn't have it any other way 😃

If you go for the client-based approach, I think the user should be able to opt out of this feature to avoid the extra transfer. Maybe the best approach would be to turn it off by default and add a command flag and/or an attribute in the .carthage-cache.yml file to turn it on.

I think it makes sense. For example, --granular-cache and granular_cache: true.

+1 on all the rest, only two things:

  1. What would be the value of having a folder per user and organization, rather than naming the files Foo-LibA-0.5.0.zip and so on? Easier management of the bucket?
  2. I am not sure that carthage bootstrap MissingLibA MissingLibB actually works 😳. Regardless, I would let the caller decide what to do next; it seems like a safer approach.

FYI, I thought I'd be able to start working on this soon, but it might not be until the middle of next week.


guidomb commented Mar 11, 2016

I thought having the folders would make it easier to navigate using the AWS UI, but whatever you think is best or easier works for me (as far as I'm concerned, creating folders in S3 is just prepending the folder name when you set the name of the file).

Yeah, bootstrap probably doesn't work that way, so let's list the dependencies and let the user decide what to do.


guidomb commented Mar 14, 2016

I've just added some tests to get us covered -> https://codeclimate.com/github/guidomb/carthage_cache/coverage


mokagio commented Mar 14, 2016

Fantastic ✨

I should be able to start working on this before the end of the week. Sorry about the delay 😳


guidomb commented Mar 14, 2016

No need to be sorry. Thanks for contributing!


mokagio commented Mar 22, 2016

So I've been spending a bit of time on this. I have some code I could use to open a PR, but... I bumped into an issue which I think might compromise the whole granular cache idea.

Carthage doesn't provide a bidirectional mapping between built framework names and Cartfile.resolved entries, which would make generating and downloading archives for certain types of single dependencies impossible.

For example, given this:

# Cartfile

github "realm/realm-cocoa"
git "git@github.com:danielgindi/ios-charts.git"

---

# Cartfile.resolved

git "git@github.com:danielgindi/ios-charts.git" "v2.2.3"
github "realm/realm-cocoa" "v0.98.5"

Running carthage bootstrap, the generated folder structure is:

Carthage/
  Build/
    iOS/
      Charts.framework
      Charts.framework.dSYM
      ChartsRealm.framework
      ChartsRealm.framework.dSYM
      Realm.framework
      Realm.framework.dSYM
      RealmSwift.framework
      RealmSwift.framework.dSYM
    Mac/
      Charts.framework
      Charts.framework.dSYM
      Realm.framework
      Realm.framework.dSYM
      RealmSwift.framework
      RealmSwift.framework.dSYM
    tvOS/
      Charts.framework
      Charts.framework.dSYM
      ChartsRealm.framework
      ChartsRealm.framework.dSYM
      Realm.framework
      Realm.framework.dSYM
      RealmSwift.framework
      RealmSwift.framework.dSYM
    watchOS/
      Realm.framework
      Realm.framework.dSYM
      RealmSwift.framework
      RealmSwift.framework.dSYM

I don't see any robust way to go from a Cartfile entry to the built frameworks, so I don't know how much value the granular cache would actually provide.

On top of that, in the few tests I've run, downloading the whole cache versus the single dependency archives takes a comparable amount of time.

My take is that, given carthage update is usually orders of magnitude less frequent than a CI build triggered by an SCM check-in, the value provided by a granular cache is not that great, to be fair.

How do you feel about it?

I feel quite dumb for not realizing this earlier... On the other hand, I've run carthage_cache on my client's CI for more than a week and everyone is happy with the performance improvement already 😄


guidomb commented Mar 23, 2016

I think you have a good point, and any attempt to implement such a feature in a robust way will probably make carthage_cache considerably more complex. We can close this for now and re-open it if anyone is interested.

guidomb closed this as completed Mar 23, 2016

Jeehut commented Sep 2, 2016

I am interested! :)

Well, I've just read through all your comments and got quite excited until the last message from @mokagio where he says it doesn't add much real-world value. But it does for us: we frequently run into a situation where a project has apps for different platforms (say iOS, tvOS, macOS) and we want to share as much logic as possible between them. So our apps for the different platforms merely serve as the UI for a framework that holds most of the logic, which we integrate using Carthage.

Now when we need to make a change in the framework for a new feature in the iOS app, we need to update the framework project and rebuild it using Carthage. The shared framework of course has its own dependencies (also included via Carthage), and those are installed each time we make a change in the shared framework and want to update all apps (iOS, tvOS and macOS). So we end up in a mess like this:

-- iOS Project
---- Dependency A
---- Dependency B
---- Dependency Shared Framework
-- macOS Project
---- Dependency A
---- Dependency B
---- Dependency Shared Framework
-- tvOS Project
---- Dependency A
---- Dependency B
---- Dependency Shared Framework
-- Shared Framework
---- Dependency C
---- Dependency D
---- Dependency E
---- Dependency F

Now when we make a change in the shared framework, we need to update the dependencies for each platform. Although we use carthage update SharedFramework --platform ios to build only that one dependency for a single platform, it still builds all the sub-dependencies (C, D, E and F) again. Using carthage_cache as it currently is doesn't solve this problem, but a granular approach would: only the changed framework would need to be built.

I hope I could explain the situation where this is needed.


As for the rest, I have one comment to make:

I don't see any robust way to go from a Cartfile entry to the built frameworks, so I don't know how much value the granular cache would actually provide.

Well, Carthage happens to make that connection somehow, doesn't it? And the good thing is that Carthage is open source, so we can have a look at the code. I think, basically, Carthage makes a web request to the Git project and has a look at the xcschemes – we could do the same, using the exact same method Carthage does. So that shouldn't be a big deal.
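
For example, one way to approximate that lookup would be to list the shared schemes in a dependency's checkout, which is the same information Carthage builds from (a sketch under the assumption that scheme names roughly correspond to the produced frameworks; the helper is made up):

def shared_schemes_for(dependency, checkouts_dir = "Carthage/Checkouts")
  # Shared schemes live under <Project>.xcodeproj/xcshareddata/xcschemes/.
  pattern = File.join(checkouts_dir, dependency,
                      "**", "*.xcodeproj", "xcshareddata", "xcschemes", "*.xcscheme")
  Dir.glob(pattern).map { |path| File.basename(path, ".xcscheme") }.uniq
end

shared_schemes_for("realm-cocoa") # e.g. ["Realm", "RealmSwift", ...]

Scheme names still don't always match the framework names that end up in Carthage/Build (which is the mapping problem @mokagio ran into), but it is the same source of truth Carthage itself uses to decide what to build.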


guidomb commented Sep 5, 2016

@Dschee We also have shared internal libraries, but we haven't had this issue because they don't change that often.

In one particular project that required frequent changes to the shared library, we ended up using the master branch and git submodules for all dependencies in that project. That solved the issue of having to build all the dependencies every time we changed the shared library. This may or may not suit your needs.

I created this tool to solve the pain points of our particular development flow. Currently I only add features that are small and don't add too much complexity to the tool, or that add significant value to all users if they do. That being said, I'd be more than happy to consider adding such a feature if you manage to implement it in a robust way. Personally I have neither the time nor the need to implement it, but I'm happy to give you feedback and review a PR.


guidomb commented Feb 3, 2017

@mokagio This has come to life again in #35
