Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-integrate jpm into janet #1444

Open
tionis opened this issue May 6, 2024 · 57 comments
Open

Re-integrate jpm into janet #1444

tionis opened this issue May 6, 2024 · 57 comments

Comments

@tionis
Copy link
Contributor

tionis commented May 6, 2024

Many janet installs out there are broken or only partially correct, simplifying the setup might help improve this.
To quote @bakpakin :

Really just making the installation easier would be a good start. One major issue is that non standard, broken installs are pretty common - the easiest way to make that not possible is to have just one binary - Janet

Any approach bringing a tighter integration with the runtime and package management should reduce possible misconfigurations and make it simpler for people to get started with janet.

There are still a few open questions like:

  • should the package manager be part of janet (in a single binary) or installed as janet script with the default installation?
  • should all components of jpm be kept as is or are there some paint points that could be addressed with such a migration
  • how to maintain backwards compatibility with current projects?
@sogaiu
Copy link
Contributor

sogaiu commented May 7, 2024

I think having something like jpm be bundled with janet has merits.

I'm not so sure about trying to bring jpm itself back in though. Possibly a new thing might be better? Haven't thought too deeply about it though.

@pepe
Copy link
Member

pepe commented May 7, 2024

Once mentioned idea of moving jpm into spork and making spork part of the janet build and release sounds the best to me.

@bakpakin
Copy link
Member

bakpakin commented May 7, 2024

I'm not so sure about trying to bring jpm itself back in though. Possibly a new thing might be better? Haven't thought too deeply about it though.

I agree with that. jpm has usablility problems that make that not ideal.

So current jpm issues as I see it:

  • Hard to install, harder to configure
    • too many env variables, to easy to make a partially working installation
  • project.janet format is informal and is just a slightly modified janet script
  • A baked-in C compilation flow is not compatible with all projects

I guess I don't think the entirety of jpm should be in Janet, but I think it would be interesting if janet itself had some simple functions for modifying the contents of it's own syspath. The flow could be something like this:

$ janet
Janet 1.34.0-0da640ef linux/x64/gcc - '(doc)' for help
repl:1:> (bundle/list) # traverse (dyn *syspath*) and show install packages
nil
repl:2:> (bundle/install-dir "/home/crose/my-local-project") # install from a directory
Doing local install...
...
done!
nil
repl:3:> (bundle/install-git "http://github.com/janet-lang/spork.git") # download spork using CLI git
Doing git install...
...
done!
repl:4:> (bundle/list)
@["spork", "my-local-project"]
repl:5:>

This kind of flow is pretty common in repl-first languages, and avoids some of the annoying and complicated setup of jpm, where it needs to be installed system wide in a compatible way to Janet.

Configuration would be done solely by where (dyn *syspath*) pointed to. Basically, Janet would just get more functions for dealing with (dyn *syspath*)

I also don't think "installation" into a tree should be that complicated by default - a git project that was installable this way could just be a project that had a bundle/setup.janet, bundle/test.janet, bundle/morestuff.janet, etc.

Lastly, the make-style build system built into jpm can be nice but I think is something that could just be left in spork if people needed it - most projects I've seen really don't need it.

Curious on peoples thought

@bakpakin
Copy link
Member

bakpakin commented May 7, 2024

The hardest part is of course an opinionated C build system (which is actually what jpm started out as, and then morphed into a bad package manager), but can come later and be a package itself (probably part of spork to make things simpler) as a module that would just help users find compilers, linkers, libraries and headers on their system.

@tionis
Copy link
Contributor Author

tionis commented May 7, 2024

I like the ideas of having a few simple primitives for module management etc. in janet itself and keeping more complicated parts in spork to import as needed.
Perhaps more complicated projects could import from spork/build in some setup script to compile deps?
That begs the question though if spork should be kept as is or made part of janet releases (by, e.g., prepopulating the syspath with it)

@sogaiu
Copy link
Contributor

sogaiu commented May 8, 2024

I guess I don't think the entirety of jpm should be in Janet

...

the make-style build system built into jpm can be nice but I think is something that could just be left in spork if people needed it - most projects I've seen really don't need it.

Interested in this kind of decoupling along with having some key feature(s) make it into janet.

Having build system bits be in spork or elsewhere might be nice for extensibility and experimentation by 3rd parties.

@sogaiu
Copy link
Contributor

sogaiu commented May 8, 2024

An observation about the specific example repl session above...

IIUC, it demos the idea of being able to install spork with just an installation of janet. ATM, I think a C compiler is required in the case of spork because it has native modules.

Possibly the janet-built-in stuff can use a C compiler if one is detected and bail otherwise?

@bakpakin
Copy link
Member

bakpakin commented May 11, 2024

An observation about the specific example repl session above...

IIUC, it demos the idea of being able to install spork with just an installation of janet. ATM, I think a C compiler is required in the case of spork because it has native modules.

Possibly the janet-built-in stuff can use a C compiler if one is detected and bail otherwise?

So spork already actually contains some of the needed functionality for building C modules (see spork/cc) - my idea is that we would eventually have sport be able to self bootstrap with it's own utilities.

EDIT: as far as installing jpm without a C compiler, also a possibility but a number of modules depend on other native modules like json.

@bakpakin
Copy link
Member

bakpakin commented May 12, 2024

So I was able to a working version of the bundle/ proposal with spork such that spork could build and install itself to (dyn *syspath*). This initial version of this bundle module lives currently in https://github.com/janet-lang/janet/tree/bundle-tools

An example of what the spork hooks look like can be found here: https://github.com/janet-lang/spork/tree/bundle-hooks/hooks

Usage would be something like the following:

$ git clone --single-branch --branch bundle-hooks https://github.com/janet-lang/spork.git
...
$ janet
Janet 1.34.0-60e22d97 linux/x64/gcc - '(doc)' for help
repl:1:> (bundle/install "/home/crose/code/spork") # directory where git clone lives
running /home/crose/.hooks/spork/build.janet for bundle spork
compiling src/crc.c...
linking build/crc.so...
compiling src/json.c...
linking build/json.so...
compiling src/cmath.c...
linking build/cmath.so...
compiling src/utf8.c...
linking build/utf8.so...
compiling src/rawterm.c...
linking build/rawterm.so...
compiling src/tarray.c...
linking build/tarray.so...
compiling src/zip.c...
compiling deps/miniz/miniz.c...
linking build/zip.so...
running /home/crose/.hooks/spork/install.janet for bundle spork
adding /home/crose/spork
adding /home/crose/tarray.h
adding /home/crose/spork/schema.janet
adding /home/crose/spork/base64.janet
adding /home/crose/spork/test.janet
adding /home/crose/spork/argparse.janet
adding /home/crose/spork/pgp.janet
adding /home/crose/spork/path.janet
adding /home/crose/spork/cron.janet
adding /home/crose/spork/misc.janet
adding /home/crose/spork/msg.janet
adding /home/crose/spork/mdz.janet
adding /home/crose/spork/data.janet
adding /home/crose/spork/randgen.janet
adding /home/crose/spork/tasker.janet
adding /home/crose/spork/getline.janet
adding /home/crose/spork/cjanet.janet
adding /home/crose/spork/rpc.janet
adding /home/crose/spork/htmlgen.janet
adding /home/crose/spork/build-rules.janet
adding /home/crose/spork/regex.janet
adding /home/crose/spork/stream.janet
adding /home/crose/spork/math.janet
adding /home/crose/spork/init.janet
adding /home/crose/spork/ev-utils.janet
adding /home/crose/spork/temple.janet
adding /home/crose/spork/cc.janet
adding /home/crose/spork/netrepl.janet
adding /home/crose/spork/http.janet
adding /home/crose/spork/fmt.janet
adding /home/crose/spork/channel.janet
adding /home/crose/spork/httpf.janet
adding /home/crose/spork/sh.janet
adding /home/crose/spork/generators.janet
adding /home/crose/spork/services.janet
adding /home/crose/spork/cmath.so
adding /home/crose/spork/rawterm.so
adding /home/crose/spork/crc.so
adding /home/crose/spork/utf8.so
adding /home/crose/spork/zip.so
adding /home/crose/spork/tarray.so
adding /home/crose/spork/json.so
nil
repl:2:>

Curious on feedback here - I know this does not include all of the features that jpm has currently, by design.

This could also be put into spork rather than Janet as there really isn't anything here that needs integration with janet itself, but keeping this interface for installing packages (bundles) inside libjanet means we can more easily install packages without requiring a big, external library like spork or jpm.

@tionis
Copy link
Contributor Author

tionis commented May 12, 2024

I just tried it, and it seems to be working well for me.
I do wonder however how updates should be handled as it currently refuses to override an existing module.

@sogaiu
Copy link
Contributor

sogaiu commented May 13, 2024

bakpakin wrote:

keeping this interface for installing packages (bundles) inside libjanet means we can more easily install packages without requiring a big, external library like spork or jpm.

This sounds good to me.


tionis wrote:

I do wonder however how updates should be handled as it currently refuses to override an existing module.

You mean this sort of thing?

repl:3:> (bundle/install "./spork")
error: bundle is already installed
  in bundle/install [boot.janet] on line 4064, column 5
  in _thunk [repl] (tailcall) on line 3, column 1

On a side note, bundle/uninstall and bundle/list seemed to work for me too.

@bakpakin
Copy link
Member

Did some more work on the branch, and added bundle/reinstall and bundle/pack for updating bundles from local disk, as well as changed the way files are stored. Also taking some care to make sure that installation will properly rollback if there is an issue during the installation. None of the format are interface are final.

All that said, I can see functionality here growing and growing and growing, and at some point a line needs to be drawn. Not only should the functionality be simple but genuinely useful, the implementation needs to be fairly simple.

@sogaiu
Copy link
Contributor

sogaiu commented May 14, 2024

Did some more work on the branch, and added bundle/reinstall and bundle/pack for updating bundles from local disk, as well as changed the way files are stored.

Tried out briefly -- seemed to work here.

I can see functionality here growing and growing and growing, and at some point a line needs to be drawn. Not only should the functionality be simple but genuinely useful, the implementation needs to be fairly simple.

I agree about these endpoint characteristics -- wonder how to get there though (^^;

@sogaiu
Copy link
Contributor

sogaiu commented May 14, 2024

Tried it briefly on Windows.

Encountered at least one issue with bundle/install -- it looked path-related (sorry, I don't have the error output ATM).

@bakpakin
Copy link
Member

I haven't tested any of this on windows but I wouldn't be surprised if there are path issues.

@pyrmont
Copy link
Contributor

pyrmont commented May 14, 2024

@bakpakin Is the idea that module authors will include hooks files in the module to run on build and install? Or is this just an aspect of the proof of concept that will have improved UX later on? I will say that I like the way modules all currently have a project.janet file and you have one place to look to understand a module.

@bakpakin
Copy link
Member

This is all proof of concept, having each "hook" be a function inside a single "project" file in some ways makes more sense. I will likely change over to this way, and was playing with this locally. However, one of the issues we ran into with JPM was how to evaluate such files before dependencies were installed - separate files avoids that problem.

I'm hesitant to commit even to this whole idea since JPM does already work, but I do want to explore it.

@bakpakin
Copy link
Member

@bakpakin Is the idea that module authors will include hooks files in the module to run on build and install? Or is this just an aspect of the proof of concept that will have improved UX later on? I will say that I like the way modules all currently have a project.janet file and you have one place to look to understand a module.

The idea was that authors would include their own hooks, but UX would certainly be improved. However, not sure the same level of "declarativeness" as is in JPM is likely.

@pyrmont
Copy link
Contributor

pyrmont commented May 14, 2024

@bakpakin I realise it's annoying when people compare Janet to Clojure and so at the risk of doing that in a way that turns you off completely, you could do something similar to what Clojure did when they introduced tools.deps: support a 'data only' file (deps.edn) that contains the information necessary for the built-in dependency installation tool.

Unlike Clojure, Janet's data-only file (project.jdn?) could also contain module metadata (similar to declare-project) as well as paths for build and install hook files.

Something like:

{
 :name "Foo"
 :description "The best module"
 :author "Janet Doe"
 :license "MIT"
 :url "https://git.example.org/foo"
 :repo "git+https://git.example.org/foo"
 :deps [“https://git.example.org/bar”]
 :src [“foo/a.janet”
       “foo/b.janet”]
 :build-file "./build.janet" # potentially this is only required if you want to use a different file from the default
 :install-file "./install.janet" # same as above
}

JPM could then be updated to look for this file if there is no project.janet file (or perhaps look for it first and then fall back to a project.janet). A 'data only' file like this could easily be evaluated without needing any other dependencies to be loaded first.

@CFiggers
Copy link

I like the idea of a project.jdn file. On top of project info, dependencies, and build recipes, it sounds pretty useful for tooling to tap into as well—if arbitrary metadata can be added to the project.jdn file as long as it doesn't collide with anybody else's stuff, then I can very easily use that as a canonical location to store small stateful per-project config options for CLI tools, Janet LSP, or editor plugins like Janet++.

If we went that route, we might want to settle on some clear conventions (maybe enforced by the Janet binary to one extent or other?) about how to include "third party" project metadata consistently.

@bakpakin
Copy link
Member

I realise it's annoying when people compare Janet to Clojure and so at the risk of doing that in a way that turns you off completely, you could do something similar to what Clojure did when they introduced tools.deps: support a 'data only' file (deps.edn) that contains the information necessary for the built-in dependency installation tool.

Not at all, that is a good comparison and a valid point. The data only file would allow more universal processing of bundles, and is relatively low effort to implement. I don't expect we would do much processing of this data in the bundle/ module besides move it around uncorrupted - not even sure we want to have the capacity to fetch remote dependencies over a network.

Currently there really isn't even the concept of dependencies - the user would be responsible for providing them. There are a lot of ways to package and distribute software versions and dependencies that are not quite orthogonal, (npm/cargo/go style language package managers, system package managers, version control (git submodules, monorepos), containers, VMs, tarballs,
disk images, linux file system conventions, etc. The longer I do this, the more I prefer version control for dependency management (sometimes called "vendoring") so I can get perfect control over code I am building/running, but people have different wants here. Frankly, git is way better at tracking versions than npm, cargo, etc. I realize this is not exactly the most mainstream opinion.

One thing though that might be feasible and useful though is a way to check all installed bundles and make sure that they are working with all other installed bundles, basically a "check-installation" hook where authors of packages could enforce any constraints they wanted on the environment, and raise a (nice) error if there is a problem. For example, if a software author chooses to use a dependency that releases using semver, incremental version numbers, date ranges, etc., then they could do something like:

# check if sporky installed with compatible version
(check-semver-for-installed-bundle "sporky" ">1.2.3,<1.2.8")
# check if sparky is installed with the correct git hash
(require-exact-git-hash "sparky" "abc1234")
# check if spot is installed and built within the specified dates
(require-build-date-range "spot" "2023-05-06" "2024-01-28")
# or use feature checking
(require-import-and-binding "spork/zig" zig-compile-and-link-v2)
# or only warn on mismatched bundle
(warn-exact-git-hash "spark" "abc1234" "Only git hash abc1234 has been tested and verified as dependency for this version")

# And finally run a simple smoke test
(import spot)
(spot/smoke-test)

inside a hook. However, trying to automatically "resolve" versioning conflicts and search for the best version of packages for compatibility is something that should be up to users. Or at least, outside the scope of bundle/.

@pyrmont
Copy link
Contributor

pyrmont commented May 14, 2024

I don't expect we would do much processing of this data in the bundle/ module besides move it around uncorrupted - not even sure we want to have the capacity to fetch remote dependencies over a network.

Even if you're using vendoring as a way to distribute code together with its dependencies, you still need some way to get the dependencies into the project tree in the first place and if you want to make that something you can do from the REPL, downloading remote dependencies is still an important piece of that, isn't it?

@tionis
Copy link
Contributor Author

tionis commented May 14, 2024

Well in the case of vendoring I would think that the dependencies would be managed using git-subtree, git-subrepo, git submodules or maybe just by extracting an archive and commiting it.

@amano-kenji
Copy link
Contributor

amano-kenji commented May 15, 2024

Vendoring increases repository size. Even though rust statically links a binary, it doesn't force repositories to carry dependencies. Haskell allows a program to depend on functional packages installed onto the system or ~/.cabal.

I prefer functional packages that you can see in haskell, guix, and nix.

I think haskell dependency model is a role model for janet.

@amano-kenji
Copy link
Contributor

Git submodules increase size of archives.

@bakpakin
Copy link
Member

I don't expect we would do much processing of this data in the bundle/ module besides move it around uncorrupted - not even sure we want to have the capacity to fetch remote dependencies over a network.

Even if you're using vendoring as a way to distribute code together with its dependencies, you still need some way to get the dependencies into the project tree in the first place and if you want to make that something you can do from the REPL, downloading remote dependencies is still an important piece of that, isn't it?

Yes, but doing so is actually fairly simple. E.g. with git:

(defn bundle/install-git
   "Install from git"
   [repo-url name]
   (def tempdir (string (dyn *syspath*) "/" (os/cryptorand 16) ".git"))
   (os/mkdir tempdir)
   (defer (os/execute ["rm" "-rf" "--" tempdir] :px)
     (os/execute ["git" "clone" "--depth=1" repo-url tempdir] :px)
     (bundle/install tempdir name))
   name)

You could do similar with mercurial, curl + tar, svn, cvs, perforce, etc. However, the only lightweight way to embed into janet is to shell out, so I'm hesitant to include it in Janet itself.

@bakpakin
Copy link
Member

Vendoring increases repoisitory size. Even though rust statically links a binary, it doesn't force repositories to carry dependencies. Haskell allows a program to depend on functional packages installed onto the system or ~/.cabal.

I prefer functional packages that you can see in haskell, guix, and nix.

I think haskell dependency model is a role model for janet.

So Haskell + Cabal seems to use something called PVP, which seems like a slightly different flavor of semantic versioning.

As for vendoring, it will not be the only way to do things, but I would encourage it and it needs to be something that could be officially well supported.

bundle/install doesn't really do much besides optionally build things like shared objects then copy a minimal set of files to (dyn syspath). Notably, we don't just blindly copy the entire repository into the syspath. You should also be able to delete the original source code, uninstall bundles, prevent collisions of installed files and bad states, and be resistant to messing up the syspath.

The idea being that dependency resolution and fetching sources/binaries over a network sits at a layer above this.

@amano-kenji
Copy link
Contributor

I'm not particularly fond of PvP. I prefer semantic versioning for its simplicity.

@llmII
Copy link
Contributor

llmII commented May 15, 2024

@bakpakin wrote:

The idea being that dependency resolution and fetching sources/binaries over a network sits at a layer above this.

I agree regarding those should be at a separate layer.

I tend to think of package installation as a flow of steps:

  1. Fetch
  2. Extract (optional, sort of a tarball only thing)
  3. Build
  4. Install to path

Fetch should likely be something along the lines of a dispatcher that sees an URI and finds a URI handler that exists for it and calls it to execute a fetch. Build should probably have some nice "shortcuts" available (maybe in spork or something) but is entirely package defined. Extract probably needs a "what's that file end with" dispatcher.

I think having bundle/install or whatever the end name is handle steps 3 & 4 (Build, Install) makes great sense, and that perhaps in the future there would be a bundle/fetch with dispatching support as well as a bundle/extract that is ran when what is fetched happens to be an archive. There is also the issue of if the package decides to find the install directory and copy stuff over to that location manually. Protecting against malicious use of the bundle/install call is difficult, no idea how that'd be done aside from chroot or making sure the install directory is owned by a certain user that isn't what the Build step is ran under.

Dependency and conflict resolution is a difficult thing and it'd be nice if there was something like a bundle/process that read a file and did all the steps listed above for each dependency with each thing having the correct versions... recursively. Unwinding that in the event of failure is difficult unless the "Build" step happens in a "staged" directory, in which case it just deletes the directory.

Having Janet able to manage installing packages out of the box sounds like a pretty good idea, but once someone needs more than that, perhaps a library (spork, install-utils, whatever) to provide for managing such at a higher level, handling all the fetching, dependencies, conflicts, and so forth would be the best of both, especially if that library is self contained with no other dependencies other than Janet so it'd be as simple as fetching it manually and doing (bundle/install package).

The one issue I see is, if Janet has a bundle/uninstall and it doesn't do dependency and conflict resolution (as well as bundle/upgrade or similar) then it'll be quite easy to screw up dependencies or versions. The only thing I can think of regarding this is that all the bundle/* are "low-level" calls, and once someone starts using something like the bundle/process thing above I mentioned, then it's up to the user to remember to continue using such for all further tasks? That sounds pretty bad, maybe someone will have a better idea. In general, the idea of what goes on in a lower level and a higher level might need a lot more fleshing out.

On another note... something like bundle/installed? could be useful, a user could have a script that isn't really a package itself and do stuff like

(when (not (bundle/installed? "package")) 
  (bundle/install package))

Having all this easily available from within Janet, REPL or Script or Project/Package, would likely feel more fluid and more easily approachable.

@tionis
Copy link
Contributor Author

tionis commented May 15, 2024

I think the main idea would be to have the base module handling (steps 3 & 4) in janet itself and keep the other parts in spork.
So janet releases could be supplied with spork preinstalled, so that it can update itself over the network and also handle the other protocols you suggested.
This would keep janet leaner and keep frequently changing code in spork.

@sogaiu
Copy link
Contributor

sogaiu commented May 16, 2024

I'm not too keen on all of spork being preinstalled with janet.

It now has a fair number of pieces by various authors at various levels of "completeness" / "stability".

I wonder if there is a way to get "just the necessary bits"...or perhaps there could be some other arrangement.

Edit 2: Perhaps a jpm-like tool could depend on code in spork but not cause spork to be installed as a library (so like what is possible now with declare-executable in jpm -- the tool could be a standalone binary).

So the tool could still end up as part of an installation of janet but it could remain unaffected for its basic functionality by what happens to live in janet's syspath. I hope I'm making sense here (^^;


Perhaps I didn't quite understand what was meant by "janet releases" though...


Edit 1: It occurs to me that there may be projects that depend on specific versions of spork. It seems possible that some of these might break if a newer version of spork gets installed when an upgrade to janet occurs. Not sure if this is a valid concern, but FWIW.

@tionis
Copy link
Contributor Author

tionis commented May 16, 2024

Hm, these are some valid concerns, I'm not sure what the best solution would be regarding those.
Perhaps @bakpakin has some opinion on these problems?

@llmII
Copy link
Contributor

llmII commented May 16, 2024

I'm unsure if spork should be an automatically included "batteries" library that essentially achieves a status as a de-facto part of the Janet standard library.

It's starting to sound like package management shouldn't be part of spork, if one's goal is to in essence say "use latest version of package management and/or Janet". Which almost sounds like keeping a jpm of sorts somewhere though it might be different than the jpm of today, being that it relies on things built in to the Janet standard library.

@sogaiu wrote:

Edit 2: Perhaps a jpm-like tool could depend on code in spork but not cause spork to be installed as a library (so like what is possible now with declare-executable in jpm -- the tool could be a standalone binary).

So the tool could still end up as part of an installation of janet but it could remain unaffected for its basic functionality by what happens to live in janet's syspath. I hope I'm making sense here (^^;

This would be nice but then package management wouldn't be very REPL-ish aside from low level concerns of uninstall/remove/reinstall.

If it makes sense for package management to be done via REPL and for the higher level package management tool depend on spork then spork might need a split between "stable" and "evolving", that or the REPL higher level package management library should intern spork basically (privatize all the symbols or namespace them behind a prefix like pm/spork/*).

Having wrote all this it almost seems like we need many small libraries instead of a few conglomerates but I don't really like that idea either (that's Javascript-esque... there's a library for that for everything). So now we have a better and more difficult idea that might satisfy all the things in the next sections.

Versioned imports. If we have a directory structure something like:

$syspath/spork/@latest
$syspath/spork/@$version

Then we could let spork be installed by default, the higher level package management tool could pin to a version, a user could request the latest, and unversioned imports from within Janet default to the latest. This means one could do something like (import spork :version 'version) or whatnot.

With that idea, spork could be included as well as a higher level package management library for the REPL, without clashing with what users need. Whether or not it's a good idea is another question, and even if it's a good idea, it's questionable if it's worth the extra complexity outside of the fact that this saga we're currently participating in will not be the only case where such a need is likely to come up wherein one needs 2 versions of the same library to service tools and service user code.

@bakpakin
Copy link
Member

I don't really plan on spork being released with Janet. If that was the goal, it would just be merged into Janet. However, spork contains a lot of the core functionality of JPM, just better.

On windows, it might make sense to have a spork setup option in the installer but ultimately it should be pretty easy to setup manually. Currently we have a Makefile target to try and setup JPM, so something like that might make it in.

However, the key idea is that you could have different versions of spork per project because it's just another pacakage in the tree

@amano-kenji
Copy link
Contributor

Is this going to create breaking changes?

@bakpakin
Copy link
Member

Is this going to create breaking changes?

No, this should be purely additive. JPM will still work for the foreseeable future, and won't require any changes to existing projects. Existing packages that use JPM will still work as well, and I see no reason why a project could not support both besides a bit of extra work. There are various levels of integration that we could go for here (should jpm installed packages be visible to bundle/install and bundle/uninstall), but the addition of plain functions won't mess things up here.

As per usual though, to avoid breaking changes, stick with release versions of Janet rather than latest master.

@bakpakin
Copy link
Member

bakpakin commented May 17, 2024

Versioned imports. If we have a directory structure something like:
$syspath/spork/@latest
$syspath/spork/@$version

Then we could let spork be installed by default, the higher level package management tool could pin to a version, a user could request the latest, and unversioned imports from within Janet default to the latest. This means one could do something like (import spork :version 'version) or whatnot.

So I'm open to ideas, but at first glance versioned imports seems like something I'm very much not going to implement. Rather, it would be a versioned install and the import would be something like

(bundle/install "/downloads/spork"  "spork-1.2.3") # install script dynamically chooses what root directory to install to based on bundle name

(import spork-1.2.3 :as spork)
(require (string "spork-" (calculate-needed-version)))

This looks awful to me in most cases, but its implementable via the usual mechanisms.

@amano-kenji
Copy link
Contributor

amano-kenji commented May 18, 2024

Perhaps, can project.janet specify something like this?

(declare-project
  :dependencies ["spork >= 1.3 && < 2"])

This is the haskell way of specifying dependencies. In ~/.cabal, various versions of spork are available.

@sogaiu
Copy link
Contributor

sogaiu commented May 20, 2024

it would be a versioned install

IIUC, at least this could help mitigate the problem of installation of one project clobbering the dependencies of another.

As an example, with how things work with jpm currently, suppose:

  • project A depends on version X of spork
  • project B depends on version Y of spork

Assume further that X < Y [1] and Y has changes that are not compatible with some functionality in X.

If someone has project A installed and then installs project B, project A may break because version X of spork has been replaced with version Y of spork.

With the proposed "versioned install" idea, if various projects make use of the mechanism, it seems like the aforementioned problem could be avoided (or at least lessened) -- at least by projects that make use of the mechanism.

[1] Strictly speaking, I don't think one needs any kind of version "number" ordering to express the problem mentioned above. IIUC, all that is necessary is that there are two different versions with some incompatibilities (e.g. imagine the case of two different commits from a particular repository where there are no version "numbers" associated with either commit).


As a side note, vendoring can also protect against the issue of disappearing dependencies (e.g. this recent PR). It looks like other communities have "immutable" stores that can address this.

May be that's not so important at this stage.

@llmII
Copy link
Contributor

llmII commented May 20, 2024

@bakpakin wrote:

So I'm open to ideas, but at first glance versioned imports seems like something I'm very much not going to implement. Rather, it would be a versioned install and the import would be something like

(bundle/install "/downloads/spork"  "spork-1.2.3") # install script dynamically chooses what root directory to install to based on bundle name

(import spork-1.2.3 :as spork)
(require (string "spork-" (calculate-needed-version)))

This looks awful to me in most cases, but its implementable via the usual mechanisms.

It does look awful, but versioned install solves the issue I had in mind. Someone could potentially "monkey-patch" import and friends to support "versioned import" to make it look nicer (pretend there's a canonical way as defined by the higher level package management library to version installs and someone's likely to have a little library to make versioned imports look nicer eventually).

You are pretty spot on that built-in low-level import and package management need not necessarily support versioning directly though.

@sogaiu wrote:

With the proposed "versioned install" idea, if various projects make use of the mechanism, it seems like the aforementioned problem could be avoided (or at least lessened) -- at least by projects that make use of the mechanism.

If various projects make use of it soon it'll be the canonical thing, and somewhere in there someone's going to make it look much nicer probably.

@sogaiu wrote:

As a side note, vendoring can also protect against the issue of disappearing dependencies (e.g. this recent PR). It looks like other communities have "immutable" stores that can address this.

May be that's not so important at this stage.

The immutable store thing sounds awesome, until going about the implementation thereof. Vendoring is a lot easier at than making an immutable store that supports $favorite-vcs, $archive, $something-phenomenally-ridiculous-but -somehow-necessary-to-support.

@sogaiu
Copy link
Contributor

sogaiu commented May 21, 2024

The immutable store thing sounds awesome,

I messed up a bit with terminology above -- probably I should have written something like (mostly) "append-only". A specific example of this might be maven central.

Git repositories on GitHub with the default branch being protected are somewhat close but IIUC these can just "go away" if a user removes the repository (or like in the recent xz incident, MS decides to just make something unavailable).

May be there was no confusion, but FWIW.

until going about the implementation thereof. Vendoring is a lot easier at than making an immutable store that supports $favorite-vcs, $archive, $something-phenomenally-ridiculous-but -somehow-necessary-to-support.

Yeah, probably it is not a simple matter, but I suppose it depends on the specifics a bit regarding how complicated things can get. Over time, depending on popularity, "append only" things seem like they could get costly though because the stored content just keeps growing...and growing (^^;


Edit: Sorry if this is getting a bit off-topic. May be it wouldn't be bad to make a list somewhere of related "concerns" (e.g. installed dependency clobbering, dependency disappearance, etc.) though. Possibly it would be helpful for present and future discussions?

@bakpakin
Copy link
Member

bakpakin commented May 22, 2024

Been doing some thinking about dependencies and bundle/install and bundle/uninstall, and I do think we should allow users to prevent breaking dependencies. Essentially, this is something that bundle/uninstall and bundle/install would check about what's currently installed before doing anything.

bundle/install would check that all required dependent files were present on the system before build and install, and bundle/uninstall would somehow check that no other bundles depend on any of its files. If either case fails, we also need to give a nice, instructive error message or result.

This does some what require that we distinguish between "incidentally" installed bundles and "directly" installed bundles.

(defn bundle/install
  ...
  (def deps (read-deps-data-jdn))
  (def missing (seq [d :in deps :unless (bundle/installed? d)] d))
  (when (next missing) (errorf "missing required dependencies: %s" (string/join missing ", ")))
  ...)
  
(defn bundle/uninstall
  ...
  (def deps (read-installed-deps-data-jdn))
  (def dependents (seq [d :in deps :if (bundle/installed-directly? d)] d))
  (when (next dependents) (errorf "cannot uninstall due to dependent bundles: %s" (string/join dependents ", ")))
  # uninstall my files, then uninstall dependent files
  (each d deps
    (if (= 1 (bundle-dep-ref-count d))
      (bundle/uninstall d)))

This works without needing to know anything about how to fetch dependencies or versions - the constraint is a function of what's in the syspath so we could definitely put it into the bundle/ module.

EDIT: The concept of directly installed bundles vs. incidentally installed bundles is basically "autoremove" in apt-get, or "orphaned packages" in pacman.

@sogaiu
Copy link
Contributor

sogaiu commented May 24, 2024

I've been trying to figure out how this:

bundle/install would check that all required dependent files were present on the system before build and install

would be done.

Is this a matter of checking existing manifest files?

@bakpakin
Copy link
Member

I've been trying to figure out how this:

bundle/install would check that all required dependent files were present on the system before build and install

would be done.

Is this a matter of checking existing manifest files?

I was thinking so, just check that existing bundles are installed first. It could also include a manual "deps" hook where you could also search for system dependencies or even do feature testing a la autoconf.

@sogaiu
Copy link
Contributor

sogaiu commented May 25, 2024

Thanks for the clarification.

It could also include a manual "deps" hook where you could also search for system dependencies or even do feature testing a la autoconf.

That sounds quite interesting.

@stutonk
Copy link

stutonk commented Jun 12, 2024

As a user of Janet and someone who thinks a lot about package managers, I've got some opinions/feedback about this. Since some of what I have to say is critical, I would like to preface this with a statement of my gratitude for all the thought and work that's gone into Janet including that of its community members helping answer questions on the forums as well as publishing excellent-quality information about various specific Janet topics. Having used almost every parens language, this one is absolutely the best and easiest to use/understand except for one enormous papercut: the relation between and usage of janet, jpm and spork. As it stands, it's way more difficult than it should be to get all three of these things up and running. The standard operation of jpm also has some serious issues.

My apologies for a bit of an aside here about spork since this thread is supposed to be about jpm. These three pieces of software are treated as separate but, due to their entanglement and some of the functionality in spork, I would argue are all actually de facto a part of the "standard Janet distribution", if you will.

I'm not too keen on all of spork being preinstalled with janet.

I'm unsure if spork should be an automatically included "batteries" library that essentially achieves a status as a de-facto part of the Janet standard library.

I don't really plan on spork being released with Janet. If that was the goal, it would just be merged into Janet. However, spork contains a lot of the core functionality of JPM, just better.

I primarily use Arch and Alpine. Currently, Alpine packages the most recent release of Janet but without jpm. In order to get spork in an Alpine container you have to install git and an entire C toolchain, build jpm and install it to your root tree not-under-package-management, and then use it to install spork. Arch does not package janet in its official repos. In the Arch AUR, there are several incompatible packages that either do or do not include jpm and thus some attempts to individually package spork will fail to build because of cross-confusion between their respective maintainers about the relationship between janet, jpm, and spork.

Just look how many times spork is mentioned in this thread. Spork documentation is included in the standard library documentation. Some of the functionality (such as netrepl) is critical editor integration tooling. And spork depends on jpm for delivery. Since some of the talk here is about functionality varying from commit to commit in spork. If nothing else changes, spork needs releases. That way it can be packaged by external packagers and installed without jpm or a C compiler as well as having better-defined information about what's in what version. But my opinion is that it should officially be a part of the stdlib (or have parts of it moved into the stdlib) and distributed with Janet or else have its documentation moved elsewhere. Having confusion that creates friction getting this stuff up and running is likely turning people away from the language. It's certainly frustrating to me.

In addition to what's mentioned above about hidden state via env variables creating problems, my primary issue with jpm is that the default workflow is to use superuser privileges to install packages to my root file tree. For me this absolutely non-negotiably does not fly (hence running it in a container) nor should it for anyone. Even in the absence of malicious intent, the potential for a critical mistake somewhere is enormous. Installing and uninstalling jpm packages will also not necessarily clean up all the files, allowing pollution to accumulate in the root tree. The other two installation options are not much better. They are to either specify some environment variables (which the original go package manager proved to be extremely brittle) or to use jpm -l which combines node_modules-style pollution with python-style environments. None of these three options are necessary. On Linux (and potentially other Unix-like systems), there's a (somewhat) well-defined list of standard locations to install things that don't require elevated permissions such as XDG_CACHE_HOME, $HOME/.cache, etc. Even ~/.janet, while not ideal, would be better. On Windows I believe there are also user-specific data/cache directories. I believe that some of the ideas presented here already such as the bundle family of functions have the potential to address some of these concerns but I have some additional ideas on how that might be made better.

At the top level, the interface to the whole thing could be along the lines of

(import-dep '{:git-url "https://github.com/janet-lang/spork :commit "ec40c7a"} :as spork)

Under the hood it would, upon first run/build use perhaps bundle to fetch, build in a tmpdir, and install the dependency to wherever in the user's tree. The specification would just be some struct whose content would vary depending on the specifics. For example, you could specify a :tag for git instead of a commit. The source doesn't have to be git. And the user would potentially be able to supply their own. With a central package repo it could look like (import-dep lib@version :as lib). Subsequent runs would just resolve to something like (import @the_user_install_dir/spork-ec40c7a :as spork). Specifying deps this ways has numerous advantages:

  • exact dependency specification without metadata or lockfiles
  • since deps are exact, builds are reproducible by default
  • no need for a built-in SAT solver to figure out things like lib version >= 0.1.3 <= 0.7.5
  • allow multiple versions of the same library even within the same project if necessary
  • multiple versions of the same library don't clobber each other
  • anything can be put under the hood to get the deps from any type of source
  • installing in a standardized, central location in the user tree removes the need for env variables or superuser privileges, doesn't create a per-project node_modules style trashpile, and Janet always knows where to find dependencies
  • however building and installing is handled can change without affecting the interface
  • it's still possible to make a file like deps.janet that imports everything for your project in one file and re-exports them in some shorter format for use in other files

As for building and installing, having a just-data package.jdn or whatever with some metadata in a struct (as mentioned above by pyrmont) to be possibly consumed by a central package repo along with a build.janet which ultimately just a script that can perhaps leverage some standard tooling seems sensible. The build.janet would be able to use whatever specific tools are necessary to build the project such as the full on C compiler apparatus for native modules or just a list of which files to install in the simple case of a pure Janet module. The build.janet itself could even have it's own import-deps.

I realize that some or all of these ideas may not be that great. Having deps build/install lazily and recursively on first run/build might not be ideal. Maybe it's too different of a workflow for existing projects to want to adopt. The functionality of jpm is currently covering two separate concerns (building janet projects and getting other people's code) which may be better left separate. However, Janet is still young enough to do something better without an enormous amount of friction. Many other languages have failed miserably at this admittedly extremely difficult problem and are locked in with their pile of crap. I implore you to please not reproduce the faults of luarocks, pip, gem, npm, or go's first package managers. For lack of better ideas, just copy cargo. Cargo only needs one metadata file per crate and doesn't clobber differently-versioned deps. It needs no env variables, virtual environments, per package dependency folders, or superuser privileges and has such sane defaults that it rarely requires configuration. It uses lock files and requires packages to be versioned so that builds are reproducible.

If not any of that, a robust and modular build system that's part of the stdlib like zig that lets distro packagers or CI easily package/build releases and so that vendoring is viable (which seems where this thread is tending) would be most welcome.

Once again, thank you all for all the hard work, excellent programming language, and effort around its community.

@bakpakin
Copy link
Member

@stutonk, agree with most of what is said here. It's not that well documented yet, but the bundle-tools branch has most of what you asked for already. Unlike JPM, packages are basically a set of plain Janet scripts + some extra static metadata for dependencies and whatever else you want, with not requirement on a C toolchain or multiple paths. There is also no requirement for any external tools or even networking - it operates simply on local paths.

Show and tell: The bundle-tools branch adds the following functions to Janet core:

For managing your local bundles (packages):

  • (bundle/install path &keys config) - Install a bundle from the local filesystem with a name bundle-name.
  • (bundle/uninstall bundle-name) - remove an installed bundle if no-one depends on it
  • (bundle/prune) - uninstall all bundles that are not directly installed (dependencies that are no longer needed).
  • (bundle/reinstall bundle-name &keys config) - rerun the installer of a local bundle. If it fails, revert.
  • (bundle/update-all) - reinstall all bundles

For inspecting current bundles:

  • (bundle/list) - list all installed bundles
  • (bundle/installed? bundle-name) - check if a bundle is installed.
  • (bundle/topolist) list all install bundles in an order such that each bundle is listed after its dependencies.
  • (bundle/manifest bundle-name) - get the manifest data for a given installed bundled.

For building a manifest:

  • bundle/add - adds files and directories recursively to the manifest during bundle install
  • bundle/add-directory - adds a single directory to the manifest during install
  • bundle/add-file - adds a file to a bundle during install

There is also example usage of the bundle/* functions in test/suite-bundle.janet as well as in https://github.com/janet-lang/spork/tree/bundle-hooks

Lately I've been working more on the compile-opt branch, but I think I will likely merge the bundle-tools branch to master soon, as I don't know if there is much more to take away that wouldn't really hinder usability.

@bakpakin
Copy link
Member

My apologies for a bit of an aside here about spork since this thread is supposed to be about jpm. These three pieces of software are treated as separate but, due to their entanglement and some of the functionality in spork, I would argue are all actually de facto a part of the "standard Janet distribution", if you will.

That would be nice, but I think that is the step after integrating JPM, as spork has C dependencies and I want it to be able to be built and developed independently of Janet. The bundle-hooks branch of spork can build itself without JPM. Currently, we don't distribute anything with Janet that isn't baked into the binary.

Originally jpm and spork were all separate so I didn't need to make so many frequent releases, but that is less of an issue now.

@pyrmont
Copy link
Contributor

pyrmont commented Jun 13, 2024

I thought there might be some benefit in summarising where things are. It looks like the following:

  1. In the near future, releases of Janet will include a collection of functions with the bundle/ 'namespace' that allows for the installation and uninstallation of bundles of code (including transitive dependencies) that are located on the local filesystem.

  2. In theory, users will be able to write code in their own projects to relatively easily bring in dependencies without relying on JPM. If these dependencies in turn support installation using the bundle/ functions, this would mean JPM is not required even for code that requires compilation. (To be clear, this does require the dependency authors to update their code to support this kind of installation.)

Additionally:

  1. JPM is not presently being changed to use the bundle/ functions. This could happen later but is not part of what @bakpakin is working on right now.

  2. The default module loading functions in Janet are not presently being changed. This means that if a user wants to do something like use different versions of the same module, they could do this but would have to do something like use different directory names for different versions (which they can already do).

This issue, as originally opened by @tionis, read to me as being about whether JPM should be integrated into Janet. The resolution of the above is that JPM won't be integrated into Janet. Rather, a collection of functions will be built-in to core. These functions could be used by users to install dependencies on their own (thereby avoiding any use of JPM) or to create their own package manager (thereby obviating JPM entirely).

@noncom
Copy link

noncom commented Oct 7, 2024

Bump.

What is the current status of this? I'm looking into organizing my workflow, and maybe automating some of the builds and project mainenance, and therefore it would be good to know if I should base it on the jpm, or if there's going to be some redesigned tooling soon.

@sogaiu
Copy link
Contributor

sogaiu commented Oct 9, 2024

There has definitely been work starting in June and through the most recent commit regarding the bundle bits.

This repository's Makefile now has:

janet/Makefile

Lines 363 to 367 in 3d93028

install-spork-git: $(JANET_TARGET)
mkdir -p build
rm -rf build/spork
git clone --depth=1 --branch='$(SPORK_TAG)' https://github.com/janet-lang/spork.git build/spork
$(JANET_TARGET) -e '(bundle/install "build/spork")'

A bit that shows building and installing spork using bundle-related callables.

Possibly of use are some of the tests to get a sense of what pieces are being exercised [1].

In short, I think it depends on specifically what one wants to do whether it'd be good to start using bundle and co.


[1] This directory has spork's use of bundle for interested parties.

@noncom
Copy link

noncom commented Oct 9, 2024

Thanks for the references! That's actually really cool, I browsed the code, and it really looks like something I'd use.
Using Janet to write the build script as if it was a program, in a more imperative style, is what I'm probably looking for.
I'll think it's not ready yet, but I might try it.

I think it would also be cool to have a declarative DSL like the one jpm provides, but maybe as an option that works together with the script. All in all, in a lisp, declarative vs imperative is a bit blurred distinction, and the declarative form works more like a common-case template/snippet for the scripted API rather than the full API of itself. As I understand, that's mostly what jpm does anyway, but it seems to be using some different cc.janet than what is there in bundle. Not exactly sure why though.

One thing I'd note though, is the hard coupling of Windows<->MSVS. It is very often that GCC is used on Windows, and while it works with .dlls and .libs, it has the full GCC command line, and not the MSVS one. Currently, the hard coupling is in https://github.com/janet-lang/spork/blob/master/bundle/build.janet. But as far as I've seen, cc.janet in Spork doesn't make that coupling, and the compiler can be chosen with *cc*, and then (compile-c) can be called rather than (msvc-compile-c), even on Windows.

@sogaiu
Copy link
Contributor

sogaiu commented Oct 10, 2024

As I understand, that's mostly what jpm does anyway, but it seems to be using some different cc.janet than what is there in bundle. Not exactly sure why though.

Regarding cc.janet, here is some info about it:

### Improved version of the C Compiler abstraction from JPM that should be more correct, composable, and
### have less configuration.

So may be it's ok to view spork's cc.janet as kind of a successor of what is in jpm.


Re: windows remarks -- I don't know if it makes any difference, but there is some :mingw support which behaves differently than :windows in janet IIUC. But may be you knew this already and if so, please feel free to ignore (^^;

@amano-kenji
Copy link
Contributor

Is cc a breaking change to jpm?

@bakpakin
Copy link
Member

@noncom That is a pretty accurate summary of the current state of things - cc.janet in spork is basically a wrapper around various C compilers to make it easier to build both Janet native module as well as arbitrary C and C++ based programs. I am using this for some of my own projects where I want more control over the build/compile processes without needing to write a lot of shell script or rely on Make/ninja.

As far as the DSL for making things easier, that would also be nice and is just something I haven't gotten around to. My eventual goal is probably to pretty closely copy the jpm build interface into spork.

As I understand, that's mostly what jpm does anyway, but it seems to be using some different cc.janet than what is there in bundle. Not exactly sure why though.

Because it was the first version and is frankly not as good. It makes some assumptions as to what compilers you are using and how your code is structured.

One thing I'd note though, is the hard coupling of Windows<->MSVS. It is very often that GCC is used on Windows, and while it works with .dlls and .libs, it has the full GCC command line, and not the MSVS one.

True, gcc on windows though is usually used with mingw. Clang is also commonly used. This is just something we should fix and have an interface for auto-detecting. That initial build.janet needs some better auto-detection logic for the current toolchain, as well as an override.

@bakpakin
Copy link
Member

Is cc a breaking change to jpm?

No, jpm is mostly unchanged and will remain unchanged for a while.

@rwtolbert
Copy link
Contributor

This very reasoned discussion is one of the many reasons I love using Janet. And for some reason, I to work on cross-platform problems so I've been playing with builds on Windows and a small C/C++ module.

I had planned to add jpm git install into the Windows build paths (the .bat file and some Windows-specific sections in meson.build), but reading all this makes me think I need to focus elsewhere.

Does it make sense to do more near-term work on jpm install etc. or focus more on the bundle side of things?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests