Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Decide and implement a solution for internal-facing APIs #4467

Open
1 of 4 tasks
sffc opened this issue Dec 19, 2023 · 14 comments
Open
1 of 4 tasks

Decide and implement a solution for internal-facing APIs #4467

sffc opened this issue Dec 19, 2023 · 14 comments
Assignees
Labels
C-meta Component: Relating to ICU4X as a whole S-medium Size: Less than a week (larger bug fix or enhancement)

Comments

@sffc
Copy link
Member

sffc commented Dec 19, 2023

We've hit multiple snags recently where we wanted to change a mostly-internal function that was pub and were blocked because of semver:

  • cmp_iter functions in icu_locid
  • new_owned and Cart types in icu_provider
  • Probably more

Some thoughts/comments/problems:

  1. Even if these had been public-internal functions, we still have some desire to keep them stable-ish because then we can break hard tilde dependencies between components/utils (Revisit policy around component crate interdependent versions #4343)
  2. If the functions are #[doc(hidden)], they are less discoverable including to ICU4X developers. We can't expect ICU4X developers to look into the source code just to see if a function they want happens to be there.
  3. These functions should most likely be excluded from FFI by default
  4. If we are poking holes in a util crate, the holes we poke should probably be part of the public API of that crate, because it is a utility crate. However, within ICU4X component and provider crates, it is okay and expected for us to poke holes.
  5. Previous discussion: Document traits as either sealed or implementable #4338 (comment)

Here's a proposal:

Add a macro_rules that generates the following boilerplate:

#[doc = "<div class=\"stab unstable\">"]
#[doc = "🚧 This code is internal to ICU4X; it may change at any time, in breaking or non-breaking ways,"]
#[doc = "including in SemVer minor releases. It can be enabled with the \"icu4x_internal\" Cargo feature."]
#[doc = "</div>"]
#[cfg(feature = "icu4x_internal")]

The macro_rules can live in icu_locid for now. We can move it later.

To annotate an internal API, you can do this:

/// These are docs for my internal API
icu_locid::_internal::icu4x_internal_api!()
pub struct Foo {}

The icu4x_internal Cargo feature does not bubble up to the metacrate. It can be selectively enabled whenever it is needed. One advantage of making an icu4x_internal feature is that we can disable it when uploading docs to docs.rs (https://docs.rs/about/metadata), although I don't think there's a way to disable just that one feature; we would need to switch from --all-features to an explicit feature list per-crate, which is error-prone. Alternatively, I'm fine if we say we don't introduce a Cargo feature for this and simply let these APIs live in the docs.

In the 1.5 or 2.0 timeframe, I suggest that we scrub our API surface and add this annotation where appropriate. A lot of things that are currently #[doc(hidden)] should probably use the annotation instead, and likewise things that are currently public stable should also probably use the annotation.

If we agree on this, I may suggest we make a similar annotation for the provider module.

Needs approval or input from:

@sffc sffc added C-meta Component: Relating to ICU4X as a whole needs-approval One or more stakeholders need to approve proposal labels Dec 19, 2023
@sffc
Copy link
Member Author

sffc commented Dec 19, 2023

Another solution I'm open to considering is to have the macro generate #[doc(hidden)], in addition to the internal warning, and then when generating internal docs use --document-private-items which includes doc(hidden) APIs.

It would make me very very happy if Rust itself had an annotation for this. Anything like "this API might not be semver safe" which generates a warning when using it cross-crate.

@Manishearth
Copy link
Member

I think what we should do for these is that we have the macro generate cfg_attr(not(internal_docs), doc(hidden)) and generate our internal docs with RUSTFLAGS=--cfg internal_docs. We shouldn't use a cargo feature, we should use a cfg (since cargo features get enabled with --all-features)

Generating that doc header seems nice, if we use a macro we can tweak the precise solution over time.

@sffc sffc removed the needs-approval One or more stakeholders need to approve proposal label Dec 19, 2023
@sffc
Copy link
Member Author

sffc commented Dec 19, 2023

@Manishearth's proposal SGTM.

Either way I would like a macro so that it's easy to find and tweak over time.

@Manishearth
Copy link
Member

Concrete proposal:

  • We have a macro
  • It marks things as doc(hidden) except in cfg(icu4x_internal_docs) mode
  • ICU4X docs-gen has that CFG set
  • (optional) there's a nice "unstable feature" header

LGTM to me and Shane.

@sffc
Copy link
Member Author

sffc commented Dec 19, 2023

Specifically, the cfg should be set on the job that creates the docs uploaded to https://unicode-org.github.io/icu4x/rustdoc/, and the "unstable feature" box should be included so that people referencing those docs don't think the function is public stable.

@sffc sffc added this to the 1.5 Blocking ⟨P1⟩ milestone Feb 29, 2024
@sffc sffc added this to icu4x 2.0 Feb 29, 2024
@sffc sffc moved this to Unclaimed for sprint in icu4x 2.0 Feb 29, 2024
@sffc sffc added the S-medium Size: Less than a week (larger bug fix or enhancement) label Feb 29, 2024
@sffc
Copy link
Member Author

sffc commented Mar 13, 2024

Open question: where should the internal docs macro live? I propose it lives in a macros file that we copy around the repo into each crate and have a CI job to ensure it remains updated, similar to LICENSE.

Needs approval:

@sffc sffc added the needs-approval One or more stakeholders need to approve proposal label Mar 13, 2024
@robertbastian
Copy link
Member

Can we use symlinks? I'd rather not copy around, because then we'd have another CI job to enforce consistency, and another make job to "generate" these.

@sffc
Copy link
Member Author

sffc commented Mar 13, 2024

Symlinks are annoying for Windows users, but it appears there are solutions that require some manual setup.

I tend to feel that in order of priority of constituencies, the experience for new contributors should be favored over the experience of core ICU4X developers. Avoiding symlinks is harder on the core developers but easier for new contributors since they don't need any special configurations.

@robertbastian
Copy link
Member

We've had a symlink in the repo for months now and I haven't heard any complaints.

@robertbastian
Copy link
Member

I tend to feel that in order of priority of constituencies, the experience for new contributors should be favored over the experience of core ICU4X developers.

I also disagree with this. ICU4X core contributors spend a lot of time contributing to ICU4X. We just removed a bunch of generated files from the repo which again and again lead to failing CI and delays for core contributors.

@sffc
Copy link
Member Author

sffc commented Mar 13, 2024

The removal of the generated files you reference was done after establishing that doing so didn't substantially impact other constituencies.

@sffc
Copy link
Member Author

sffc commented Mar 13, 2024

The existing symlink impacts diplomat-gen, which impacts all contributors but not clients.

The proposed symlink would impact anyone building icu4x from the repo.

@sffc sffc moved this from Unclaimed for sprint to Needs discussion to unblock in icu4x 2.0 Mar 14, 2024
@sffc sffc removed the needs-approval One or more stakeholders need to approve proposal label Mar 14, 2024
@sffc
Copy link
Member Author

sffc commented Mar 14, 2024

  • @robertbastian - Symlinks work for Windows: Git will either copy the files around or set up symlinks if admin privileges are there. But if a Windows user edits one of the files, then things could get messed up for them.
  • @Manishearth - I'm concerned about us editing files upstream and them not being reflected for users
  • @robertbastian - There's a clear workaround for Windows users. We can reommend that.
  • @sffc - I'm concerned that relying on symlinks will cause problems for more users since this impacts all clients who build ICU4X from Git
  • @robertbastian - These seem like hypothetical issues. Can we start with the symlink option? The CI thing is annoying; I don't like having to run scripts to update these files.
  • @sffc - Another issue is that I don't see a way for macro_rules to generate attributes
  • @Manishearth - I don't think the file should be included in crates that aren't using it. Clients ask questions about files in projects that aren't being used.
  • @sffc - This makes it more challenging for contributors. I agree that client needs should be more important than contributor needs, but I'm not convinced about the client needs in this case. But I'm okay starting with this policy.
  • @robertbastian - OK but if the CI starts breaking on my PRs I won't be happy

Points:

  • The main thing we seem to care about is the "box" in the docs saying if an API is internal/private/etc. We can have a #[doc = internal!()] macro for this, with different macros for different kinds of docs
  • #[cfg(not(icu4x_internal), doc(hidden))] does not appear to have much utility over just using --document-hidden-items

Proposal:

  • We have a macro or multiple internal macros for doc boxes
  • We publish ICU4X internal dev docs with --document-hidden-items, and potentially --document-private-items if it doesn't slow down search or otherwise cause problems. We should ask rustdoc to make it clear that it is hidden.
  • All instances of #[doc(hidden)] should have a comment on the same line explaining why it is hidden and also use the docs macro

LGTM: @Manishearth @sffc @robertbastian

Implementation proposal:

  • Augment the copy-license job to also copy the macros file
  • The files get copied to:
    • crate/LICENSE
    • crate/icu4x_shared.rs
  • When they are needed, they get include!d in the files that use them
  • LICENSE continues to have a denylist; icu4x_shared.rs has an allowlist

LGTM: @sffc @Manishearth

@sffc sffc moved this from Needs discussion to unblock to Unclaimed for sprint in icu4x 2.0 Mar 14, 2024
@robertbastian robertbastian self-assigned this Mar 20, 2024
@sffc sffc modified the milestones: 1.5 Blocking ⟨P1⟩, ICU4X 2.0 May 23, 2024
@Manishearth
Copy link
Member

Cross posting from #5101 (comment):

My opinion on this shifts if we plan to put unsafe code in icu4x_shared: i'd rather we used a separate crate since it seems suboptimal for every ICU4X crate to suddenly contain unsafe. This adds a ton of friction to import processes involving unsafe review; so far the fact that ICU4X is a large number of crates is manageable, but if it's a large number of crates that includes a bunch of unsafe code it's worse.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-meta Component: Relating to ICU4X as a whole S-medium Size: Less than a week (larger bug fix or enhancement)
Projects
Status: Unclaimed for sprint
Development

No branches or pull requests

3 participants