Skip to content

Tag the safety properties of the Rust standard library.

Notifications You must be signed in to change notification settings

Artisan-Lab/tag-std

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 

Repository files navigation

tag-std

This project aims to standardize the safety property annotation of the Rust core and standard library. There are three steps:

  • Define the primitive safety properties to be used for describing the safety concerns of unsafe APIs.
  • Label the unsafe APIs in Rust's core and standard library with primitive safety properties.
  • Detect and solve discripencies via program analysis.

Through this project, we aim to establish a foundation for Rust contract design and verification, serving as a preliminary step toward this goal. For example, consider Challenge 3: Verifying Raw Pointer Arithmetic Operations from Verify Rust Std Lib. In this context, we can annotate all safety properties of unsafe APIs with primitive safety properties. For instance, the unsafe API *const T::add() is described as follows:

pub const unsafe fn add(self, count: usize) -> Self
where
    T: Sized,

///Safety
///If any of the following conditions are violated, the result is Undefined Behavior:
///-The offset in bytes, count * size_of::<T>(), computed on mathematical integers (without ``wrapping around''), must fit in an isize.
///-If the computed offset is non-zero, then self must be derived from a pointer to some allocated object, and the entire memory range between self and the result must be in bounds of that allocated object. In particular, this range must not “wrap around” the edge of the address space.
///Allocated objects can never be larger than isize::MAX bytes, so if the computed offset stays in bounds of the allocated object, it is guaranteed to satisfy the first requirement. This implies, for instance, that vec.as_ptr().add(vec.len()) (for vec: Vec<T>) is always safe.
///Consider using wrapping_add instead if these constraints are difficult to satisfy. The only advantage of this method is that it enables more aggressive compiler optimizations.

We can tag the API with the following primitive safety property:

  • Primitive SP template: ValidInt(binop, x, y, T), which means $\text{T::MAX} \geq \text{binop}(x, y) \geq \text{T::MIN} $;
    • Specific primitive SP for the API: ValidInt(mul, count, sizeof(T), isize).

For another instance, the unsafe API ptr::copy() is described as follows:

pub const unsafe fn copy<T>(src: *const T, dst: *mut T, count: usize)

///Safety
///Behavior is undefined if any of the following conditions are violated:
///-src must be valid for reads of count * size_of::<T>() bytes, and must remain valid even when dst is written for count * size_of::<T>() bytes. (This means if the memory ranges overlap, the two pointers must not be subject to aliasing restrictions relative to each other.)
///-dst must be valid for writes of count * size_of::<T>() bytes, and must remain valid even when src is read for count * size_of::<T>() bytes.
///-Both src and dst must be properly aligned.
///Like read, copy creates a bitwise copy of T, regardless of whether T is Copy. If T is not Copy, using both the values in the region beginning at *src and the region beginning at *dst can violate memory safety.
///Note that even if the effectively copied size (count * size_of::<T>()) is 0, the pointers must be properly aligned.

We can tag the API with the following primitive safety property:

  • Primitive SP template: Bounded(p, T, range), which means $\forall offset\in range, \text{typeof}(*(p + \text{sizeof}(T) * offset)) = T $;
    • Specific primitive SP for the API: Bounded(src, T, 0..count) and Bounded(dst, T, 0..count)
  • Primitive SP template: Init(p, T, range), which means $$\forall offset \in range, \text{init}(*(p + \text{sizeof}(T) * offset), T) = true $$;
    • Specific primitive SP for the API: Init(dst, T, count)
  • Primitive SP template: NonOverlap(dst, src, T), which means $|dst - src| &gt; \text{sizeof}(T)$;
    • Specific primitive SP for the API: NonOverlap(dst, src, T)
  • Primitive SP template: Aligned(p, T), which means $p \% \text{alignment}(T) = 0$;
    • Specific primitive SP for the API: Aligned(src, T) and Aligned(dst, T)

These are the preconditions for calling the unsafe APIs. We need more properties to discribe the hazards when the content is not Copy.

  • Primitive SP template: Alias(p1, p2), which means $*p1 = *p2$;
    • Specific primitive SP for the API: Alias(*dst, *src)

When proving the soundness of String::remove() (see the code below), it is essential to verify that the primitive safety properties of its interior unsafe APIs ptr.add() and ptr::copy() are met in all cases.

pub fn remove(&mut self, idx: usize) -> char {
        let ch = match self[idx..].chars().next() {
            Some(ch) => ch,
            None => panic!("cannot remove a char from the end of a string"),
        };

        let next = idx + ch.len_utf8();
        let len = self.len();
        unsafe {
            ptr::copy(self.vec.as_ptr().add(next), self.vec.as_mut_ptr().add(idx), len - next);
            self.vec.set_len(len - (next - idx));
        }
        ch
    }

About

Tag the safety properties of the Rust standard library.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published