Code Monkey home page Code Monkey logo

path-clean's Introduction

path-clean

crates.io version build status docs.rs docs license

Installation

cargo add path-clean

Usage

use std::path::PathBuf;
use path_clean::{clean, PathClean};
assert_eq!(clean("hello/world/.."), PathBuf::from("hello"));
assert_eq!(
  PathBuf::from("/test/../path/").clean(),
  PathBuf::from("/path")
);

About

path-clean is a Rust port of the the cleanname procedure from the Plan 9 C library, and is similar to path.Clean from the Go standard library. It works as follows:

  1. Reduce multiple slashes to a single slash.
  2. Eliminate . path name elements (the current directory).
  3. Eliminate .. path name elements (the parent directory) and the non-. non-.., element that precedes them.
  4. Eliminate .. elements that begin a rooted path, that is, replace /.. by / at the beginning of a path.
  5. Leave intact .. elements that begin a non-rooted path.

If the result of this process is an empty string, return the string ".", representing the current directory.

It performs this transform lexically, without touching the filesystem. Therefore it doesn't do any symlink resolution or absolute path resolution. For more information you can see "Getting Dot-Dot Right".

For convenience, the [PathClean] trait is exposed and comes implemented for [std::path::PathBuf].

License

MIT OR Apache-2.0

path-clean's People

Contributors

adamreichold avatar danreeves avatar heyimalex avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

path-clean's Issues

Is this safe?

Hello,

There is ongoing discussion for sanitizing paths in typst, I was wondering if you consider this implementation to be acceptable as a safety feature? For example to check whether a script is trying to access a file outside of the predetermined root directory?

Thanks in advance,

Dherse

Don't assume valid UTF-8

Currently, your clean function requires &str and your trait just throws out any paths containing invalid UTF-8.

Given how easily one can have invalid UTF-8 on POSIX platforms from mojibake in filenames (eg. Load data onto a mobile device that uses latin1 on-disk using its USB Mass Storage Device mode, then pull the SD card out and plug it into a UTF-8 Linux device directly), this makes it of very limited use. (And Windows also allows unpaired UTF-16 surrogates in filenames for historical reasons.)

The proper solution would be to use the Path::components iterator so you don't need to convert to &str to match on component types or, for that matter, even think about what path separator the platform is using.

You'd just get a sequence of Prefix(PrefixComponent) (Stuff like C: or \\server\share), RootDir, CurDir, ParentDir, and Normal(&OsStr) values, and it'd collapse away //, /./, and trailing / and /. for you.

(That'd also allow you to eliminate an unnecessary intermediate copy in going from PathBuf to &str, since you can just impl your trait on Path.)

clean("c:/temp/..") results in "c:", not "c:/"

You would expect "c:/temp/.." to be normalized to "c:/", but it results in "c:". "c:" does not mean the root of windows file system, but rather current directory in drive c:.

    let buf = clean("c:/temp/..");
    assert_eq!(buf, "c:");

Consider adding #![forbid(unsafe_code)]

cargo-geiger marks crates in one of three ways:

  • Red for crates using unsafe and, thus, in need of more careful and skilled auditing.
  • Uncoloured for crates wth no unsafe
  • Green for crates which use #![forbid(unsafe_code)]

Given that the point of path-clean is to "[perform] this transform lexically, without touching the filesystem" (unlike normpath which depends on GetFullPathNameW on Windows), adding #![forbid(unsafe_code)] as an additional indicator of the intent to do it without FFI calls seems like a good idea.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.