Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Direct sub-classing of pathlib.Path #68320

Open
projetmbc mannequin opened this issue May 6, 2015 · 28 comments · May be fixed by #31691
Open

Direct sub-classing of pathlib.Path #68320

projetmbc mannequin opened this issue May 6, 2015 · 28 comments · May be fixed by #31691
Labels
3.11 stdlib type-feature

Comments

@projetmbc
Copy link
Mannequin

@projetmbc projetmbc mannequin commented May 6, 2015

BPO 24132
Nosy @pfmoore, @pitrou, @keithy, @qb-cea, @miss-islington, @FFY00, @barneygale, @Xtrem532, @nyuszika7h, @kfollstad
PRs
  • #6248
  • #25240
  • #25271
  • #25701
  • #26141
  • #26438
  • #26708
  • #26906
  • #31085
  • #31691
  • Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.

    Show more details

    GitHub fields:

    assignee = None
    closed_at = None
    created_at = <Date 2015-05-06.05:26:53.017>
    labels = ['type-feature', 'library', '3.11']
    title = 'Direct sub-classing of pathlib.Path'
    updated_at = <Date 2022-03-09.23:54:09.299>
    user = 'https://bugs.python.org/projetmbc'

    bugs.python.org fields:

    activity = <Date 2022-03-09.23:54:09.299>
    actor = 'barneygale'
    assignee = 'none'
    closed = False
    closed_date = None
    closer = None
    components = ['Library (Lib)']
    creation = <Date 2015-05-06.05:26:53.017>
    creator = 'projetmbc'
    dependencies = []
    files = []
    hgrepos = []
    issue_num = 24132
    keywords = ['patch']
    message_count = 28.0
    messages = ['242643', '242651', '242656', '242657', '242689', '242696', '242699', '242700', '242701', '242702', '242703', '242705', '246007', '305799', '305811', '305827', '305830', '305918', '310634', '314561', '314582', '314625', '365277', '381321', '392695', '401946', '412354', '414821']
    nosy_count = 15.0
    nosy_names = ['paul.moore', 'pitrou', 'bronger', 'elguavas', 'piotr.dobrogost', 'Kevin.Norris', 'projetmbc', 'keithy', 'qb-cea', 'miss-islington', 'FFY00', 'barneygale', 'Xtrem532', 'nyuszika7h', 'kfollstad']
    pr_nums = ['6248', '25240', '25271', '25701', '26141', '26438', '26708', '26906', '31085', '31691']
    priority = 'normal'
    resolution = None
    stage = 'patch review'
    status = 'open'
    superseder = None
    type = 'enhancement'
    url = 'https://bugs.python.org/issue24132'
    versions = ['Python 3.11']

    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented May 6, 2015

    Hello.

    I have noticed a problem with the following code.

    from pathlib import Path
    
    class PPath(Path):
        def __init__(self, *args, **kwargs):
            super().__init__(*args, **kwargs)
    
    test = PPath("dir", "test.txt")

    This gives the following error message.

     
    Traceback (most recent call last):
      File "/Users/projetmbc/test.py", line 14, in <module>
        test = PPath("dir", "test.txt")
      File "/anaconda/lib/python3.4/pathlib.py", line 907, in __new__
        self = cls._from_parts(args, init=False)
      File "/anaconda/lib/python3.4/pathlib.py", line 589, in _from_parts
        drv, root, parts = self._parse_args(args)
      File "/anaconda/lib/python3.4/pathlib.py", line 582, in _parse_args
        return cls._flavour.parse_parts(parts)
    AttributeError: type object 'PPath' has no attribute '_flavour'

    This breaks the sub-classing from Python point of view.

    There is an ugly hack to sub-class Path but it's a bit unpythonic.

    @projetmbc projetmbc mannequin added the type-bug label May 6, 2015
    @projetmbc projetmbc mannequin changed the title Direct sub-classing of pathless.Path Direct sub-classing of pathlib.Path May 6, 2015
    @pfmoore
    Copy link
    Member

    @pfmoore pfmoore commented May 6, 2015

    One issue with your code - what would you expect str(test) to produce? "dir/test.txt" or "dir\test.txt"? That's the point of the "flavour" - is it a Windows path or a Unix path?

    Agreed that an easier method of creating Path subclasses that handle this type of thing would be useful, but any solution needs to make sure that developers don't overlook the Windows vs Unix implications.

    Can you give an actual use case (as opposed to the toy example)?

    @pitrou
    Copy link
    Member

    @pitrou pitrou commented May 6, 2015

    The Path classes were not designed to be subclassable by the user.
    I'm not against making subclassing easier, but someone will have to propose a viable approach for that.

    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented May 6, 2015

    Hello.

    I will give a real example in 5 hours after my job. I will try tomorrow a
    solution to ease the subclassing using another dedicazted class PathPlus,
    sorry for the name. The idea would be to use this new class for
    customization, and also to define WindowsPath and PosixPath sub-classing
    this new class. By default PathPlus would be an empty class. I do not know
    if this works well. Maybe my idea is a bad one.

    *Christophe BAL*
    *Enseignant de mathématiques en Lycée **et développeur Python amateur*
    *---*
    *French math teacher in a "Lycée" **and **Python **amateur developer*

    2015-05-06 13:05 GMT+02:00 Antoine Pitrou <report@bugs.python.org>:

    Antoine Pitrou added the comment:

    The Path classes were not designed to be subclassable by the user.
    I'm not against making subclassing easier, but someone will have to
    propose a viable approach for that.

    ----------
    versions: +Python 3.5 -Python 3.4


    Python tracker <report@bugs.python.org>
    <http://bugs.python.org/issue24132\>


    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented May 6, 2015

    Here are for example two extra methods that I have implemented.

    def __sub__(cls, path):
        """
    This magic method allows to use ``onepath - anotherpath`` instead of the
    long
    version ``onepath.relative_to(anotherpath)`` given by ``pathlib.Path``.
        """
        return cls.relative_to(path)
    
    def _ppath_common_with(cls, paths):
        """
    This method returns the path of the smaller common "folder" of the current
    path
    and at least one paths.

    python::
    from mistool import os_use

        path   = os_use.PPath("/Users/projects/source/doc")
        path_1 = os_use.PPath("/Users/projects/README")
        path_2 = os_use.PPath("/Users/projects/source/misTool/os_use.py")
    
        print(path.common_with((path_1, path_2)))
        """
        if not isinstance(paths, (list, tuple)):
            paths = [paths]
    
        commonparts = list(cls.parts)
    
        for onepath in paths:
            i = 0
    
            for common, actual in zip(commonparts, onepath.parts):
                if common == actual:
                    i += 1
                else:
                    break
    
            commonparts = commonparts[:i]
    
            if not commonparts:
                break
    
        commonpath = pathlib.Path("")
    
        for part in commonparts:
            commonpath /= part
    
        return commonpath

    *Christophe BAL*
    *Enseignant de mathématiques en Lycée **et développeur Python amateur*
    *---*
    *French math teacher in a "Lycée" **and **Python **amateur developer*

    2015-05-06 14:13 GMT+02:00 Christophe BAL <report@bugs.python.org>:

    Christophe BAL added the comment:

    Hello.

    I will give a real example in 5 hours after my job. I will try tomorrow a
    solution to ease the subclassing using another dedicazted class PathPlus,
    sorry for the name. The idea would be to use this new class for
    customization, and also to define WindowsPath and PosixPath sub-classing
    this new class. By default PathPlus would be an empty class. I do not know
    if this works well. Maybe my idea is a bad one.

    *Christophe BAL*
    *Enseignant de mathématiques en Lycée **et développeur Python amateur*
    *---*
    *French math teacher in a "Lycée" **and **Python **amateur developer*

    2015-05-06 13:05 GMT+02:00 Antoine Pitrou <report@bugs.python.org>:

    >
    > Antoine Pitrou added the comment:
    >
    > The Path classes were not designed to be subclassable by the user.
    > I'm not against making subclassing easier, but someone will have to
    > propose a viable approach for that.
    >
    > ----------
    > versions: +Python 3.5 -Python 3.4
    >
    > _______________________________________
    > Python tracker <report@bugs.python.org>
    > <http://bugs.python.org/issue24132\>
    > _______________________________________
    >

    ----------


    Python tracker <report@bugs.python.org>
    <http://bugs.python.org/issue24132\>


    @pfmoore
    Copy link
    Member

    @pfmoore pfmoore commented May 6, 2015

    For that type of function, I'd suggest you use a standalone function rather than subclassing and methods or operator overloading. You don't gain enough to be worth the complexity of having to subclass path objects. And duck typing means that your function works for any subclass of (Pure)Path without change.

    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented May 6, 2015

    I don't agree with you. I prefer to add new functionalities to the paths I
    use. This is the power of OOP. It is easier and cleaner to use
    *mypath.common_with(otherpath)* than *common_with(*mypath, **other path)
    .

    Python is highly OOP, so you can't say *"don't use subclassing in your
    case"*. As a user, I should have the possibility to use the method I want.

    Another example is the use of *onepath - anotherpath* instead of
    *onepath.relative_to(*another path) . That's the power of the magic
    method to add this kind of feature.

    *Christophe BAL*
    *Enseignant de mathématiques en Lycée **et développeur Python amateur*
    *---*
    *French math teacher in a "Lycée" **and **Python **amateur developer*

    2015-05-06 20:21 GMT+02:00 Paul Moore <report@bugs.python.org>:

    Paul Moore added the comment:

    For that type of function, I'd suggest you use a standalone function
    rather than subclassing and methods or operator overloading. You don't gain
    enough to be worth the complexity of having to subclass path objects. And
    duck typing means that your function works for any subclass of (Pure)Path
    without change.

    ----------


    Python tracker <report@bugs.python.org>
    <http://bugs.python.org/issue24132\>


    @pfmoore
    Copy link
    Member

    @pfmoore pfmoore commented May 6, 2015

    I have no problem with that - it's a style choice certainly.

    As I said, I'd like to see simpler subclassing of pathlib objects. I just think it'll be quite hard to do (given the complexities of classes for Windows/Unix as well as pure and concrete paths). So if it's just about examples like this, I personally would take the easier route and just go with standalone functions. If someone else felt strongly enough to design and implement a subclassing solution, that's fine though.

    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented May 6, 2015

    Are you the author of path lib ?

    *Christophe BAL*
    *Enseignant de mathématiques en Lycée **et développeur Python amateur*
    *---*
    *French math teacher in a "Lycée" **and **Python **amateur developer*

    2015-05-06 21:01 GMT+02:00 Paul Moore <report@bugs.python.org>:

    Paul Moore added the comment:

    I have no problem with that - it's a style choice certainly.

    As I said, I'd like to see simpler subclassing of pathlib objects. I just
    think it'll be quite hard to do (given the complexities of classes for
    Windows/Unix as well as pure and concrete paths). So if it's just about
    examples like this, I personally would take the easier route and just go
    with standalone functions. If someone else felt strongly enough to design
    and implement a subclassing solution, that's fine though.

    ----------


    Python tracker <report@bugs.python.org>
    <http://bugs.python.org/issue24132\>


    @pfmoore
    Copy link
    Member

    @pfmoore pfmoore commented May 6, 2015

    Are you the author of path lib ?

    Nope, that's Antoine.

    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented May 6, 2015

    OK.
    I will try to find a way to achieve an easier and cleaner way to sub class
    pathlib.Path and co.

    What is the good way to propose a patch ?

    *Christophe BAL*
    *Enseignant de mathématiques en Lycée **et développeur Python amateur*
    *---*
    *French math teacher in a "Lycée" **and **Python **amateur developer*

    2015-05-06 21:09 GMT+02:00 Paul Moore <report@bugs.python.org>:

    Paul Moore added the comment:

    > Are you the author of path lib ?

    Nope, that's Antoine.

    ----------


    Python tracker <report@bugs.python.org>
    <http://bugs.python.org/issue24132\>


    @pfmoore
    Copy link
    Member

    @pfmoore pfmoore commented May 6, 2015

    What is the good way to propose a patch ?

    If you have a patch, attach it here, and it will get reviewed.

    @KevinNorris
    Copy link
    Mannequin

    @KevinNorris KevinNorris mannequin commented Jul 1, 2015

    If I were designing pathlib from scratch, I would not have a separate Path class. I would instead do something like this:

    In pathlib.py:

        if os.name == 'nt':
            Path = WindowsPath
        else:
            Path = PosixPath

    Alternatively, Path() could be a factory function that picks one of those classes at runtime.

    Of course, that still leaves the issue of where to put the method implementations which currently live in Path. We could change the name of Path to _Path and use the code above to continue providing a Path alias, but that might be too confusing. Another possibility is to pull those methods out into top-level functions and then alias them into methods in WindowsPath and PosixPath (perhaps using a decorator-like-thing to pass the flavor, instead of attaching it to the class).

    The main thing, though, is that Path should not depend on its subclasses. That really strikes me as poor design, since it produces issues like this one.

    @elguavas
    Copy link
    Mannequin

    @elguavas elguavas mannequin commented Nov 8, 2017

    Using a set of paths with special properties and formats in a project, thought "the cleanest oop way to do this is try out python's oop paths in pathlib". Subclassed Path to implement my extra (non platfor specific) properties and fell at the first hurdle because of this issue...

    for me pathlib does not provide oop paths if i can't subclass Path, for whatever reason.

    reverted to treating paths as strings and writing functions to handle my special path properties and formats.

    i was also surprised when i found another bug report on this issue that said it was closed for 3.7, great i thought this has been solved, but no, the other report was closed because it was about the same issue as this ancient report.

    @pfmoore
    Copy link
    Member

    @pfmoore pfmoore commented Nov 8, 2017

    @elguavas the problem is, no-one has proposed a patch. There's not likely to be much movement on this until someone provides one.

    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented Nov 8, 2017

    For the moment, you can take a look at this little script that acheives
    subclassing of Path :
    https://github.com/bc-python/mistool/blob/master/mistool/os_use.py
    (search for class Path).

    Le 08/11/2017 à 09:55, Paul Moore a écrit :

    Paul Moore <p.f.moore@gmail.com> added the comment:

    @elguavas the problem is, no-one has proposed a patch. There's not likely to be much movement on this until someone provides one.

    ----------


    Python tracker <report@bugs.python.org>
    <https://bugs.python.org/issue24132\>


    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented Nov 8, 2017

    Mistyping : /search for class PPath/ with two P.

    Le 08/11/2017 à 13:59, Christophe BAL a écrit :

    Christophe BAL <projetmbc@gmail.com> added the comment:

    For the moment, you can take a look at this little script that acheives
    subclassing of Path :
    https://github.com/bc-python/mistool/blob/master/mistool/os_use.py
    (search for class Path).

    Le 08/11/2017 à 09:55, Paul Moore a écrit :
    > Paul Moore <p.f.moore@gmail.com> added the comment:
    >
    > @elguavas the problem is, no-one has proposed a patch. There's not likely to be much movement on this until someone provides one.
    >
    > ----------
    >
    > _______________________________________
    > Python tracker <report@bugs.python.org>
    > <https://bugs.python.org/issue24132\>
    > _______________________________________
    ----------


    Python tracker <report@bugs.python.org>
    <https://bugs.python.org/issue24132\>


    @elguavas
    Copy link
    Mannequin

    @elguavas elguavas mannequin commented Nov 8, 2017

    @paul.moore is the original contributor mia? i seem to remember pathlib as once being marked 'provisional', i think it should have stayed that way until this problem was resolved. easy to say i know ;) when i don't have a patch.

    @projetmbc yes i found various work-arounds on the web and decided to not use any of them. really i feel this should be fixed as it's a jarring inconsistency with naturally expected behaviour for a class in python.

    so i added my report to this as a topic bump because i don't think this should be forgotten about and in case anyone might come up with an idea how to fix it.

    @keithy
    Copy link
    Mannequin

    @keithy keithy mannequin commented Jan 24, 2018

    Look at the architecture of Rio in Ruby (also ported to Squeak/Smalltalk)

    Leave Path to handle path stuff, and have another class to handle Platform stuff.

    https://rubygems.org/gems/rio/versions/0.6.0

    @qb-cea
    Copy link
    Mannequin

    @qb-cea qb-cea mannequin commented Mar 28, 2018

    Hi all,

    I made a pull request proposing a fix for this issue. There is still quite a lot to be done:

    • I exposed some variables (and probably methods too) that used to be hidden;
    • I did not update the documentation;
    • I did not add a proper test.

    I will try to fix those by the end of the week.

    The patch mainly consists of two things:

    • having Path (resp. PurePath) be a variable pointing at either (Pure)PosixPath or (Pure)WindowsPath, depending on the platform (like Kevin Norris suggested);
    • introducing two new abstract classes _PurePath and ConcretePath from which PurePosixPath, PureWindowsPath and PosixPath, WindowsPath classes inherit;
    • removing the _Flavor classes, and redistributing their method to platform-specific classes.

    Ideally I would like _PurePath to become a public class, but I could not come up with a proper name. Any feedback is more than welcome =]

    @projetmbc
    Copy link
    Mannequin Author

    @projetmbc projetmbc mannequin commented Mar 28, 2018

    Hello.

    What about AbstractPath instead of _PurePath ?

    Le 28/03/2018 à 02:30, qb-cea a écrit :

    qb-cea <quentin.bouget@cea.fr> added the comment:

    Hi all,

    I made a pull request proposing a fix for this issue. There is still quite a lot to be done:

    • I exposed some variables (and probably methods too) that used to be hidden;
    • I did not update the documentation;
    • I did not add a proper test.

    I will try to fix those by the end of the week.

    The patch mainly consists of two things:

    • having Path (resp. PurePath) be a variable pointing at either (Pure)PosixPath or (Pure)WindowsPath, depending on the platform (like Kevin Norris suggested);
    • introducing two new abstract classes _PurePath and ConcretePath from which PurePosixPath, PureWindowsPath and PosixPath, WindowsPath classes inherit;
    • removing the _Flavor classes, and redistributing their method to platform-specific classes.

    Ideally I would like _PurePath to become a public class, but I could not come up with a proper name. Any feedback is more than welcome =]

    ----------


    Python tracker <report@bugs.python.org>
    <https://bugs.python.org/issue24132\>


    @qb-cea
    Copy link
    Mannequin

    @qb-cea qb-cea mannequin commented Mar 29, 2018

    What about AbstractPath instead of _PurePath ?

    I will use this, thanks.

    @barneygale
    Copy link
    Mannequin

    @barneygale barneygale mannequin commented Mar 29, 2020

    I'm taking another look at making pathlib extensible. There's some discussion here: https://discuss.python.org/t/make-pathlib-extensible/3428

    List or preparatory bugfixes and tidy-ups: https://docs.google.com/spreadsheets/d/1TicFDMudKKA6CZcrscg1Xq9kt5Q8To8y0hADGw9u11I/edit#gid=0

    @qb-cea
    Copy link
    Mannequin

    @qb-cea qb-cea mannequin commented Nov 18, 2020

    Hi,

    Thanks for reviving this! Feel free to reuse any code I wrote in my PR (or the whole PR itself), I do not think I will ever get around to finishing this work myself.

    @iritkatriel iritkatriel added 3.10 stdlib type-feature and removed type-bug labels Mar 26, 2021
    @barneygale
    Copy link
    Mannequin

    @barneygale barneygale mannequin commented May 2, 2021

    Progress report:

    I've been working on tidying up the pathlib internals over the 3.9 and 3.10 releases. We're now in a position where:

    • pathlib._Flavour is entirely pure, and doesn't make any os calls
    • pathlib._Accessor handles all os access.

    The internal abstractions are now much tighter, which allows us to begin refactoring them with confidence!

    The next step is to remove accessors, in bpo-43012.

    After that I'll finally be in a position to start working on this bug!

    @nyuszika7h
    Copy link
    Mannequin

    @nyuszika7h nyuszika7h mannequin commented Sep 16, 2021

    I agree this would be nice. For now, I'm doing this as a hack:

    class Path(type(pathlib.Path())):
        ...

    @merwok merwok added 3.11 and removed 3.10 labels Jan 1, 2022
    @miss-islington
    Copy link
    Contributor

    @miss-islington miss-islington commented Feb 2, 2022

    New changeset 08f8301 by Barney Gale in branch 'main':
    bpo-43012: remove pathlib._Accessor (GH-25701)
    08f8301

    @barneygale
    Copy link
    Mannequin

    @barneygale barneygale mannequin commented Mar 9, 2022

    If/when python/issues-test-cpython#31691 lands, I think this bug can be resolved: the original repro case will no longer raise AttributeError, and subclasses will be able to customize behaviour without needing to define further "flavour" or "accessor" subclasses.

    @ezio-melotti ezio-melotti transferred this issue from another repository Apr 10, 2022
    Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
    Labels
    3.11 stdlib type-feature
    Projects
    None yet
    Development

    Successfully merging a pull request may close this issue.

    5 participants