• 设为首页
  • 点击收藏
  • 手机版
    手机扫一扫访问
    迪恩网络手机版
  • 关注官方公众号
    微信扫一扫关注
    迪恩网络公众号

treevalue: OpenDILab树形结构数据容器

原作者: [db:作者] 来自: 网络 收藏 邀请

开源软件名称:

treevalue

开源软件地址:

https://gitee.com/opendilab/treevalue

开源软件介绍:

treevalue

PyPIPyPI - Python VersionLocComments

Docs DeployCode TestBadge CreationPackage Releasecodecov

GitHub starsGitHub forksGitHub commit activityGitHub issuesGitHub pullsContributorsGitHub license

TreeValue is a generalized tree-based data structure mainly developed by OpenDILab Contributors.

Almost all the operation can be supported in form of trees in a convenient way to simplify the structure processing when the calculation is tree-based.

Installation

You can simply install it with pip command line from the official PyPI site.

pip install treevalue

For more information about installation, you can refer to the installation guide.

Documentation

The detailed documentation are hosted on https://opendilab.github.io/treevalue.

Only english version is provided now, the chinese documentation is still under development.

Quick Start

You can easily create a tree value object based on FastTreeValue.

from treevalue import FastTreeValueif __name__ == '__main__':    t = FastTreeValue({        'a': 1,        'b': 2.3,        'x': {            'c': 'str',            'd': [1, 2, None],            'e': b'bytes',        }    })    print(t)

The result should be

<FastTreeValue 0x7f6c7df00160 keys: ['a', 'b', 'x']>├── 'a' --> 1├── 'b' --> 2.3└── 'x' --> <FastTreeValue 0x7f6c81150860 keys: ['c', 'd', 'e']>    ├── 'c' --> 'str'    ├── 'd' --> [1, 2, None]    └── 'e' --> b'bytes'

And t is structure should be like this

Not only a visible tree structure, but abundant operation supports is provided.You can just put objects (such as torch.Tensor, or any other types) here and justcall their methods, like this

import torchfrom treevalue import FastTreeValuet = FastTreeValue({    'a': torch.rand(2, 5),    'x': {        'c': torch.rand(3, 4),    }})print(t)# <FastTreeValue 0x7f8c069346a0># ├── a --> tensor([[0.3606, 0.2583, 0.3843, 0.8611, 0.5130],# │                 [0.0717, 0.1370, 0.1724, 0.7627, 0.7871]])# └── x --> <FastTreeValue 0x7f8ba6130f40>#     └── c --> tensor([[0.2320, 0.6050, 0.6844, 0.3609],#                       [0.0084, 0.0816, 0.8740, 0.3773],#                       [0.6523, 0.4417, 0.6413, 0.8965]])print(t.shape)  # property access# <FastTreeValue 0x7f8c06934ac0># ├── a --> torch.Size([2, 5])# └── x --> <FastTreeValue 0x7f8c069346d0>#     └── c --> torch.Size([3, 4])print(t.sin())  # method call# <FastTreeValue 0x7f8c06934b80># ├── a --> tensor([[0.3528, 0.2555, 0.3749, 0.7586, 0.4908],# │                 [0.0716, 0.1365, 0.1715, 0.6909, 0.7083]])# └── x --> <FastTreeValue 0x7f8c06934b20>#     └── c --> tensor([[0.2300, 0.5688, 0.6322, 0.3531],#                       [0.0084, 0.0816, 0.7669, 0.3684],#                       [0.6070, 0.4275, 0.5982, 0.7812]])print(t.reshape((2, -1)))  # method with arguments# <FastTreeValue 0x7f8c06934b80># ├── a --> tensor([[0.3606, 0.2583, 0.3843, 0.8611, 0.5130],# │                 [0.0717, 0.1370, 0.1724, 0.7627, 0.7871]])# └── x --> <FastTreeValue 0x7f8c06934b20>#     └── c --> tensor([[0.2320, 0.6050, 0.6844, 0.3609, 0.0084, 0.0816],#                       [0.8740, 0.3773, 0.6523, 0.4417, 0.6413, 0.8965]])print(t[:, 1:-1])  # index operator# <FastTreeValue 0x7f8ba5c8eca0># ├── a --> tensor([[0.2583, 0.3843, 0.8611],# │                 [0.1370, 0.1724, 0.7627]])# └── x --> <FastTreeValue 0x7f8ba5c8ebe0>#     └── c --> tensor([[0.6050, 0.6844],#                       [0.0816, 0.8740],#                       [0.4417, 0.6413]])print(1 + (t - 0.8) ** 2 * 1.5)  # math operators# <FastTreeValue 0x7fdfa5836b80># ├── a --> tensor([[1.6076, 1.0048, 1.0541, 1.3524, 1.0015],# │                 [1.0413, 1.8352, 1.2328, 1.7904, 1.0088]])# └── x --> <FastTreeValue 0x7fdfa5836880>#     └── c --> tensor([[1.1550, 1.0963, 1.3555, 1.2030],#                       [1.0575, 1.4045, 1.0041, 1.0638],#                       [1.0782, 1.0037, 1.5075, 1.0658]])

For more quick start explanation and further usage, take a look at:

Speed Performance

Here is the speed performance of all the operations in FastTreeValue, the following table is the performance comparison result with dm-tree.

flattenflatten(with path)mappingmapping(with path)
treevalue---511 ns ± 6.92 ns3.16 µs ± 42.8 ns1.58 µs ± 30 ns
flattenflatten_with_pathmap_structuremap_structure_with_path
dm-tree830 ns ± 8.53 ns11.9 µs ± 358 ns13.3 µs ± 87.2 ns62.9 µs ± 2.26 µs

The following 2 tables are the performance comparison result with jax pytree.

mappingmapping(with path)flattenunflattenflatten_valuesflatten_keys
treevalue2.21 µs ± 32.2 ns2.16 µs ± 123 ns515 ns ± 7.53 ns601 ns ± 5.99 ns301 ns ± 12.9 ns451 ns ± 17.3 ns
tree_map(Not Implemented)tree_flattentree_unflattentree_leavestree_structure
jax pytree4.67 µs ± 184 ns---1.29 µs ± 27.2 ns742 ns ± 5.82 ns1.29 µs ± 22 ns1.27 µs ± 16.5 ns
flatten + allflatten + reduceflatten + reduce(with init)rise(given structure)rise(automatic structure)
treevalue425 ns ± 9.33 ns702 ns ± 5.93 ns793 ns ± 13.4 ns9.14 µs ± 129 ns11.5 µs ± 182 ns
tree_alltree_reducetree_reduce(with init)tree_transpose(Not Implemented)
jax pytree1.47 µs ± 37 ns1.88 µs ± 27.2 ns1.91 µs ± 47.4 ns10 µs ± 117 ns---

This is the comparison between dm-tree, jax-libtree and us, with flatten and mapping operations (lower value means less time cost and runs faster)

Time cost of flatten operation

Time cost of mapping operation

The following table is the performance comparison result with tianshou Batch.

getsetinitdeepcopystackcatsplit
treevalue51.6 ns ± 0.609 ns64.4 ns ± 0.564 ns750 ns ± 14.2 ns88.9 µs ± 887 ns50.2 µs ± 771 ns40.3 µs ± 1.08 µs62 µs ± 1.2 µs
tianshou Batch43.2 ns ± 0.698 ns396 ns ± 8.99 ns11.1 µs ± 277 ns89 µs ± 1.42 µs119 µs ± 1.1 µs194 µs ± 1.81 µs653 µs ± 17.8 µs

And this is the comparison between tianshou Batch and us, with cat , stack and split operations (lower value means less time cost and runs faster)

Time cost of cat operation

Time cost of stack operation

Time cost of split operation

Test benchmark code can be found here:

Extension

If you need to translate treevalue object to runnable source code, you may use the potc-treevalue plugin with the installation command below

pip install potc-treevalue

Or just install it with treevalue itself

pip install treevalue[potc]

In potc, you can translate the objects to runnable python source code, which can be loaded to objects afterwards by the python interpreter, like the following graph

potc system

For more information, you can refer to

Contribution

We appreciate all contributions to improve treevalue, both logic and system designs. Please refer to CONTRIBUTING.md for more guides.

And users can join our slack communication channel, or contact the core developer HansBug for more detailed discussion.

License

treevalue released under the Apache 2.0 license.


鲜花

握手

雷人

路过

鸡蛋
该文章已有0人参与评论

请发表评论

全部评论

专题导读
热门推荐
热门话题
阅读排行榜

扫描微信二维码

查看手机版网站

随时了解更新最新资讯

139-2527-9053

在线客服(服务时间 9:00~18:00)

在线QQ客服
地址:深圳市南山区西丽大学城创智工业园
电邮:jeky_zhao#qq.com
移动电话:139-2527-9053

Powered by 互联科技 X3.4© 2001-2213 极客世界.|Sitemap