Learning vectorbt, Part 10: Portfolio Optimization

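Portfolio optimization requires some background knowledge; without it, it is hard to tell what this article is doing or what it is trying to achieve.
Background reading, "Markowitz portfolio model in practice, Chapter 3: Portfolio optimization, minimum variance and maximum Sharpe ratio": https://www.jianshu.com/p/400758e58768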

Random search for optimal weights#

Generating random weights#

import numpy as np

# This example uses 3 symbols, which is why every weight vector below has 3 entries
symbols = [
    '510050.XSHG', '510300.XSHG', '159901.XSHE'
]
num_tests = 2000  # number of random weight vectors (matches the printed length below)

np.random.seed(42)

# Generate random weights, num_tests times
weights = []
for i in range(num_tests):
    w = np.random.random_sample(len(symbols))
    w = w / np.sum(w)  # normalize so that each weight vector sums to 1
    weights.append(w)

print(len(weights))
# 2000

weights
# [array([0.18205878, 0.46212909, 0.35581214]),
#  array([0.65738127, 0.17132261, 0.17129612]),
#  array([0.03807826, 0.56784481, 0.39407693]),
#  array([0.41686469, 0.01211874, 0.57101657]),
#  array([0.67865488, 0.173111  , 0.14823412]),
#  array([0.18115758, 0.3005149 , 0.51832752]),
#  ...

Preparing the data#

# Build column hierarchy such that one weight corresponds to one price series
# The 3 price columns become 3 * num_tests = 3 * 2000 = 6000 columns
_price = price.vbt.tile(num_tests, keys=pd.Index(np.arange(num_tests), name='symbol_group'))
_price = _price.vbt.stack_index(pd.Index(np.concatenate(weights), name='weights'))

print(_price.columns)
# MultiIndex([( 0.18205877561639985,    0, '510050.XSHG'),
#             ( 0.46212908544657766,    0, '510300.XSHG'),
#             ...
#             ( 0.34668046300795724, 1999, '510300.XSHG'),
#             (  0.1067148038247113, 1999, '159901.XSHE')],
#            names=['weights', 'symbol_group', 'symbol'], length=6000)

Example of using tile

price.vbt.tile(3, keys=pd.Index(list('abc'), name='symbol_group'))
# Put simply: the original columns are copied 3 times, and the 3 copies are labelled a, b, c in the column MultiIndex


Example of using stack_index

num_tests = 3
tmpp = price.vbt.tile(num_tests, keys=pd.Index(np.arange(num_tests), name='symbol_group'))
weights = []
for i in range(3):
    w = np.random.random_sample(len(symbols))
    w = w / np.sum(w)
    weights.append(w)
print(np.concatenate(weights))
# [0.28035754 0.08989454 0.62974792 0.41627195 0.11541703 0.46831101 0.02859779 0.75619858 0.21520363]
tmpp.vbt.stack_index(pd.Index(np.concatenate(weights), name='weights'))
# Attaches the new pd.Index as an extra level on top of the existing column MultiIndex
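To see what the two steps do to the column index without running vectorbt, the sketch below rebuilds the same layout with plain pandas. It is only an approximation of the effect of `tile` and `stack_index`, not their implementation; the toy prices and the symbols `A`, `B`, `C` are made up for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical toy prices for three made-up symbols (not the article's data).
price = pd.DataFrame(
    [[1.0, 2.0, 3.0], [1.1, 2.1, 3.1]],
    columns=pd.Index(["A", "B", "C"], name="symbol"),
)
num_tests = 2
weights = [np.array([0.2, 0.3, 0.5]), np.array([0.6, 0.3, 0.1])]

# "tile": repeat the whole column block num_tests times and label each copy.
tiled_cols = pd.MultiIndex.from_product(
    [np.arange(num_tests), price.columns], names=["symbol_group", "symbol"]
)
tiled = pd.DataFrame(
    np.tile(price.values, (1, num_tests)), index=price.index, columns=tiled_cols
)

# "stack_index": add one more level on top, one weight per column.
stacked_cols = pd.MultiIndex.from_arrays(
    [
        np.concatenate(weights),
        tiled_cols.get_level_values("symbol_group"),
        tiled_cols.get_level_values("symbol"),
    ],
    names=["weights", "symbol_group", "symbol"],
)
stacked = tiled.copy()
stacked.columns = stacked_cols

print(stacked.columns.tolist())
```

Each column now carries its own weight, its group number, and its symbol, which is what lets a later `group_by='symbol_group'` treat every weight vector as one portfolio.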


Generating orders#

# Run simulation
pf = vbt.Portfolio.from_orders(
    close=_price,
    size=size,  # only the first row of size is filled in, so the portfolio is never rebalanced
    size_type='targetpercent',  # size holds each asset's target share of the group's total value;
    # every weight vector was normalized to sum to 1, so each group is fully invested.
    # Note: percentages of OrderContext.cash_now (when buying), OrderContext.position_now (when selling)
    # and OrderContext.free_cash_now (when short selling) describe size_type='percent', not 'targetpercent'.
    group_by='symbol_group',  # group the results by symbol_group
    cash_sharing=True
)  # all weights sum to 1, no shorting, and 100% investment in risky assets

print(len(pf.orders))
# 6000
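The `size` array passed above is not defined in this article. A minimal sketch of how it could be built, assuming it follows the same pattern as the `pyopt_size` array used later (weights in the first row, NaN everywhere else), is shown below with toy dimensions.

```python
import numpy as np

# Toy dimensions, not the article's 731 rows x 6000 columns.
num_rows, num_tests, num_symbols = 4, 2, 3
weights = [np.array([0.2, 0.3, 0.5]), np.array([0.6, 0.3, 0.1])]

# NaN is used as "do nothing on this bar"; only the first row carries target weights.
size = np.full((num_rows, num_tests * num_symbols), np.nan)
size[0, :] = np.concatenate(weights)

print(size)
```

Because every later row is NaN, the allocation is made once at the first timestamp and then simply held.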

Visualizing volatility against return#

annualized_return = pf.annualized_return()
# Using the case from the earlier screenshot as an example:
# a    2.273208
# b   -0.737391
# Name: annualized_return, dtype: float64

annualized_return.index = pf.annualized_volatility()
# Using the case from the earlier screenshot as an example, pf.annualized_volatility() is:
# a    0.090345
# b    0.091265
# Name: annualized_volatility, dtype: float64

# So these are two Series.
# After the assignment, annualized_return is a Series with index = volatility and values = return
annualized_return.vbt.scatterplot(
    trace_kwargs=dict(
        mode='markers',
        marker=dict(
            color=pf.sharpe_ratio(),
            colorbar=dict(
                title='sharpe_ratio'
            ),
            size=5,
            opacity=0.7
        )
    ),
    xaxis_title='annualized_volatility',
    yaxis_title='annualized_return'
).show_svg()


Retrieving the best portfolio#

# Get index of the best group according to the target metric
best_symbol_group = pf.sharpe_ratio().idxmax()

print(best_symbol_group)
# 400

print(pf.sharpe_ratio().max())
print(pf.sharpe_ratio().idxmax())
print(pf.sharpe_ratio()[pf.sharpe_ratio().idxmax()])
# 0.7277965995778561
# 400
# 0.7277965995778561

# Print best weights
print(weights[best_symbol_group])
# [0.94197268 0.03054375 0.02748357]

# Compute default stats
print(pf.iloc[best_symbol_group].stats())


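Monthly rebalancing (resetting to the initial weights)#

Computing returns#

With monthly rebalancing the target weights themselves never change, but the prices of the holdings do. Because size is interpreted as targetpercent, a target share of the group's total value, an asset that has risen since the last rebalancing day holds more than its target share by the time of the reset. Compared with simply holding the initial allocation, rebalancing therefore sells part of whatever rose the most and buys what lagged, so that each asset returns to its initial share of the portfolio.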
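A small worked example of this drift, in plain Python. All numbers (target weights, starting value, one-month price changes) are hypothetical and chosen only to make the arithmetic easy to follow.

```python
target = [0.5, 0.3, 0.2]                 # hypothetical target weights
value = 100.0                            # hypothetical starting portfolio value
holdings = [w * value for w in target]   # value held in each asset after the first allocation

growth = [1.20, 1.00, 0.90]              # hypothetical price change over one month
drifted = [h * g for h, g in zip(holdings, growth)]
total = sum(drifted)
drifted_weights = [d / total for d in drifted]

# Trade (in value terms) needed to bring each asset back to its target share of total value.
trades = [w * total - d for w, d in zip(target, drifted)]

print([round(w, 4) for w in drifted_weights])
print([round(t, 2) for t in trades])
```

The first asset rose, so its share drifts above 0.5 and its trade is negative (a sale); the other two are bought, and the trades net to zero because rebalancing only moves value between assets.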

# Select the first index of each month
rb_mask = ~_price.index.to_period('m').duplicated()

print(rb_mask.sum())
# 36, i.e. the data covers 36 months

How should this part be read?

_price.index.to_period('m')
# =>
# PeriodIndex(['2017-01', '2017-01', '2017-01', '2017-01', '2017-01', '2017-01',
#              '2017-01', '2017-01', '2017-01', '2017-01',
#              ...
#              '2019-12', '2019-12', '2019-12', '2019-12', '2019-12', '2019-12',
#              '2019-12', '2019-12', '2019-12', '2019-12'],
#             dtype='period[M]', name='date', length=731)

_price.index.to_period('m').duplicated()
# array([False,  True,  True,  True,  True,  True,  True,  True,  True,
#         True,  True,  True,  True,  True,  True,  True,  True,  True,
#        False,  True,  True,  True,  True,  True,  True,  True,  True,
#         True,  True,  True,  True,  True,  True,  True,  True,  True,
#        False,  True,  True,  True,  True,  True,  True,  True,  True,
#         True,  True,  True,  True,  True,  True,  True,  True,  True,
#         True,  True,  True,  True,  True, False,  True,  True,  True,
#         True,  True,  True,  True,  True,  True,  True,  True,  True,
#         ...
# 731 entries in total; each False marks the first row of a month

~_price.index.to_period('m').duplicated()
# After negation, the first row of each month changes from False to True
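The same trick on a tiny index makes the result easy to check by eye. The dates below are hypothetical; only `to_period` and `duplicated` are taken from the article's code.

```python
import pandas as pd

# Hypothetical daily index spanning three calendar months.
index = pd.date_range("2017-01-30", "2017-03-02", freq="D")

months = index.to_period("M")     # every timestamp mapped to its calendar month
rb_mask = ~months.duplicated()    # True only at the first row that falls in each month

print(index[rb_mask])
```

Note that the mask picks the first row present in the data for each month, which is not necessarily the first calendar day (here, January starts on the 30th).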

Resetting the weights on rebalancing days

rb_size = np.full_like(_price, np.nan)
rb_size[rb_mask, :] = np.concatenate(weights)  # allocate at mask: reset the weights on every rebalancing day

print(rb_size.shape)
# (731, 6000)
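The masked assignment relies on NumPy broadcasting: the flat weight vector is written into every row selected by the mask. A toy sketch with a hypothetical 5-row mask and two made-up weight vectors:

```python
import numpy as np

rb_mask = np.array([True, False, False, True, False])    # toy mask: rows 0 and 3 are rebalancing days
flat_weights = np.array([0.2, 0.3, 0.5, 0.6, 0.3, 0.1])  # two hypothetical weight vectors, concatenated

rb_size = np.full((len(rb_mask), len(flat_weights)), np.nan)
# The 1-D weight vector is broadcast to each selected row; other rows stay NaN.
rb_size[rb_mask, :] = flat_weights

print(rb_size)
```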

Recomputing returns with rebalancing

# Run simulation, with rebalancing monthly
rb_pf = vbt.Portfolio.from_orders(
    close=_price,
    size=rb_size,
    size_type='targetpercent',
    group_by='symbol_group',
    cash_sharing=True,
    call_seq='auto'  # important: sell before buy
)

print(len(rb_pf.orders))
# 216000  (36 rebalancing days x 6000 columns)

rb_best_symbol_group = rb_pf.sharpe_ratio().idxmax()

print(rb_best_symbol_group)
print(weights[rb_best_symbol_group])
# 400
# [0.94197268 0.03054375 0.02748357]

print(rb_pf.iloc[rb_best_symbol_group].stats())


Visualizing the weights#

def plot_allocation(rb_pf):
    # Plot weights development of the portfolio
    rb_asset_value = rb_pf.asset_value(group_by=False)
    rb_value = rb_pf.value()
    # Rows on which at least one asset was traded, i.e. the rebalancing days
    rb_idxs = np.flatnonzero((rb_pf.asset_flow() != 0).any(axis=1))
    rb_dates = rb_pf.wrapper.index[rb_idxs]
    fig = (rb_asset_value.vbt / rb_value).vbt.plot(
        trace_names=symbols,
        trace_kwargs=dict(
            stackgroup='one'
        )
    )
    # Draw a vertical line at every rebalancing day
    for rb_date in rb_dates:
        fig.add_shape(
            dict(
                xref='x',
                yref='paper',
                x0=rb_date,
                x1=rb_date,
                y0=0,
                y1=1,
                line_color=fig.layout.template.layout.plot_bgcolor
            )
        )
    fig.show_svg()

plot_allocation(rb_pf.iloc[rb_best_symbol_group])  # best group


Search and rebalance every 30 days#

from numba import njit
from vectorbt.generic.nb import nanmean_nb
from vectorbt.portfolio.nb import order_nb, sort_call_seq_nb
from vectorbt.portfolio.enums import SizeType, Direction

srb_sharpe = np.full(price.shape[0], np.nan)

@njit
def pre_sim_func_nb(c, every_nth):
    # Define rebalancing days
    c.segment_mask[:, :] = False
    c.segment_mask[every_nth::every_nth, :] = True
    return ()

@njit
def find_weights_nb(c, price, num_tests):
    # Find optimal weights based on best Sharpe ratio
    returns = (price[1:] - price[:-1]) / price[:-1]
    returns = returns[1:, :]  # cannot compute np.cov with NaN
    mean = nanmean_nb(returns)
    cov = np.cov(returns, rowvar=False)  # masked arrays not supported by Numba (yet)
    best_sharpe_ratio = -np.inf
    weights = np.full(c.group_len, np.nan, dtype=np.float64)  # np.float_ was removed in NumPy 2.0

    for i in range(num_tests):
        # Generate weights
        w = np.random.random_sample(c.group_len)
        w = w / np.sum(w)

        # Compute annualized mean, covariance, and Sharpe ratio
        # (ann_factor is read from the global defined below)
        p_return = np.sum(mean * w) * ann_factor
        p_std = np.sqrt(np.dot(w.T, np.dot(cov, w))) * np.sqrt(ann_factor)
        sharpe_ratio = p_return / p_std
        if sharpe_ratio > best_sharpe_ratio:
            best_sharpe_ratio = sharpe_ratio
            weights = w

    return best_sharpe_ratio, weights

@njit
def pre_segment_func_nb(c, find_weights_nb, history_len, ann_factor, num_tests, srb_sharpe):
    if history_len == -1:
        # Look back at the entire time period
        close = c.close[:c.i, c.from_col:c.to_col]
    else:
        # Look back at a fixed time period
        if c.i - history_len <= 0:
            return (np.full(c.group_len, np.nan),)  # insufficient data
        close = c.close[c.i - history_len:c.i, c.from_col:c.to_col]

    # Find optimal weights
    best_sharpe_ratio, weights = find_weights_nb(c, close, num_tests)
    srb_sharpe[c.i] = best_sharpe_ratio

    # Update valuation price and reorder orders
    size_type = SizeType.TargetPercent
    direction = Direction.LongOnly
    order_value_out = np.empty(c.group_len, dtype=np.float64)
    for k in range(c.group_len):
        col = c.from_col + k
        c.last_val_price[col] = c.close[c.i, col]
    sort_call_seq_nb(c, weights, size_type, direction, order_value_out)

    return (weights,)

@njit
def order_func_nb(c, weights):
    col_i = c.call_seq_now[c.call_idx]
    return order_nb(
        weights[col_i],
        c.close[c.i, c.col],
        size_type=SizeType.TargetPercent
    )

# returns is not defined in this article; it is assumed to hold the per-bar returns of price, e.g. price.pct_change()
ann_factor = returns.vbt.returns.ann_factor

# Run simulation using a custom order function
srb_pf = vbt.Portfolio.from_order_func(
    price,
    order_func_nb,
    pre_sim_func_nb=pre_sim_func_nb,
    pre_sim_args=(30,),
    pre_segment_func_nb=pre_segment_func_nb,
    pre_segment_args=(find_weights_nb, -1, ann_factor, num_tests, srb_sharpe),
    cash_sharing=True,
    group_by=True
)

# Plot best Sharpe ratio at each rebalancing day
pd.Series(srb_sharpe, index=price.index).vbt.scatterplot(trace_kwargs=dict(mode='markers')).show_svg()

print(srb_pf.stats())
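Stripped of the Numba and simulation-context plumbing, the weight search itself is a short loop. The sketch below is a plain NumPy version of the same idea, under stated assumptions: the price paths are synthetic random walks, `ann_factor=252` is a hypothetical choice, the risk-free rate is taken as 0, and the function name `find_weights` is made up here.

```python
import numpy as np

def find_weights(price, num_tests, ann_factor, rng):
    """Random search for the weight vector with the highest in-sample Sharpe ratio."""
    returns = price[1:] / price[:-1] - 1.0            # simple returns, shape (T-1, n)
    mean = returns.mean(axis=0)
    cov = np.cov(returns, rowvar=False)
    best_sharpe, best_w = -np.inf, None
    for _ in range(num_tests):
        w = rng.random(price.shape[1])
        w = w / w.sum()                               # long-only, fully invested
        p_return = np.sum(mean * w) * ann_factor
        p_std = np.sqrt(w @ cov @ w) * np.sqrt(ann_factor)
        sharpe = p_return / p_std                     # risk-free rate taken as 0
        if sharpe > best_sharpe:
            best_sharpe, best_w = sharpe, w
    return best_sharpe, best_w

rng = np.random.default_rng(42)
# Hypothetical price paths: three geometric random walks of 250 bars.
price = 100 * np.cumprod(1 + rng.normal(0.0005, 0.01, size=(250, 3)), axis=0)

best_sharpe, best_w = find_weights(price, num_tests=500, ann_factor=252, rng=rng)
print(best_sharpe, best_w)
```

In the simulation above this search is rerun on every rebalancing bar using only the prices seen so far, so the weights can change from one rebalancing day to the next.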

from_order_func (somewhat complex, skip for now)#

First get a clear picture of from_order_func; see: https://vectorbt.dev/api/portfolio/base/#vectorbt.portfolio.base.Portfolio.from_order_func

# Run simulation using a custom order function
srb_pf = vbt.Portfolio.from_order_func(
    price,  # price data
    order_func_nb,  # order generation function
    pre_sim_func_nb=pre_sim_func_nb,  # Function called before simulation. Defaults to no_pre_func_nb().
    pre_sim_args=(30,),  # Packed arguments passed to pre_sim_func_nb. Defaults to ().
    pre_segment_func_nb=pre_segment_func_nb,  # Function called before each segment. Defaults to no_pre_func_nb().
    pre_segment_args=(find_weights_nb, -1, ann_factor, num_tests, srb_sharpe),  # Packed arguments passed to pre_segment_func_nb. Defaults to ().
    cash_sharing=True,  # Whether to share cash within the same group.
                        # If group_by is None, group_by becomes True to form a single group with cash sharing.
    group_by=True
)

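Several important callbacks of from_order_func that are easy to confuse

order_func_nb: callable
    Order generation function.
post_order_func_nb: callable
    Callback called after an order has been processed.

pre_sim_func_nb / post_sim_func_nb: callable
    Functions called before / after the simulation. pre_sim_func_nb defaults to no_pre_func_nb().

pre_group_func_nb / post_group_func_nb: callable
    Functions called before / after each group. pre_group_func_nb defaults to no_pre_func_nb().
    Called only if row_wise is False.

pre_row_func_nb / post_row_func_nb: callable
    Functions called before / after each row. pre_row_func_nb defaults to no_pre_func_nb().
    Called only if row_wise is True.

pre_segment_func_nb / post_segment_func_nb: callable
    Functions called before / after each segment. pre_segment_func_nb defaults to no_pre_func_nb().
    A segment is the intersection of a group and a row: an entity that defines how and in what order the elements within the same group and row are processed.
    segment_mask: int or array_like of bool
        Mask of whether a particular segment should be executed.
        Supplying an integer activates every n-th row. Supplying a boolean or an array of booleans broadcasts to the number of rows and groups.
        Does not broadcast together with close and broadcast_named_args, only against the final shape.
    call_pre_segment: bool
        Whether to call pre_segment_func_nb regardless of segment_mask.
    call_post_segment: bool
        Whether to call post_segment_func_nb regardless of segment_mask.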
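The nesting of these callbacks is easier to see in a toy loop. The sketch below is not vectorbt's implementation: it only records, for the column-major case (row_wise is False), the order in which callbacks with these names would fire, and it issues one order per segment instead of one per column. The function `simulate` and its log strings are invented for illustration.

```python
def simulate(n_rows, n_groups, segment_mask,
             call_pre_segment=False, call_post_segment=False):
    """Toy loop that records which callback would fire, and in what order."""
    log = ["pre_sim"]
    for g in range(n_groups):
        log.append(f"pre_group g={g}")
        for i in range(n_rows):
            active = segment_mask[i][g]
            if call_pre_segment or active:
                log.append(f"pre_segment g={g} i={i}")
            if active:
                log.append(f"order g={g} i={i}")
                log.append(f"post_order g={g} i={i}")
            if call_post_segment or active:
                log.append(f"post_segment g={g} i={i}")
        log.append(f"post_group g={g}")
    log.append("post_sim")
    return log

# One group, four rows; only row 2 is active, as with segment_mask[every_nth::every_nth] and every_nth=2.
mask = [[False], [False], [True], [False]]
log = simulate(4, 1, mask)
for line in log:
    print(line)
```

With the mask above, rows 0, 1 and 3 produce no segment or order callbacks at all, which is how the 30-day rebalancing example restricts the weight search to rebalancing bars.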

Running the simplest official demo

import numpy as np
import pandas as pd
from datetime import datetime
from numba import njit

import vectorbt as vbt
from vectorbt.utils.colors import adjust_opacity
from vectorbt.utils.enum_ import map_enum_fields
from vectorbt.base.reshape_fns import broadcast, flex_select_auto_nb, to_2d_array
from vectorbt.portfolio.enums import SizeType, Direction, NoOrder, OrderStatus, OrderSide
from vectorbt.portfolio import nb

@njit
def order_func_nb(c, size):
    return nb.order_nb(size=size)

close = pd.Series([1, 2, 3, 4, 5])
pf = vbt.Portfolio.from_order_func(close, order_func_nb, 10)

nb.order_nb(size=5)  # returns an Order object, so order_func_nb can be seen as an order constructor that produces a series of orders
# Order(size=5.0, price=inf, size_type=0, direction=2, fees=0.0, fixed_fees=0.0, slippage=0.0, min_size=0.0, max_size=inf, size_granularity=nan, reject_prob=0.0, lock_cash=False, allow_partial=True, raise_reject=False, log=False)

print(pf.assets())
print(pf.cash())
# 0    10.0
# 1    20.0
# 2    30.0
# 3    40.0
# 4    40.0
# dtype: float64
# 0    90.0
# 1    70.0
# 2    40.0
# 3     0.0
# 4     0.0
# dtype: float64

Reading the output: every bar tries to buy 10 shares, at prices 1, 2, 3, 4, 5. The trades and the cash they consume are:
buy: 10 * 1 (-10)
buy: 10 * 2 (-20)
buy: 10 * 3 (-30)
buy: 10 * 4 (-40)
assets is the number of shares held. It grows by 10 per bar and stops at 40 because the cash has run out.
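The arithmetic can be checked with a few lines of plain Python. This is a toy reimplementation for this one case, not how vectorbt fills orders; the starting cash of 100 is an assumption that matches the printed cash of 90 after the first purchase, and the helper name `buy_fixed_size` is made up.

```python
def buy_fixed_size(prices, size, init_cash):
    """Buy `size` units on every bar while cash allows; no fees, no partial fills in this toy."""
    cash, assets = init_cash, 0.0
    asset_hist, cash_hist = [], []
    for p in prices:
        cost = size * p
        if cost <= cash:
            cash -= cost
            assets += size
        asset_hist.append(assets)
        cash_hist.append(cash)
    return asset_hist, cash_hist

assets, cash = buy_fixed_size([1, 2, 3, 4, 5], size=10, init_cash=100.0)
print(assets)  # [10.0, 20.0, 30.0, 40.0, 40.0]
print(cash)    # [90.0, 70.0, 40.0, 0.0, 0.0]
```

The four purchases cost 10 + 20 + 30 + 40 = 100, which uses up the starting cash exactly, so the fifth order cannot be filled.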

Efficient frontier method (PyPortfolioOpt)#

# Calculate expected returns and sample covariance matrix
avg_returns = expected_returns.mean_historical_return(price)
# avg_returns:
# symbol
# 510050.XSHG    0.135305
# 510300.XSHG    0.098036
# 159901.XSHE    0.096895
# dtype: float64

cov_mat = risk_models.sample_cov(price)  # covariance matrix


# Get weights maximizing the Sharpe ratio
ef = EfficientFrontier(avg_returns, cov_mat)
weights = ef.max_sharpe()
weights
# OrderedDict([('510050.XSHG', 1.0), ('510300.XSHG', 0.0), ('159901.XSHE', 0.0)])

clean_weights = ef.clean_weights()
clean_weights
# OrderedDict([('510050.XSHG', 1.0), ('510300.XSHG', 0.0), ('159901.XSHE', 0.0)])

pyopt_weights = np.array([clean_weights[symbol] for symbol in symbols])
print(pyopt_weights)
# [1. 0. 0.]
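For reference, the optimization behind a maximum-Sharpe portfolio can be written in its textbook form as below. Here $\mu$ stands for the expected returns (avg_returns), $\Sigma$ for the covariance matrix (cov_mat) and $r_f$ for the risk-free rate; the long-only bounds $0 \le w_i \le 1$ are an assumption about the optimizer's settings, and the exact defaults depend on the PyPortfolioOpt version in use.

```latex
\max_{w}\ \frac{w^\top \mu - r_f}{\sqrt{w^\top \Sigma\, w}}
\quad \text{s.t.} \quad \sum_i w_i = 1, \qquad 0 \le w_i \le 1
```

Under such bounds a corner solution like [1, 0, 0] is possible: when one asset dominates on return and the others add little diversification, the optimizer can put the whole portfolio into it.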

Filling in the initial weights

pyopt_size = np.full_like(price, np.nan)
pyopt_size[0, :] = pyopt_weights  # allocate at first timestamp, do nothing afterwards

print(pyopt_size[:5])
print(pyopt_size.shape)
# [[ 1.  0.  0.]
#  [nan nan nan]
#  [nan nan nan]
#  [nan nan nan]
#  [nan nan nan]]
# (731, 3)

Backtest with a single trade at initialization

# Run simulation with weights from PyPortfolioOpt
pyopt_pf = vbt.Portfolio.from_orders(
    close=price,
    size=pyopt_size,
    size_type='targetpercent',
    group_by=True,
    cash_sharing=True
)

print(len(pyopt_pf.orders))
# 1

# Performance statistics
print(pyopt_pf.stats())
# Start                         2017-01-03 00:00:00+00:00
# End                           2019-12-31 00:00:00+00:00
# Period                                731 days 00:00:00
# Start Value                                       100.0
# End Value                                    144.428008
# Total Return [%]                              44.428008
# Benchmark Return [%]                           35.42267
# Max Gross Exposure [%]                            100.0
# Total Fees Paid                                     0.0
# Max Drawdown [%]                               29.64467
# Max Drawdown Duration                 462 days 00:00:00
# Total Trades                                          1
# Total Closed Trades                                   0
# Total Open Trades                                     1
# Open Trade PnL                                44.428008
# Win Rate [%]                                        NaN
# Best Trade [%]                                      NaN
# Worst Trade [%]                                     NaN
# Avg Winning Trade [%]                               NaN
# Avg Losing Trade [%]                                NaN
# Avg Winning Trade Duration                          NaT
# Avg Losing Trade Duration                           NaT
# Profit Factor                                       NaN
# Expectancy                                          NaN
# Sharpe Ratio                                   0.735009
# Calmar Ratio                                   0.455758
# Omega Ratio                                     1.14102
# Sortino Ratio                                  1.082667
# Name: group, dtype: object

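Monthly rebalancing with the efficient frontier#

The original notebook includes this note:

You can't use third-party optimization packages within Numba (yet).
Here you have two choices:
1) Use os.environ['NUMBA_DISABLE_JIT'] = '1' before all imports to disable Numba completely
2) Disable Numba for the function, but also for every other function in the stack that calls it
We will demonstrate the second option.

The note does not say why. A likely reason is that Numba's compiled (nopython) mode supports only a subset of Python and NumPy, so calls into an arbitrary library such as PyPortfolioOpt cannot be compiled.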

The weight-finding function is rewritten

def pyopt_find_weights(sc, price, num_tests):  # no @njit decorator = it's a pure Python function
    # Calculate expected returns and sample covariance matrix
    price = pd.DataFrame(price, columns=symbols)
    avg_returns = expected_returns.mean_historical_return(price)
    cov_mat = risk_models.sample_cov(price)

    # Get weights maximizing the Sharpe ratio
    ef = EfficientFrontier(avg_returns, cov_mat)
    weights = ef.max_sharpe()
    clean_weights = ef.clean_weights()
    weights = np.array([clean_weights[symbol] for symbol in symbols])
    best_sharpe_ratio = base_optimizer.portfolio_performance(weights, avg_returns, cov_mat)[2]

    return best_sharpe_ratio, weights

Computing the portfolio returns

pyopt_srb_sharpe = np.full(price.shape[0], np.nan)

# Run simulation with a custom order function
pyopt_srb_pf = vbt.Portfolio.from_order_func(
    price,
    order_func_nb,
    pre_sim_func_nb=pre_sim_func_nb,
    pre_sim_args=(30,),
    pre_segment_func_nb=pre_segment_func_nb.py_func,  # run pre_segment_func_nb as pure Python function
    pre_segment_args=(pyopt_find_weights, -1, ann_factor, num_tests, pyopt_srb_sharpe),
    cash_sharing=True,
    group_by=True,
    use_numba=False  # run simulate_nb as pure Python function
)

Visualizing the Sharpe ratio

pd.Series(pyopt_srb_sharpe, index=price.index).vbt.scatterplot(trace_kwargs=dict(mode='markers')).show_svg()


Performance evaluation

print(pyopt_srb_pf.stats())
# Start                           2017-01-03 00:00:00+00:00
# End                             2019-12-31 00:00:00+00:00
# Period                                  731 days 00:00:00
# Start Value                                         100.0
# End Value                                      130.474091
# Total Return [%]                                30.474091
# Benchmark Return [%]                             35.42267
# Max Gross Exposure [%]                              100.0
# Total Fees Paid                                       0.0
# Max Drawdown [%]                                  31.0145
# Max Drawdown Duration                   471 days 00:00:00
# Total Trades                                           13
# Total Closed Trades                                    12
# Total Open Trades                                       1
# Open Trade PnL                                  26.174785
# Win Rate [%]                                    58.333333
# Best Trade [%]                                  23.399167
# Worst Trade [%]                                -11.947833
# Avg Winning Trade [%]                            8.563981
# Avg Losing Trade [%]                            -4.049498
# Avg Winning Trade Duration    107 days 03:25:42.857142856
# Avg Losing Trade Duration                78 days 00:00:00
# Profit Factor                                    1.788768
# Expectancy                                       0.358275
# Sharpe Ratio                                     0.563374
# Calmar Ratio                                     0.309651
# Omega Ratio                                      1.108617
# Sortino Ratio                                    0.820358
# Name: group, dtype: object

Visualizing the weights

plot_allocation(pyopt_srb_pf)


Appendix#

Portfolio.from_orders#

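Reference: https://vectorbt.dev/api/portfolio/base/#from-orders

Note that a size sequence such as 1, -1, 1, -1 has to be read together with direction, because the same values produce different buy and sell orders under different directions. With direction shortonly, size=1 means selling.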

from_signals#

Reference: https://vectorbt.dev/api/portfolio/base/#from-signals

黄金矿工
