Amino acid dipepetide frequency for Apis mellifera associated microvirus 23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.874AlaAla: 7.874 ± 3.284
2.147AlaCys: 2.147 ± 1.462
6.442AlaAsp: 6.442 ± 2.146
6.442AlaGlu: 6.442 ± 2.15
4.295AlaPhe: 4.295 ± 0.887
7.158AlaGly: 7.158 ± 3.916
1.432AlaHis: 1.432 ± 0.71
4.295AlaIle: 4.295 ± 0.952
3.579AlaLys: 3.579 ± 1.539
3.579AlaLeu: 3.579 ± 0.971
0.0AlaMet: 0.0 ± 0.0
3.579AlaAsn: 3.579 ± 1.235
2.147AlaPro: 2.147 ± 1.042
5.011AlaGln: 5.011 ± 2.594
6.442AlaArg: 6.442 ± 1.018
5.727AlaSer: 5.727 ± 1.109
5.011AlaThr: 5.011 ± 2.007
7.158AlaVal: 7.158 ± 2.265
0.716AlaTrp: 0.716 ± 0.644
0.716AlaTyr: 0.716 ± 0.495
0.0AlaXaa: 0.0 ± 0.0
Cys
2.147CysAla: 2.147 ± 0.707
0.0CysCys: 0.0 ± 0.0
0.716CysAsp: 0.716 ± 0.74
0.716CysGlu: 0.716 ± 0.644
0.716CysPhe: 0.716 ± 0.495
0.716CysGly: 0.716 ± 0.644
0.0CysHis: 0.0 ± 0.0
0.716CysIle: 0.716 ± 0.644
0.0CysLys: 0.0 ± 0.0
2.147CysLeu: 2.147 ± 1.058
0.0CysMet: 0.0 ± 0.0
2.147CysAsn: 2.147 ± 1.008
0.716CysPro: 0.716 ± 0.787
0.0CysGln: 0.0 ± 0.0
0.716CysArg: 0.716 ± 0.644
0.716CysSer: 0.716 ± 0.644
0.716CysThr: 0.716 ± 0.644
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.716CysTyr: 0.716 ± 0.495
0.0CysXaa: 0.0 ± 0.0
Asp
4.295AspAla: 4.295 ± 1.098
0.716AspCys: 0.716 ± 0.644
6.442AspAsp: 6.442 ± 2.302
5.011AspGlu: 5.011 ± 1.992
6.442AspPhe: 6.442 ± 2.429
2.147AspGly: 2.147 ± 1.058
1.432AspHis: 1.432 ± 0.518
3.579AspIle: 3.579 ± 1.235
3.579AspLys: 3.579 ± 2.204
4.295AspLeu: 4.295 ± 1.451
0.0AspMet: 0.0 ± 0.0
5.011AspAsn: 5.011 ± 1.198
2.863AspPro: 2.863 ± 1.363
2.147AspGln: 2.147 ± 1.042
2.147AspArg: 2.147 ± 0.782
2.863AspSer: 2.863 ± 1.784
2.863AspThr: 2.863 ± 1.581
3.579AspVal: 3.579 ± 0.999
0.0AspTrp: 0.0 ± 0.0
3.579AspTyr: 3.579 ± 1.665
0.0AspXaa: 0.0 ± 0.0
Glu
7.158GluAla: 7.158 ± 1.47
0.716GluCys: 0.716 ± 0.644
2.147GluAsp: 2.147 ± 1.042
1.432GluGlu: 1.432 ± 0.518
5.011GluPhe: 5.011 ± 0.768
2.863GluGly: 2.863 ± 2.101
1.432GluHis: 1.432 ± 0.518
5.727GluIle: 5.727 ± 1.786
5.011GluLys: 5.011 ± 3.926
5.011GluLeu: 5.011 ± 0.813
0.0GluMet: 0.0 ± 0.0
1.432GluAsn: 1.432 ± 1.573
0.716GluPro: 0.716 ± 0.495
2.863GluGln: 2.863 ± 0.814
2.863GluArg: 2.863 ± 1.229
2.863GluSer: 2.863 ± 1.413
0.716GluThr: 0.716 ± 0.787
2.147GluVal: 2.147 ± 1.042
0.716GluTrp: 0.716 ± 0.495
4.295GluTyr: 4.295 ± 2.941
0.0GluXaa: 0.0 ± 0.0
Phe
4.295PheAla: 4.295 ± 1.658
0.0PheCys: 0.0 ± 0.0
5.727PheAsp: 5.727 ± 1.82
2.863PheGlu: 2.863 ± 1.148
3.579PhePhe: 3.579 ± 2.022
2.863PheGly: 2.863 ± 1.981
0.0PheHis: 0.0 ± 0.0
2.147PheIle: 2.147 ± 0.691
3.579PheLys: 3.579 ± 1.316
2.863PheLeu: 2.863 ± 0.705
1.432PheMet: 1.432 ± 0.687
2.863PheAsn: 2.863 ± 1.036
0.0PhePro: 0.0 ± 0.0
2.147PheGln: 2.147 ± 0.549
4.295PheArg: 4.295 ± 1.871
3.579PheSer: 3.579 ± 2.106
2.863PheThr: 2.863 ± 1.981
3.579PheVal: 3.579 ± 1.231
2.147PheTrp: 2.147 ± 0.782
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.727GlyAla: 5.727 ± 4.382
0.0GlyCys: 0.0 ± 0.0
3.579GlyAsp: 3.579 ± 1.231
2.863GlyGlu: 2.863 ± 0.909
2.147GlyPhe: 2.147 ± 0.707
2.863GlyGly: 2.863 ± 1.473
0.0GlyHis: 0.0 ± 0.0
2.147GlyIle: 2.147 ± 0.691
3.579GlyLys: 3.579 ± 1.251
5.727GlyLeu: 5.727 ± 1.507
0.716GlyMet: 0.716 ± 0.639
2.147GlyAsn: 2.147 ± 0.707
3.579GlyPro: 3.579 ± 1.272
2.147GlyGln: 2.147 ± 0.782
2.863GlyArg: 2.863 ± 1.036
5.727GlySer: 5.727 ± 2.069
4.295GlyThr: 4.295 ± 1.799
4.295GlyVal: 4.295 ± 1.531
1.432GlyTrp: 1.432 ± 1.051
4.295GlyTyr: 4.295 ± 1.49
0.0GlyXaa: 0.0 ± 0.0
His
1.432HisAla: 1.432 ± 1.288
0.0HisCys: 0.0 ± 0.0
1.432HisAsp: 1.432 ± 0.518
0.716HisGlu: 0.716 ± 0.644
2.147HisPhe: 2.147 ± 0.782
2.863HisGly: 2.863 ± 1.139
0.0HisHis: 0.0 ± 0.0
0.716HisIle: 0.716 ± 0.495
0.716HisLys: 0.716 ± 0.495
3.579HisLeu: 3.579 ± 1.176
0.716HisMet: 0.716 ± 0.495
0.716HisAsn: 0.716 ± 0.74
0.716HisPro: 0.716 ± 0.644
0.716HisGln: 0.716 ± 0.639
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.716HisVal: 0.716 ± 0.495
0.716HisTrp: 0.716 ± 0.644
0.716HisTyr: 0.716 ± 0.644
0.0HisXaa: 0.0 ± 0.0
Ile
4.295IleAla: 4.295 ± 1.329
0.0IleCys: 0.0 ± 0.0
0.716IleAsp: 0.716 ± 0.495
3.579IleGlu: 3.579 ± 1.245
2.863IlePhe: 2.863 ± 1.148
5.727IleGly: 5.727 ± 1.269
1.432IleHis: 1.432 ± 0.976
3.579IleIle: 3.579 ± 0.971
0.716IleLys: 0.716 ± 0.495
2.147IleLeu: 2.147 ± 0.782
1.432IleMet: 1.432 ± 0.682
2.863IleAsn: 2.863 ± 1.088
3.579IlePro: 3.579 ± 1.088
1.432IleGln: 1.432 ± 0.71
4.295IleArg: 4.295 ± 1.424
1.432IleSer: 1.432 ± 0.976
1.432IleThr: 1.432 ± 0.976
1.432IleVal: 1.432 ± 0.71
0.716IleTrp: 0.716 ± 0.495
2.863IleTyr: 2.863 ± 1.088
0.0IleXaa: 0.0 ± 0.0
Lys
3.579LysAla: 3.579 ± 2.041
0.716LysCys: 0.716 ± 0.644
2.863LysAsp: 2.863 ± 1.363
3.579LysGlu: 3.579 ± 1.723
2.863LysPhe: 2.863 ± 1.674
0.716LysGly: 0.716 ± 0.495
2.147LysHis: 2.147 ± 1.932
3.579LysIle: 3.579 ± 2.204
10.021LysLys: 10.021 ± 4.87
7.158LysLeu: 7.158 ± 2.517
2.147LysMet: 2.147 ± 1.157
2.147LysAsn: 2.147 ± 1.324
5.011LysPro: 5.011 ± 1.607
2.863LysGln: 2.863 ± 1.413
6.442LysArg: 6.442 ± 1.861
6.442LysSer: 6.442 ± 2.094
2.147LysThr: 2.147 ± 0.994
2.863LysVal: 2.863 ± 1.937
0.716LysTrp: 0.716 ± 0.644
2.147LysTyr: 2.147 ± 0.795
0.0LysXaa: 0.0 ± 0.0
Leu
7.158LeuAla: 7.158 ± 1.198
1.432LeuCys: 1.432 ± 1.202
3.579LeuAsp: 3.579 ± 1.046
4.295LeuGlu: 4.295 ± 2.005
3.579LeuPhe: 3.579 ± 1.231
7.874LeuGly: 7.874 ± 1.502
0.0LeuHis: 0.0 ± 0.0
1.432LeuIle: 1.432 ± 0.991
2.147LeuLys: 2.147 ± 0.795
4.295LeuLeu: 4.295 ± 1.49
3.579LeuMet: 3.579 ± 1.291
6.442LeuAsn: 6.442 ± 2.117
6.442LeuPro: 6.442 ± 2.146
7.874LeuGln: 7.874 ± 1.734
2.147LeuArg: 2.147 ± 0.549
2.147LeuSer: 2.147 ± 1.445
4.295LeuThr: 4.295 ± 1.312
1.432LeuVal: 1.432 ± 0.991
0.716LeuTrp: 0.716 ± 0.644
1.432LeuTyr: 1.432 ± 1.123
0.0LeuXaa: 0.0 ± 0.0
Met
2.147MetAla: 2.147 ± 1.257
0.0MetCys: 0.0 ± 0.0
1.432MetAsp: 1.432 ± 0.518
0.716MetGlu: 0.716 ± 0.787
0.716MetPhe: 0.716 ± 0.824
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
2.147MetLys: 2.147 ± 1.736
2.147MetLeu: 2.147 ± 1.026
1.432MetMet: 1.432 ± 0.892
1.432MetAsn: 1.432 ± 1.278
0.0MetPro: 0.0 ± 0.0
2.147MetGln: 2.147 ± 0.953
2.147MetArg: 2.147 ± 1.445
2.147MetSer: 2.147 ± 1.045
1.432MetThr: 1.432 ± 0.682
0.716MetVal: 0.716 ± 0.639
0.716MetTrp: 0.716 ± 0.495
1.432MetTyr: 1.432 ± 0.518
0.0MetXaa: 0.0 ± 0.0
Asn
2.147AsnAla: 2.147 ± 0.761
0.0AsnCys: 0.0 ± 0.0
7.158AsnAsp: 7.158 ± 2.556
2.863AsnGlu: 2.863 ± 1.122
0.716AsnPhe: 0.716 ± 0.824
2.863AsnGly: 2.863 ± 1.33
1.432AsnHis: 1.432 ± 0.969
1.432AsnIle: 1.432 ± 0.9
2.147AsnLys: 2.147 ± 1.27
2.147AsnLeu: 2.147 ± 0.953
0.716AsnMet: 0.716 ± 0.495
1.432AsnAsn: 1.432 ± 1.481
1.432AsnPro: 1.432 ± 0.9
3.579AsnGln: 3.579 ± 1.259
3.579AsnArg: 3.579 ± 1.259
2.863AsnSer: 2.863 ± 0.781
4.295AsnThr: 4.295 ± 2.09
4.295AsnVal: 4.295 ± 1.692
2.863AsnTrp: 2.863 ± 1.121
0.716AsnTyr: 0.716 ± 0.74
0.0AsnXaa: 0.0 ± 0.0
Pro
5.011ProAla: 5.011 ± 1.608
0.716ProCys: 0.716 ± 0.644
3.579ProAsp: 3.579 ± 0.999
1.432ProGlu: 1.432 ± 0.892
1.432ProPhe: 1.432 ± 0.892
2.147ProGly: 2.147 ± 1.045
1.432ProHis: 1.432 ± 0.518
2.863ProIle: 2.863 ± 0.702
4.295ProLys: 4.295 ± 1.03
2.147ProLeu: 2.147 ± 0.761
2.147ProMet: 2.147 ± 1.213
0.716ProAsn: 0.716 ± 0.495
0.0ProPro: 0.0 ± 0.0
4.295ProGln: 4.295 ± 2.483
3.579ProArg: 3.579 ± 2.305
3.579ProSer: 3.579 ± 1.804
5.727ProThr: 5.727 ± 1.559
2.863ProVal: 2.863 ± 1.581
1.432ProTrp: 1.432 ± 0.756
0.716ProTyr: 0.716 ± 0.644
0.0ProXaa: 0.0 ± 0.0
Gln
3.579GlnAla: 3.579 ± 1.717
0.716GlnCys: 0.716 ± 0.644
2.863GlnAsp: 2.863 ± 1.036
2.863GlnGlu: 2.863 ± 1.473
1.432GlnPhe: 1.432 ± 0.682
5.011GlnGly: 5.011 ± 1.608
1.432GlnHis: 1.432 ± 0.892
2.147GlnIle: 2.147 ± 1.008
5.011GlnLys: 5.011 ± 1.665
4.295GlnLeu: 4.295 ± 1.028
1.432GlnMet: 1.432 ± 1.037
2.147GlnAsn: 2.147 ± 1.045
0.0GlnPro: 0.0 ± 0.0
4.295GlnGln: 4.295 ± 1.401
5.011GlnArg: 5.011 ± 1.423
2.863GlnSer: 2.863 ± 0.705
2.863GlnThr: 2.863 ± 1.863
1.432GlnVal: 1.432 ± 0.518
0.716GlnTrp: 0.716 ± 0.644
2.147GlnTyr: 2.147 ± 1.643
0.0GlnXaa: 0.0 ± 0.0
Arg
7.158ArgAla: 7.158 ± 1.653
2.147ArgCys: 2.147 ± 1.058
5.011ArgAsp: 5.011 ± 1.662
4.295ArgGlu: 4.295 ± 1.775
2.147ArgPhe: 2.147 ± 0.934
4.295ArgGly: 4.295 ± 1.028
0.716ArgHis: 0.716 ± 0.495
2.147ArgIle: 2.147 ± 0.761
2.147ArgLys: 2.147 ± 1.324
5.727ArgLeu: 5.727 ± 1.446
2.863ArgMet: 2.863 ± 1.473
2.863ArgAsn: 2.863 ± 1.443
5.011ArgPro: 5.011 ± 1.162
1.432ArgGln: 1.432 ± 1.288
3.579ArgArg: 3.579 ± 1.653
5.011ArgSer: 5.011 ± 1.307
2.147ArgThr: 2.147 ± 1.478
2.147ArgVal: 2.147 ± 1.743
0.0ArgTrp: 0.0 ± 0.0
5.727ArgTyr: 5.727 ± 1.468
0.0ArgXaa: 0.0 ± 0.0
Ser
2.863SerAla: 2.863 ± 0.743
1.432SerCys: 1.432 ± 0.991
1.432SerAsp: 1.432 ± 0.518
2.147SerGlu: 2.147 ± 0.549
2.863SerPhe: 2.863 ± 1.203
1.432SerGly: 1.432 ± 0.71
2.147SerHis: 2.147 ± 0.795
5.011SerIle: 5.011 ± 2.138
7.874SerLys: 7.874 ± 4.985
5.727SerLeu: 5.727 ± 2.717
0.716SerMet: 0.716 ± 0.787
2.863SerAsn: 2.863 ± 1.574
5.727SerPro: 5.727 ± 1.324
5.011SerGln: 5.011 ± 1.346
5.011SerArg: 5.011 ± 1.303
5.011SerSer: 5.011 ± 0.648
7.158SerThr: 7.158 ± 3.137
2.147SerVal: 2.147 ± 0.549
0.0SerTrp: 0.0 ± 0.0
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
6.442ThrAla: 6.442 ± 3.113
0.716ThrCys: 0.716 ± 0.495
1.432ThrAsp: 1.432 ± 0.71
2.147ThrGlu: 2.147 ± 1.058
2.863ThrPhe: 2.863 ± 1.036
2.863ThrGly: 2.863 ± 0.814
1.432ThrHis: 1.432 ± 0.682
2.147ThrIle: 2.147 ± 0.782
2.147ThrLys: 2.147 ± 1.034
2.863ThrLeu: 2.863 ± 1.036
1.432ThrMet: 1.432 ± 0.9
2.863ThrAsn: 2.863 ± 1.229
3.579ThrPro: 3.579 ± 1.568
2.147ThrGln: 2.147 ± 0.691
2.863ThrArg: 2.863 ± 1.33
5.727ThrSer: 5.727 ± 2.572
0.716ThrThr: 0.716 ± 0.495
4.295ThrVal: 4.295 ± 2.016
0.716ThrTrp: 0.716 ± 0.495
4.295ThrTyr: 4.295 ± 1.134
0.0ThrXaa: 0.0 ± 0.0
Val
2.147ValAla: 2.147 ± 1.374
0.0ValCys: 0.0 ± 0.0
2.147ValAsp: 2.147 ± 1.184
3.579ValGlu: 3.579 ± 0.716
1.432ValPhe: 1.432 ± 0.518
4.295ValGly: 4.295 ± 1.317
0.0ValHis: 0.0 ± 0.0
1.432ValIle: 1.432 ± 0.682
6.442ValLys: 6.442 ± 2.792
2.147ValLeu: 2.147 ± 1.045
1.432ValMet: 1.432 ± 0.969
4.295ValAsn: 4.295 ± 2.118
4.295ValPro: 4.295 ± 2.972
0.0ValGln: 0.0 ± 0.0
5.727ValArg: 5.727 ± 1.446
1.432ValSer: 1.432 ± 1.14
4.295ValThr: 4.295 ± 1.554
1.432ValVal: 1.432 ± 0.892
0.716ValTrp: 0.716 ± 0.495
2.147ValTyr: 2.147 ± 1.042
0.0ValXaa: 0.0 ± 0.0
Trp
0.716TrpAla: 0.716 ± 0.644
0.716TrpCys: 0.716 ± 0.644
0.716TrpAsp: 0.716 ± 0.495
0.716TrpGlu: 0.716 ± 0.495
0.716TrpPhe: 0.716 ± 0.495
0.0TrpGly: 0.0 ± 0.0
1.432TrpHis: 1.432 ± 0.518
0.0TrpIle: 0.0 ± 0.0
1.432TrpLys: 1.432 ± 1.573
1.432TrpLeu: 1.432 ± 0.772
0.0TrpMet: 0.0 ± 0.0
0.716TrpAsn: 0.716 ± 0.495
2.863TrpPro: 2.863 ± 1.036
0.0TrpGln: 0.0 ± 0.0
0.716TrpArg: 0.716 ± 0.644
2.147TrpSer: 2.147 ± 0.707
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.716TrpTyr: 0.716 ± 0.495
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.579TyrAla: 3.579 ± 0.971
1.432TyrCys: 1.432 ± 0.518
3.579TyrAsp: 3.579 ± 0.893
2.863TyrGlu: 2.863 ± 2.016
2.863TyrPhe: 2.863 ± 1.33
1.432TyrGly: 1.432 ± 1.288
0.716TyrHis: 0.716 ± 0.644
1.432TyrIle: 1.432 ± 0.518
3.579TyrLys: 3.579 ± 1.316
3.579TyrLeu: 3.579 ± 1.623
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.432TyrPro: 1.432 ± 0.952
2.147TyrGln: 2.147 ± 0.934
2.147TyrArg: 2.147 ± 0.549
4.295TyrSer: 4.295 ± 1.197
0.716TyrThr: 0.716 ± 0.495
2.863TyrVal: 2.863 ± 1.203
0.0TyrTrp: 0.0 ± 0.0
2.147TyrTyr: 2.147 ± 1.661
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1398 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski