Amino acid dipeptide frequency for Poplar mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
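As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and reports them in per mille, together with the uniform expectation. It is only a sketch under stated assumptions: the function names and the toy sequence are hypothetical, it uses one-letter codes rather than the three-letter codes of the table, and it counts pairs within each protein only. How Proteome-pI itself treats protein boundaries and normalizes the counts is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein, so no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def expected_per_mille(alphabet_size=20):
    """Frequency of each dipeptide if all residues were equally likely."""
    return 1000.0 / alphabet_size ** 2


def representation(freq, alphabet_size=20):
    """Label a per mille frequency relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequence, not taken from the proteome above).
freqs = dipeptide_per_mille(["AAAC"])
```

With 20 amino acids the expectation is 2.5‰ per dipeptide; with Xaa included (21 symbols) it drops to about 2.27‰, so the threshold used for "more than expected" depends on which alphabet is assumed.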

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 5.604‰ = 0.005604).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.604 ± 3.412
AlaCys: 1.751 ± 0.956
AlaAsp: 3.853 ± 1.26
AlaGlu: 8.406 ± 3.98
AlaPhe: 3.503 ± 1.111
AlaGly: 5.254 ± 1.437
AlaHis: 2.102 ± 0.755
AlaIle: 3.853 ± 1.771
AlaLys: 4.904 ± 1.769
AlaLeu: 5.954 ± 2.345
AlaMet: 1.051 ± 0.497
AlaAsn: 3.503 ± 1.517
AlaPro: 1.051 ± 0.584
AlaGln: 2.452 ± 0.89
AlaArg: 4.203 ± 1.409
AlaSer: 5.254 ± 1.251
AlaThr: 4.203 ± 1.632
AlaVal: 4.203 ± 1.723
AlaTrp: 0.35 ± 0.191
AlaTyr: 2.102 ± 0.755
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.452 ± 0.886
CysCys: 0.701 ± 0.382
CysAsp: 1.401 ± 2.491
CysGlu: 1.751 ± 0.648
CysPhe: 2.102 ± 0.766
CysGly: 3.152 ± 1.117
CysHis: 0.35 ± 0.745
CysIle: 1.051 ± 0.584
CysLys: 1.751 ± 0.956
CysLeu: 2.802 ± 2.032
CysMet: 0.35 ± 0.191
CysAsn: 0.701 ± 1.489
CysPro: 0.35 ± 0.191
CysGln: 0.35 ± 0.191
CysArg: 2.102 ± 0.618
CysSer: 2.452 ± 2.175
CysThr: 1.751 ± 0.72
CysVal: 3.853 ± 2.445
CysTrp: 0.35 ± 0.608
CysTyr: 0.701 ± 0.382
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.452 ± 1.339
AspCys: 1.051 ± 0.574
AspAsp: 2.102 ± 0.783
AspGlu: 3.152 ± 0.718
AspPhe: 4.203 ± 2.44
AspGly: 4.203 ± 1.43
AspHis: 1.401 ± 1.315
AspIle: 3.152 ± 2.505
AspLys: 1.401 ± 0.765
AspLeu: 4.203 ± 1.822
AspMet: 2.802 ± 1.088
AspAsn: 2.102 ± 0.911
AspPro: 3.152 ± 1.208
AspGln: 1.051 ± 0.497
AspArg: 2.452 ± 0.623
AspSer: 2.802 ± 0.901
AspThr: 1.051 ± 0.497
AspVal: 3.503 ± 0.971
AspTrp: 0.35 ± 0.191
AspTyr: 2.452 ± 0.895
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.305 ± 1.699
GluCys: 0.35 ± 0.834
GluAsp: 3.152 ± 1.273
GluGlu: 4.904 ± 1.234
GluPhe: 2.802 ± 0.595
GluGly: 5.954 ± 1.305
GluHis: 1.401 ± 0.587
GluIle: 4.203 ± 1.247
GluLys: 3.853 ± 0.885
GluLeu: 7.005 ± 1.583
GluMet: 2.452 ± 1.05
GluAsn: 1.751 ± 0.646
GluPro: 3.503 ± 1.132
GluGln: 2.102 ± 2.933
GluArg: 3.503 ± 1.421
GluSer: 5.604 ± 1.656
GluThr: 3.152 ± 2.15
GluVal: 6.655 ± 2.392
GluTrp: 0.701 ± 0.521
GluTyr: 1.751 ± 0.968
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.203 ± 1.632
PheCys: 1.051 ± 0.62
PheAsp: 3.152 ± 1.18
PheGlu: 5.254 ± 2.921
PhePhe: 1.051 ± 1.377
PheGly: 4.553 ± 2.178
PheHis: 0.701 ± 0.68
PheIle: 3.503 ± 0.801
PheLys: 2.452 ± 1.339
PheLeu: 4.553 ± 1.501
PheMet: 1.751 ± 0.646
PheAsn: 1.751 ± 1.249
PhePro: 0.701 ± 1.004
PheGln: 1.751 ± 0.958
PheArg: 2.452 ± 2.676
PheSer: 4.904 ± 2.097
PheThr: 3.853 ± 1.052
PheVal: 3.853 ± 0.986
PheTrp: 0.35 ± 0.191
PheTyr: 1.751 ± 0.668
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.954 ± 1.828
GlyCys: 1.401 ± 0.834
GlyAsp: 4.203 ± 1.09
GlyGlu: 4.553 ± 0.82
GlyPhe: 2.452 ± 0.788
GlyGly: 3.503 ± 1.917
GlyHis: 2.452 ± 1.14
GlyIle: 2.802 ± 1.18
GlyLys: 5.954 ± 1.956
GlyLeu: 4.904 ± 1.218
GlyMet: 0.701 ± 0.481
GlyAsn: 1.051 ± 0.584
GlyPro: 0.35 ± 0.191
GlyGln: 1.751 ± 0.961
GlyArg: 4.203 ± 1.625
GlySer: 5.254 ± 1.384
GlyThr: 2.802 ± 0.768
GlyVal: 6.305 ± 2.001
GlyTrp: 1.751 ± 0.956
GlyTyr: 2.802 ± 1.231
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.401 ± 0.765
HisCys: 0.35 ± 0.745
HisAsp: 0.701 ± 0.382
HisGlu: 0.701 ± 0.382
HisPhe: 1.751 ± 1.629
HisGly: 2.102 ± 1.17
HisHis: 0.701 ± 0.382
HisIle: 1.051 ± 0.679
HisLys: 2.102 ± 1.352
HisLeu: 3.853 ± 1.028
HisMet: 1.051 ± 0.784
HisAsn: 0.35 ± 0.191
HisPro: 0.35 ± 0.191
HisGln: 0.0 ± 0.0
HisArg: 1.051 ± 0.679
HisSer: 3.503 ± 1.922
HisThr: 0.35 ± 0.191
HisVal: 1.401 ± 0.765
HisTrp: 0.0 ± 0.0
HisTyr: 2.102 ± 0.755
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.904 ± 3.282
IleCys: 2.452 ± 1.849
IleAsp: 1.751 ± 0.968
IleGlu: 5.604 ± 1.955
IlePhe: 1.051 ± 0.574
IleGly: 2.102 ± 0.783
IleHis: 1.751 ± 0.961
IleIle: 3.503 ± 2.888
IleLys: 3.152 ± 0.788
IleLeu: 3.503 ± 2.563
IleMet: 1.401 ± 0.765
IleAsn: 1.751 ± 2.209
IlePro: 1.051 ± 0.584
IleGln: 2.452 ± 0.516
IleArg: 2.802 ± 1.482
IleSer: 4.203 ± 1.875
IleThr: 2.802 ± 1.335
IleVal: 2.452 ± 0.924
IleTrp: 0.35 ± 0.191
IleTyr: 2.102 ± 0.755
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.553 ± 1.72
LysCys: 1.751 ± 1.237
LysAsp: 3.503 ± 0.866
LysGlu: 3.853 ± 1.26
LysPhe: 2.802 ± 1.088
LysGly: 1.751 ± 0.646
LysHis: 1.751 ± 0.956
LysIle: 3.152 ± 0.718
LysLys: 5.954 ± 1.872
LysLeu: 7.005 ± 2.231
LysMet: 2.102 ± 0.995
LysAsn: 3.503 ± 0.826
LysPro: 2.802 ± 0.595
LysGln: 1.751 ± 0.556
LysArg: 5.604 ± 2.518
LysSer: 4.203 ± 1.754
LysThr: 4.203 ± 1.864
LysVal: 3.152 ± 0.935
LysTrp: 1.051 ± 1.116
LysTyr: 0.701 ± 0.382
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.056 ± 1.555
LeuCys: 2.452 ± 1.899
LeuAsp: 4.904 ± 1.304
LeuGlu: 4.203 ± 1.566
LeuPhe: 3.503 ± 1.457
LeuGly: 5.254 ± 1.444
LeuHis: 1.401 ± 0.765
LeuIle: 5.954 ± 2.572
LeuLys: 5.604 ± 2.211
LeuLeu: 8.757 ± 2.018
LeuMet: 2.102 ± 0.813
LeuAsn: 4.553 ± 1.904
LeuPro: 5.604 ± 2.076
LeuGln: 2.452 ± 1.025
LeuArg: 5.604 ± 1.407
LeuSer: 7.706 ± 2.391
LeuThr: 4.203 ± 0.733
LeuVal: 6.655 ± 1.94
LeuTrp: 0.701 ± 0.382
LeuTyr: 2.102 ± 1.02
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.152 ± 1.18
MetCys: 1.051 ± 0.574
MetAsp: 1.401 ± 0.587
MetGlu: 1.751 ± 0.646
MetPhe: 0.35 ± 0.191
MetGly: 1.051 ± 0.584
MetHis: 0.0 ± 0.0
MetIle: 0.35 ± 0.191
MetLys: 1.401 ± 0.544
MetLeu: 2.452 ± 1.339
MetMet: 0.0 ± 0.0
MetAsn: 0.701 ± 0.521
MetPro: 2.452 ± 1.11
MetGln: 0.35 ± 0.784
MetArg: 2.102 ± 0.618
MetSer: 1.751 ± 1.001
MetThr: 1.051 ± 0.574
MetVal: 1.401 ± 0.615
MetTrp: 0.701 ± 1.095
MetTyr: 0.35 ± 0.191
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.751 ± 1.13
AsnCys: 1.051 ± 0.584
AsnAsp: 0.701 ± 0.382
AsnGlu: 2.802 ± 1.571
AsnPhe: 2.452 ± 0.788
AsnGly: 2.452 ± 0.939
AsnHis: 1.051 ± 0.679
AsnIle: 1.051 ± 1.147
AsnLys: 1.751 ± 1.759
AsnLeu: 4.904 ± 1.725
AsnMet: 1.751 ± 0.668
AsnAsn: 1.751 ± 1.001
AsnPro: 1.751 ± 1.62
AsnGln: 1.751 ± 0.556
AsnArg: 1.751 ± 0.956
AsnSer: 3.853 ± 2.373
AsnThr: 0.35 ± 0.191
AsnVal: 3.152 ± 1.23
AsnTrp: 0.35 ± 0.191
AsnTyr: 2.452 ± 1.744
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.503 ± 1.967
ProCys: 1.401 ± 0.587
ProAsp: 3.853 ± 1.201
ProGlu: 2.802 ± 1.173
ProPhe: 0.35 ± 0.191
ProGly: 5.604 ± 4.067
ProHis: 1.751 ± 0.961
ProIle: 1.751 ± 0.968
ProLys: 1.401 ± 1.271
ProLeu: 2.102 ± 1.027
ProMet: 0.35 ± 0.191
ProAsn: 1.751 ± 0.956
ProPro: 2.102 ± 1.904
ProGln: 1.051 ± 0.584
ProArg: 1.751 ± 0.556
ProSer: 2.452 ± 1.174
ProThr: 2.802 ± 1.088
ProVal: 1.751 ± 0.879
ProTrp: 1.401 ± 0.765
ProTyr: 1.751 ± 0.956
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.102 ± 1.689
GlnCys: 0.701 ± 0.382
GlnAsp: 0.701 ± 0.382
GlnGlu: 1.751 ± 0.745
GlnPhe: 0.701 ± 0.382
GlnGly: 2.102 ± 0.755
GlnHis: 1.051 ± 0.574
GlnIle: 1.051 ± 1.116
GlnLys: 1.051 ± 1.424
GlnLeu: 1.751 ± 0.648
GlnMet: 0.0 ± 0.0
GlnAsn: 0.701 ± 0.382
GlnPro: 1.401 ± 1.04
GlnGln: 0.35 ± 0.608
GlnArg: 1.401 ± 0.765
GlnSer: 3.152 ± 1.279
GlnThr: 1.051 ± 0.497
GlnVal: 3.152 ± 0.566
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.401 ± 1.04
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.904 ± 0.72
ArgCys: 2.452 ± 2.918
ArgAsp: 2.452 ± 0.906
ArgGlu: 5.254 ± 1.031
ArgPhe: 4.904 ± 0.821
ArgGly: 2.802 ± 1.789
ArgHis: 1.051 ± 1.377
ArgIle: 1.751 ± 0.556
ArgLys: 4.203 ± 1.073
ArgLeu: 4.553 ± 1.33
ArgMet: 1.051 ± 0.574
ArgAsn: 2.452 ± 1.092
ArgPro: 1.751 ± 1.13
ArgGln: 0.0 ± 0.0
ArgArg: 3.152 ± 0.566
ArgSer: 6.305 ± 1.137
ArgThr: 1.051 ± 0.584
ArgVal: 4.904 ± 1.765
ArgTrp: 0.701 ± 0.382
ArgTyr: 3.503 ± 0.801
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.853 ± 2.103
SerCys: 1.751 ± 1.165
SerAsp: 4.203 ± 1.293
SerGlu: 6.305 ± 2.462
SerPhe: 4.904 ± 1.59
SerGly: 4.203 ± 0.733
SerHis: 1.751 ± 0.956
SerIle: 5.604 ± 2.178
SerLys: 7.706 ± 2.627
SerLeu: 5.954 ± 1.815
SerMet: 1.751 ± 0.639
SerAsn: 2.802 ± 1.746
SerPro: 4.203 ± 1.822
SerGln: 2.802 ± 1.105
SerArg: 6.655 ± 1.63
SerSer: 5.604 ± 1.622
SerThr: 5.254 ± 1.068
SerVal: 1.401 ± 0.544
SerTrp: 0.701 ± 1.244
SerTyr: 2.452 ± 1.879
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.152 ± 0.809
ThrCys: 1.751 ± 1.076
ThrAsp: 1.401 ± 0.663
ThrGlu: 3.152 ± 1.369
ThrPhe: 8.056 ± 0.862
ThrGly: 2.102 ± 1.109
ThrHis: 2.102 ± 0.911
ThrIle: 2.102 ± 1.147
ThrLys: 2.802 ± 2.338
ThrLeu: 7.356 ± 1.559
ThrMet: 1.051 ± 0.537
ThrAsn: 1.401 ± 0.765
ThrPro: 1.401 ± 1.106
ThrGln: 0.0 ± 0.0
ThrArg: 2.102 ± 1.519
ThrSer: 1.751 ± 0.72
ThrThr: 1.051 ± 1.116
ThrVal: 2.102 ± 0.501
ThrTrp: 0.35 ± 0.191
ThrTyr: 0.701 ± 0.382
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.152 ± 0.957
ValCys: 4.203 ± 1.632
ValAsp: 1.751 ± 0.956
ValGlu: 3.152 ± 1.273
ValPhe: 4.203 ± 1.305
ValGly: 4.904 ± 0.774
ValHis: 1.051 ± 0.679
ValIle: 2.102 ± 1.147
ValLys: 4.904 ± 1.234
ValLeu: 6.655 ± 1.565
ValMet: 1.051 ± 1.116
ValAsn: 3.152 ± 1.133
ValPro: 3.853 ± 0.884
ValGln: 2.802 ± 1.53
ValArg: 3.503 ± 0.971
ValSer: 5.254 ± 1.581
ValThr: 3.503 ± 1.393
ValVal: 3.503 ± 2.242
ValTrp: 0.35 ± 0.191
ValTyr: 1.401 ± 0.765
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.35 ± 0.191
TrpAsp: 1.051 ± 0.574
TrpGlu: 0.35 ± 0.745
TrpPhe: 1.051 ± 0.574
TrpGly: 0.35 ± 0.191
TrpHis: 0.701 ± 0.521
TrpIle: 0.35 ± 0.191
TrpLys: 0.35 ± 0.191
TrpLeu: 1.401 ± 0.765
TrpMet: 0.0 ± 0.0
TrpAsn: 1.751 ± 0.669
TrpPro: 0.35 ± 0.191
TrpGln: 0.0 ± 0.0
TrpArg: 0.35 ± 0.608
TrpSer: 1.051 ± 0.955
TrpThr: 0.35 ± 0.608
TrpVal: 0.701 ± 0.382
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.35 ± 0.191
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.102 ± 0.951
TyrCys: 2.452 ± 0.886
TyrAsp: 2.802 ± 1.231
TyrGlu: 1.751 ± 0.956
TyrPhe: 1.401 ± 0.615
TyrGly: 1.751 ± 0.668
TyrHis: 0.35 ± 0.191
TyrIle: 2.802 ± 1.043
TyrLys: 2.452 ± 1.339
TyrLeu: 2.102 ± 0.783
TyrMet: 0.701 ± 0.382
TyrAsn: 1.401 ± 0.765
TyrPro: 3.503 ± 1.236
TyrGln: 0.35 ± 0.834
TyrArg: 2.452 ± 1.879
TyrSer: 2.802 ± 1.043
TyrThr: 1.051 ± 1.91
TyrVal: 0.35 ± 0.191
TyrTrp: 0.35 ± 0.191
TyrTyr: 1.051 ± 0.574
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2856 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
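To make the note concrete, here is a minimal Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. Several details are assumptions rather than facts from this page: that the reported error is the standard deviation across replicates, the function names `per_mille` and `bootstrap_error`, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MAAKL", "MKLLA", "AAAAK", "MLKAL"]
err = bootstrap_error(toy, "AA")
```

With only 6 proteins, as in this proteome, a resample can easily omit the protein that carries most copies of a given dipeptide, which is one plausible reason why some errors in the table exceed the frequency itself.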

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski