Amino acid dipeptide frequency for Soybean Putnam virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
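As an illustration of the counting described above, here is a minimal Python sketch. The page does not state the exact counting convention used by Proteome-pI, so the choices here are assumptions: overlapping pairs are counted within each protein (never across protein boundaries), one-letter codes are used instead of the three-letter codes in the table, and any non-standard letter is mapped to X (Xaa). The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all 441 pairs (20 standard letters + X).

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the Soybean Putnam virus proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])

# Expected frequency if all 400 standard dipeptides were equally likely.
expected_uniform = 1 / 400  # 0.0025, i.e. 0.25% or 2.5 per mille
```

With real data, each value in `freqs` multiplied by 1000 would be comparable to the per mille values in the table, provided the counting convention matches.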

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
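The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the 2.5‰ threshold assumes the 400-dipeptide uniform model described above, which may differ from the rule the page uses for its red and blue marking.

```python
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Two values taken from the table: SerLys (13.293) and AlaAsp (0.429).
ser_lys_fraction = per_mille_to_fraction(13.293)
labels = (representation(13.293), representation(0.429))
```

Here SerLys falls above the 2.5‰ expectation and AlaAsp below it.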

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Within each group, second residues appear in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.286 ± 0.415
AlaCys: 0.0 ± 0.0
AlaAsp: 0.429 ± 0.287
AlaGlu: 4.717 ± 1.804
AlaPhe: 4.288 ± 1.394
AlaGly: 2.573 ± 1.245
AlaHis: 0.858 ± 0.412
AlaIle: 3.431 ± 1.682
AlaLys: 3.859 ± 2.103
AlaLeu: 3.002 ± 0.911
AlaMet: 0.0 ± 0.0
AlaAsn: 2.144 ± 0.689
AlaPro: 0.429 ± 0.287
AlaGln: 1.715 ± 0.818
AlaArg: 2.573 ± 0.505
AlaSer: 4.288 ± 1.243
AlaThr: 2.573 ± 0.906
AlaVal: 1.286 ± 0.329
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.858 ± 0.37
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.858 ± 0.715
CysCys: 1.286 ± 0.329
CysAsp: 0.858 ± 0.654
CysGlu: 0.429 ± 0.357
CysPhe: 0.429 ± 0.287
CysGly: 0.0 ± 0.0
CysHis: 0.429 ± 0.634
CysIle: 0.858 ± 0.494
CysLys: 1.715 ± 0.459
CysLeu: 0.429 ± 0.357
CysMet: 0.0 ± 0.0
CysAsn: 1.715 ± 0.749
CysPro: 2.144 ± 0.854
CysGln: 0.0 ± 0.0
CysArg: 0.858 ± 0.37
CysSer: 1.715 ± 0.559
CysThr: 0.0 ± 0.0
CysVal: 0.858 ± 0.573
CysTrp: 0.429 ± 0.327
CysTyr: 0.858 ± 0.481
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.144 ± 0.771
AspCys: 1.715 ± 0.288
AspAsp: 4.288 ± 1.398
AspGlu: 3.859 ± 1.032
AspPhe: 2.144 ± 0.904
AspGly: 2.144 ± 0.854
AspHis: 1.286 ± 0.639
AspIle: 7.719 ± 2.156
AspLys: 4.717 ± 1.287
AspLeu: 3.002 ± 1.584
AspMet: 0.0 ± 0.0
AspAsn: 3.002 ± 0.429
AspPro: 1.715 ± 1.148
AspGln: 3.431 ± 1.066
AspArg: 3.431 ± 0.509
AspSer: 4.717 ± 0.7
AspThr: 2.144 ± 0.86
AspVal: 1.715 ± 0.588
AspTrp: 0.858 ± 0.412
AspTyr: 3.431 ± 1.152
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.146 ± 1.427
GluCys: 0.858 ± 0.717
GluAsp: 3.859 ± 1.032
GluGlu: 7.29 ± 1.814
GluPhe: 2.573 ± 0.442
GluGly: 1.286 ± 0.576
GluHis: 2.573 ± 1.109
GluIle: 6.003 ± 1.675
GluLys: 6.432 ± 1.208
GluLeu: 5.575 ± 1.234
GluMet: 1.286 ± 0.348
GluAsn: 2.573 ± 1.043
GluPro: 3.431 ± 1.117
GluGln: 5.146 ± 0.914
GluArg: 2.144 ± 0.471
GluSer: 6.861 ± 1.465
GluThr: 3.859 ± 0.753
GluVal: 3.859 ± 1.458
GluTrp: 0.0 ± 0.0
GluTyr: 2.144 ± 0.916
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.715 ± 0.559
PheCys: 1.715 ± 0.579
PheAsp: 1.715 ± 0.288
PheGlu: 3.431 ± 0.576
PhePhe: 1.715 ± 0.725
PheGly: 2.144 ± 0.704
PheHis: 0.858 ± 0.573
PheIle: 0.858 ± 0.903
PheLys: 5.146 ± 1.835
PheLeu: 6.003 ± 1.672
PheMet: 0.429 ± 0.452
PheAsn: 2.573 ± 0.935
PhePro: 2.573 ± 1.232
PheGln: 3.431 ± 0.646
PheArg: 1.715 ± 0.579
PheSer: 5.146 ± 1.349
PheThr: 3.002 ± 0.835
PheVal: 3.002 ± 0.838
PheTrp: 1.286 ± 0.575
PheTyr: 0.858 ± 0.654
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.573 ± 0.714
GlyCys: 0.429 ± 0.327
GlyAsp: 2.144 ± 0.957
GlyGlu: 1.715 ± 0.459
GlyPhe: 2.573 ± 1.059
GlyGly: 0.858 ± 0.574
GlyHis: 1.286 ± 0.446
GlyIle: 4.717 ± 0.885
GlyLys: 5.146 ± 1.182
GlyLeu: 3.431 ± 0.694
GlyMet: 0.429 ± 0.489
GlyAsn: 2.573 ± 1.474
GlyPro: 2.573 ± 1.29
GlyGln: 0.429 ± 0.452
GlyArg: 2.573 ± 0.673
GlySer: 4.288 ± 0.729
GlyThr: 1.286 ± 0.663
GlyVal: 0.858 ± 0.715
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.715 ± 0.87
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 1.715 ± 0.722
HisAsp: 1.286 ± 0.733
HisGlu: 1.286 ± 0.329
HisPhe: 1.286 ± 0.415
HisGly: 1.286 ± 0.567
HisHis: 1.715 ± 0.288
HisIle: 2.573 ± 1.179
HisLys: 3.002 ± 0.59
HisLeu: 1.715 ± 0.87
HisMet: 0.429 ± 0.287
HisAsn: 0.858 ± 0.573
HisPro: 0.858 ± 0.344
HisGln: 0.858 ± 0.573
HisArg: 0.858 ± 0.37
HisSer: 0.858 ± 0.473
HisThr: 0.0 ± 0.0
HisVal: 2.573 ± 0.666
HisTrp: 0.429 ± 0.287
HisTyr: 1.715 ± 0.459
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.715 ± 0.619
IleCys: 2.144 ± 0.854
IleAsp: 6.003 ± 2.813
IleGlu: 5.146 ± 0.537
IlePhe: 2.144 ± 0.689
IleGly: 3.431 ± 0.635
IleHis: 2.573 ± 1.278
IleIle: 4.717 ± 1.558
IleLys: 6.861 ± 2.134
IleLeu: 7.719 ± 1.634
IleMet: 0.858 ± 0.344
IleAsn: 3.431 ± 1.38
IlePro: 3.859 ± 0.682
IleGln: 3.859 ± 1.109
IleArg: 2.573 ± 0.928
IleSer: 3.859 ± 1.966
IleThr: 3.859 ± 1.006
IleVal: 3.431 ± 1.77
IleTrp: 0.0 ± 0.0
IleTyr: 2.573 ± 0.867
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.288 ± 0.618
LysCys: 0.0 ± 0.0
LysAsp: 9.005 ± 2.857
LysGlu: 5.146 ± 1.693
LysPhe: 5.575 ± 0.753
LysGly: 4.717 ± 1.262
LysHis: 1.715 ± 0.288
LysIle: 7.719 ± 1.947
LysLys: 11.149 ± 2.864
LysLeu: 7.719 ± 2.041
LysMet: 2.144 ± 0.741
LysAsn: 7.29 ± 1.187
LysPro: 5.146 ± 1.083
LysGln: 6.003 ± 0.921
LysArg: 5.146 ± 1.592
LysSer: 6.861 ± 1.415
LysThr: 6.003 ± 1.682
LysVal: 6.861 ± 1.408
LysTrp: 0.0 ± 0.0
LysTyr: 2.573 ± 1.024
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.288 ± 1.261
LeuCys: 1.715 ± 0.661
LeuAsp: 5.575 ± 0.73
LeuGlu: 6.003 ± 1.344
LeuPhe: 2.573 ± 0.867
LeuGly: 4.717 ± 1.593
LeuHis: 1.286 ± 0.523
LeuIle: 5.575 ± 0.679
LeuLys: 11.578 ± 1.427
LeuLeu: 6.861 ± 0.708
LeuMet: 1.715 ± 0.661
LeuAsn: 5.146 ± 1.352
LeuPro: 2.573 ± 0.942
LeuGln: 5.146 ± 1.046
LeuArg: 3.002 ± 0.99
LeuSer: 6.432 ± 2.017
LeuThr: 4.717 ± 1.635
LeuVal: 4.288 ± 0.762
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.858 ± 0.37
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.858 ± 0.473
MetCys: 0.429 ± 0.287
MetAsp: 0.0 ± 0.0
MetGlu: 2.144 ± 0.588
MetPhe: 1.286 ± 0.446
MetGly: 0.0 ± 0.0
MetHis: 0.858 ± 0.494
MetIle: 0.858 ± 0.715
MetLys: 0.858 ± 0.344
MetLeu: 0.858 ± 0.344
MetMet: 0.0 ± 0.0
MetAsn: 3.002 ± 1.171
MetPro: 0.858 ± 0.593
MetGln: 0.858 ± 0.573
MetArg: 0.429 ± 0.357
MetSer: 0.858 ± 0.602
MetThr: 1.715 ± 0.942
MetVal: 1.286 ± 0.552
MetTrp: 0.429 ± 0.287
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.573 ± 1.486
AsnCys: 0.429 ± 0.327
AsnAsp: 3.002 ± 1.033
AsnGlu: 6.003 ± 1.417
AsnPhe: 3.002 ± 0.984
AsnGly: 1.286 ± 0.64
AsnHis: 1.715 ± 0.691
AsnIle: 1.715 ± 0.739
AsnLys: 5.146 ± 1.965
AsnLeu: 6.861 ± 1.547
AsnMet: 0.429 ± 0.287
AsnAsn: 2.573 ± 0.622
AsnPro: 3.431 ± 0.927
AsnGln: 2.144 ± 0.632
AsnArg: 1.286 ± 0.663
AsnSer: 6.003 ± 0.977
AsnThr: 5.575 ± 1.197
AsnVal: 2.573 ± 0.658
AsnTrp: 0.429 ± 0.327
AsnTyr: 2.573 ± 0.953
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.715 ± 0.978
ProCys: 0.0 ± 0.0
ProAsp: 1.715 ± 0.459
ProGlu: 3.431 ± 0.932
ProPhe: 1.286 ± 0.639
ProGly: 0.429 ± 0.287
ProHis: 1.286 ± 0.552
ProIle: 3.431 ± 1.058
ProLys: 5.146 ± 1.062
ProLeu: 2.144 ± 0.963
ProMet: 0.429 ± 0.287
ProAsn: 6.003 ± 1.545
ProPro: 1.286 ± 0.766
ProGln: 0.858 ± 0.573
ProArg: 1.286 ± 0.916
ProSer: 7.29 ± 1.17
ProThr: 1.286 ± 0.523
ProVal: 3.002 ± 0.928
ProTrp: 0.429 ± 0.287
ProTyr: 0.429 ± 0.327
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.573 ± 0.714
GlnCys: 0.0 ± 0.0
GlnAsp: 3.002 ± 1.312
GlnGlu: 4.288 ± 1.057
GlnPhe: 1.715 ± 0.288
GlnGly: 3.431 ± 0.359
GlnHis: 0.429 ± 0.357
GlnIle: 4.288 ± 1.181
GlnLys: 4.288 ± 1.463
GlnLeu: 4.288 ± 2.206
GlnMet: 1.286 ± 0.734
GlnAsn: 2.144 ± 1.127
GlnPro: 0.858 ± 0.654
GlnGln: 2.573 ± 1.28
GlnArg: 1.715 ± 0.83
GlnSer: 3.859 ± 0.757
GlnThr: 2.144 ± 1.05
GlnVal: 3.431 ± 0.944
GlnTrp: 0.429 ± 0.287
GlnTyr: 0.429 ± 0.287
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 0.858 ± 0.412
ArgCys: 1.286 ± 0.329
ArgAsp: 2.144 ± 0.848
ArgGlu: 1.286 ± 0.639
ArgPhe: 2.144 ± 0.611
ArgGly: 2.573 ± 0.928
ArgHis: 1.286 ± 0.86
ArgIle: 3.859 ± 1.508
ArgLys: 3.431 ± 1.268
ArgLeu: 3.431 ± 0.544
ArgMet: 2.573 ± 0.673
ArgAsn: 2.144 ± 1.321
ArgPro: 1.715 ± 0.661
ArgGln: 0.429 ± 0.357
ArgArg: 3.002 ± 1.145
ArgSer: 3.002 ± 0.705
ArgThr: 2.144 ± 0.794
ArgVal: 1.286 ± 0.523
ArgTrp: 0.858 ± 0.573
ArgTyr: 2.573 ± 0.853
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.286 ± 0.747
SerCys: 0.429 ± 0.327
SerAsp: 5.146 ± 1.317
SerGlu: 6.861 ± 1.199
SerPhe: 5.575 ± 0.654
SerGly: 6.003 ± 0.871
SerHis: 2.144 ± 0.718
SerIle: 5.146 ± 1.174
SerLys: 13.293 ± 1.436
SerLeu: 7.719 ± 2.314
SerMet: 1.286 ± 0.523
SerAsn: 3.002 ± 1.125
SerPro: 3.859 ± 1.013
SerGln: 1.286 ± 0.329
SerArg: 3.431 ± 0.787
SerSer: 10.72 ± 3.452
SerThr: 4.717 ± 1.815
SerVal: 2.573 ± 1.3
SerTrp: 0.429 ± 0.327
SerTyr: 1.286 ± 0.576
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.431 ± 1.599
ThrCys: 0.429 ± 0.287
ThrAsp: 3.431 ± 1.375
ThrGlu: 1.286 ± 0.636
ThrPhe: 1.286 ± 0.455
ThrGly: 1.286 ± 0.523
ThrHis: 0.858 ± 0.573
ThrIle: 4.288 ± 1.438
ThrLys: 4.717 ± 1.439
ThrLeu: 6.003 ± 1.636
ThrMet: 1.286 ± 0.831
ThrAsn: 3.431 ± 1.072
ThrPro: 2.144 ± 0.602
ThrGln: 4.288 ± 1.302
ThrArg: 2.144 ± 0.689
ThrSer: 3.859 ± 1.627
ThrThr: 2.144 ± 1.09
ThrVal: 3.002 ± 1.784
ThrTrp: 0.858 ± 0.473
ThrTyr: 0.858 ± 0.473
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.858 ± 0.574
ValCys: 0.858 ± 0.573
ValAsp: 3.431 ± 1.041
ValGlu: 3.859 ± 0.528
ValPhe: 6.432 ± 0.729
ValGly: 2.144 ± 0.916
ValHis: 0.858 ± 0.903
ValIle: 1.715 ± 0.612
ValLys: 6.432 ± 1.454
ValLeu: 2.144 ± 1.226
ValMet: 1.715 ± 0.688
ValAsn: 3.859 ± 0.872
ValPro: 2.573 ± 0.726
ValGln: 2.573 ± 0.658
ValArg: 2.144 ± 0.44
ValSer: 3.431 ± 1.66
ValThr: 1.715 ± 0.769
ValVal: 2.573 ± 1.052
ValTrp: 0.429 ± 0.327
ValTyr: 3.002 ± 1.104
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.429 ± 0.452
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.715 ± 0.559
TrpPhe: 0.429 ± 0.287
TrpGly: 0.429 ± 0.287
TrpHis: 0.0 ± 0.0
TrpIle: 0.429 ± 0.327
TrpLys: 0.0 ± 0.0
TrpLeu: 0.429 ± 0.327
TrpMet: 0.429 ± 0.287
TrpAsn: 0.429 ± 0.327
TrpPro: 0.0 ± 0.0
TrpGln: 0.858 ± 0.573
TrpArg: 0.858 ± 0.37
TrpSer: 0.0 ± 0.0
TrpThr: 0.858 ± 0.573
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.429 ± 0.357
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.715 ± 0.622
TyrCys: 0.429 ± 0.327
TyrAsp: 0.429 ± 0.357
TyrGlu: 3.002 ± 1.593
TyrPhe: 0.858 ± 0.593
TyrGly: 0.858 ± 0.412
TyrHis: 1.286 ± 0.86
TyrIle: 0.858 ± 0.473
TyrLys: 2.144 ± 0.957
TyrLeu: 4.717 ± 1.214
TyrMet: 0.429 ± 0.547
TyrAsn: 0.858 ± 0.344
TyrPro: 0.858 ± 0.494
TyrGln: 1.286 ± 0.639
TyrArg: 0.858 ± 0.412
TyrSer: 2.144 ± 0.673
TyrThr: 1.286 ± 0.329
TyrVal: 4.288 ± 0.881
TyrTrp: 0.429 ± 0.357
TyrTyr: 2.573 ± 0.719
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (2333 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
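A protein-level bootstrap of this kind can be sketched as below. The page only states that bootstrapping was run 100 times at the protein level; everything else here is an assumption made for illustration: whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the error is taken as the standard deviation of those estimates. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs per protein."""
    hits = sum(
        seq[i:i + 2] == pair for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (same number as the original set).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the real proteome).
toy_proteins = ["MKKLA", "AKKS", "MSSKL"]
kk_value = pair_per_mille(toy_proteins, "KK")
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is one reason a dipeptide absent from every protein always gets an error of 0.0, as in the table.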

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski