Amino acid dipeptide frequency for Velvet tobacco mottle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
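As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It assumes one-letter sequence codes (the table uses three-letter names), pairs that never span two proteins, and normalization by the total number of pairs; the function name and toy sequences are hypothetical and not taken from Proteome-pI.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences for illustration only, not the virus proteome.
freqs = dipeptide_frequencies(["MSSA", "SSK"])
```

Here the five pairs are MS, SS, SA, SS and SK, so `freqs["SS"]` is 2 out of 5, i.e. 400‰.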

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
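The arithmetic behind the expected value and the unit conversion can be sketched as follows. The helper names are hypothetical, and the plain threshold comparison is only an assumption about how over- and underrepresented dipeptides might be told apart; the page does not state the exact rule used for the colouring.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD**2

def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

# Two values taken from the table: SerSer (15.464) and AlaPhe (1.289).
labels = {"SerSer": classify(15.464), "AlaPhe": classify(1.289)}
```

With these table values, SerSer comes out as overrepresented and AlaPhe as underrepresented relative to 2.5‰.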

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.866 ± 1.382
AlaCys: 0.0 ± 0.0
AlaAsp: 4.51 ± 2.688
AlaGlu: 7.088 ± 1.941
AlaPhe: 1.289 ± 0.952
AlaGly: 3.222 ± 2.854
AlaHis: 1.289 ± 0.461
AlaIle: 1.933 ± 1.107
AlaLys: 5.799 ± 2.471
AlaLeu: 4.51 ± 2.094
AlaMet: 1.933 ± 0.458
AlaAsn: 0.644 ± 0.489
AlaPro: 5.799 ± 1.538
AlaGln: 1.289 ± 0.694
AlaArg: 5.155 ± 0.421
AlaSer: 8.376 ± 1.959
AlaThr: 4.51 ± 2.58
AlaVal: 3.222 ± 1.069
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.644 ± 0.943
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.644 ± 0.489
CysAsp: 2.577 ± 0.895
CysGlu: 1.933 ± 1.154
CysPhe: 1.933 ± 1.468
CysGly: 0.644 ± 0.943
CysHis: 0.0 ± 0.0
CysIle: 0.644 ± 0.943
CysLys: 1.933 ± 0.81
CysLeu: 1.933 ± 0.824
CysMet: 0.644 ± 0.489
CysAsn: 0.0 ± 0.0
CysPro: 1.933 ± 1.154
CysGln: 3.222 ± 0.756
CysArg: 0.0 ± 0.0
CysSer: 1.933 ± 1.154
CysThr: 1.289 ± 0.934
CysVal: 0.644 ± 0.943
CysTrp: 0.0 ± 0.0
CysTyr: 1.289 ± 0.993
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.933 ± 0.81
AspCys: 0.644 ± 0.762
AspAsp: 2.577 ± 0.921
AspGlu: 3.866 ± 2.241
AspPhe: 5.155 ± 2.748
AspGly: 2.577 ± 0.921
AspHis: 1.289 ± 0.934
AspIle: 2.577 ± 1.204
AspLys: 1.933 ± 1.154
AspLeu: 4.51 ± 2.058
AspMet: 0.644 ± 0.489
AspAsn: 0.644 ± 0.762
AspPro: 1.933 ± 0.824
AspGln: 0.644 ± 0.489
AspArg: 0.644 ± 0.762
AspSer: 6.443 ± 2.265
AspThr: 0.644 ± 0.489
AspVal: 2.577 ± 0.789
AspTrp: 1.289 ± 0.979
AspTyr: 4.51 ± 1.609
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.155 ± 1.389
GluCys: 0.0 ± 0.0
GluAsp: 1.933 ± 0.81
GluGlu: 3.222 ± 1.211
GluPhe: 4.51 ± 1.375
GluGly: 4.51 ± 2.434
GluHis: 0.0 ± 0.0
GluIle: 5.155 ± 1.166
GluLys: 4.51 ± 0.544
GluLeu: 8.376 ± 3.246
GluMet: 0.0 ± 0.0
GluAsn: 2.577 ± 0.921
GluPro: 5.799 ± 2.141
GluGln: 5.155 ± 1.166
GluArg: 5.155 ± 2.457
GluSer: 3.222 ± 0.715
GluThr: 3.222 ± 0.513
GluVal: 4.51 ± 1.819
GluTrp: 0.644 ± 0.489
GluTyr: 1.289 ± 0.934
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.289 ± 0.461
PheCys: 1.933 ± 1.468
PheAsp: 2.577 ± 0.895
PheGlu: 1.289 ± 0.993
PhePhe: 0.644 ± 0.943
PheGly: 2.577 ± 1.423
PheHis: 0.644 ± 0.489
PheIle: 0.644 ± 0.762
PheLys: 1.933 ± 0.783
PheLeu: 1.933 ± 0.783
PheMet: 0.644 ± 0.489
PheAsn: 1.289 ± 0.461
PhePro: 1.289 ± 0.461
PheGln: 2.577 ± 0.789
PheArg: 3.866 ± 1.621
PheSer: 3.866 ± 1.64
PheThr: 1.933 ± 1.073
PheVal: 3.866 ± 2.605
PheTrp: 0.644 ± 0.489
PheTyr: 1.289 ± 0.461
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.443 ± 1.325
GlyCys: 0.644 ± 0.943
GlyAsp: 2.577 ± 0.575
GlyGlu: 5.155 ± 0.733
GlyPhe: 5.155 ± 1.884
GlyGly: 1.933 ± 0.81
GlyHis: 0.644 ± 0.489
GlyIle: 1.289 ± 1.887
GlyLys: 3.222 ± 1.758
GlyLeu: 3.866 ± 1.506
GlyMet: 1.933 ± 0.458
GlyAsn: 2.577 ± 0.575
GlyPro: 1.933 ± 1.468
GlyGln: 3.222 ± 0.972
GlyArg: 2.577 ± 1.147
GlySer: 7.732 ± 1.262
GlyThr: 3.222 ± 3.808
GlyVal: 5.155 ± 2.398
GlyTrp: 2.577 ± 1.28
GlyTyr: 2.577 ± 1.257
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.289 ± 0.461
HisCys: 0.0 ± 0.0
HisAsp: 1.933 ± 1.154
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.289 ± 0.694
HisHis: 0.0 ± 0.0
HisIle: 1.289 ± 1.217
HisLys: 0.644 ± 0.497
HisLeu: 2.577 ± 1.672
HisMet: 0.0 ± 0.0
HisAsn: 0.644 ± 0.497
HisPro: 1.289 ± 0.461
HisGln: 1.289 ± 0.979
HisArg: 0.644 ± 0.489
HisSer: 0.644 ± 0.762
HisThr: 1.933 ± 0.81
HisVal: 1.289 ± 0.461
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.222 ± 1.05
IleCys: 0.644 ± 0.943
IleAsp: 1.289 ± 0.952
IleGlu: 5.155 ± 1.839
IlePhe: 0.0 ± 0.0
IleGly: 3.866 ± 2.244
IleHis: 0.644 ± 0.497
IleIle: 1.933 ± 1.154
IleLys: 1.933 ± 0.783
IleLeu: 1.933 ± 1.107
IleMet: 0.0 ± 0.0
IleAsn: 1.289 ± 0.763
IlePro: 3.866 ± 1.848
IleGln: 1.289 ± 1.523
IleArg: 1.933 ± 0.824
IleSer: 7.088 ± 2.087
IleThr: 0.644 ± 0.497
IleVal: 3.222 ± 0.715
IleTrp: 0.644 ± 0.489
IleTyr: 1.933 ± 2.285
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.933 ± 0.81
LysCys: 1.933 ± 0.824
LysAsp: 3.866 ± 1.102
LysGlu: 4.51 ± 1.911
LysPhe: 2.577 ± 0.895
LysGly: 2.577 ± 0.575
LysHis: 0.644 ± 0.489
LysIle: 2.577 ± 1.672
LysLys: 2.577 ± 1.204
LysLeu: 4.51 ± 1.172
LysMet: 1.289 ± 0.993
LysAsn: 1.933 ± 0.936
LysPro: 5.155 ± 3.226
LysGln: 1.933 ± 1.031
LysArg: 2.577 ± 1.147
LysSer: 5.799 ± 0.974
LysThr: 3.866 ± 1.074
LysVal: 2.577 ± 1.28
LysTrp: 1.933 ± 1.444
LysTyr: 1.933 ± 0.81
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.732 ± 1.725
LeuCys: 2.577 ± 1.257
LeuAsp: 3.222 ± 0.972
LeuGlu: 4.51 ± 1.678
LeuPhe: 2.577 ± 1.257
LeuGly: 1.933 ± 1.031
LeuHis: 3.222 ± 1.88
LeuIle: 7.088 ± 1.28
LeuLys: 2.577 ± 0.575
LeuLeu: 9.021 ± 2.737
LeuMet: 1.933 ± 1.468
LeuAsn: 1.289 ± 0.461
LeuPro: 2.577 ± 2.105
LeuGln: 1.933 ± 0.783
LeuArg: 9.021 ± 0.984
LeuSer: 7.088 ± 1.067
LeuThr: 3.866 ± 1.188
LeuVal: 6.443 ± 1.975
LeuTrp: 4.51 ± 1.663
LeuTyr: 3.866 ± 0.433
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.577 ± 1.388
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 3.866 ± 1.506
MetHis: 0.644 ± 0.489
MetIle: 2.577 ± 1.388
MetLys: 1.289 ± 0.461
MetLeu: 3.866 ± 1.433
MetMet: 0.644 ± 0.497
MetAsn: 0.0 ± 0.0
MetPro: 0.644 ± 0.943
MetGln: 0.0 ± 0.0
MetArg: 0.644 ± 0.489
MetSer: 2.577 ± 1.169
MetThr: 0.644 ± 0.497
MetVal: 1.933 ± 0.824
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.933 ± 1.49
AsnCys: 1.289 ± 0.952
AsnAsp: 1.289 ± 0.461
AsnGlu: 3.222 ± 1.239
AsnPhe: 3.866 ± 1.336
AsnGly: 2.577 ± 1.423
AsnHis: 0.0 ± 0.0
AsnIle: 1.289 ± 0.694
AsnLys: 2.577 ± 1.28
AsnLeu: 2.577 ± 0.921
AsnMet: 0.644 ± 0.787
AsnAsn: 0.0 ± 0.0
AsnPro: 0.644 ± 0.497
AsnGln: 1.289 ± 0.763
AsnArg: 1.933 ± 1.154
AsnSer: 3.222 ± 1.758
AsnThr: 1.933 ± 2.285
AsnVal: 1.289 ± 0.993
AsnTrp: 1.289 ± 0.952
AsnTyr: 0.644 ± 0.489
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.799 ± 1.538
ProCys: 1.289 ± 0.993
ProAsp: 1.289 ± 0.461
ProGlu: 5.155 ± 1.714
ProPhe: 1.933 ± 1.49
ProGly: 4.51 ± 0.861
ProHis: 0.644 ± 0.489
ProIle: 1.933 ± 0.458
ProLys: 5.155 ± 1.714
ProLeu: 2.577 ± 0.575
ProMet: 1.933 ± 1.37
ProAsn: 3.222 ± 0.972
ProPro: 3.222 ± 1.774
ProGln: 1.289 ± 0.934
ProArg: 0.0 ± 0.0
ProSer: 6.443 ± 1.893
ProThr: 4.51 ± 0.88
ProVal: 1.933 ± 0.824
ProTrp: 1.933 ± 1.031
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.577 ± 1.423
GlnCys: 0.644 ± 0.943
GlnAsp: 0.644 ± 0.497
GlnGlu: 2.577 ± 1.257
GlnPhe: 0.644 ± 0.497
GlnGly: 2.577 ± 1.388
GlnHis: 0.0 ± 0.0
GlnIle: 1.933 ± 1.468
GlnLys: 1.289 ± 0.952
GlnLeu: 2.577 ± 0.646
GlnMet: 1.289 ± 0.694
GlnAsn: 1.933 ± 0.458
GlnPro: 0.644 ± 0.497
GlnGln: 0.644 ± 0.762
GlnArg: 1.933 ± 0.824
GlnSer: 3.222 ± 1.541
GlnThr: 3.866 ± 1.156
GlnVal: 3.866 ± 1.0
GlnTrp: 0.644 ± 0.489
GlnTyr: 3.222 ± 2.327
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.799 ± 0.733
ArgCys: 1.933 ± 1.468
ArgAsp: 2.577 ± 0.789
ArgGlu: 3.866 ± 1.621
ArgPhe: 3.222 ± 1.727
ArgGly: 2.577 ± 0.921
ArgHis: 0.644 ± 0.762
ArgIle: 2.577 ± 0.921
ArgLys: 3.866 ± 1.999
ArgLeu: 6.443 ± 1.918
ArgMet: 0.644 ± 1.386
ArgAsn: 2.577 ± 0.921
ArgPro: 2.577 ± 1.328
ArgGln: 0.644 ± 0.489
ArgArg: 10.954 ± 2.825
ArgSer: 5.155 ± 1.735
ArgThr: 1.933 ± 0.81
ArgVal: 1.289 ± 0.461
ArgTrp: 0.644 ± 0.762
ArgTyr: 1.289 ± 0.763
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.799 ± 1.373
SerCys: 3.866 ± 2.364
SerAsp: 4.51 ± 2.494
SerGlu: 6.443 ± 2.094
SerPhe: 0.644 ± 0.489
SerGly: 9.665 ± 3.081
SerHis: 1.289 ± 0.461
SerIle: 3.222 ± 1.256
SerLys: 8.376 ± 2.806
SerLeu: 10.309 ± 4.145
SerMet: 2.577 ± 1.194
SerAsn: 2.577 ± 0.895
SerPro: 5.799 ± 1.445
SerGln: 2.577 ± 1.147
SerArg: 7.088 ± 1.82
SerSer: 15.464 ± 1.36
SerThr: 7.088 ± 2.446
SerVal: 10.309 ± 0.61
SerTrp: 0.644 ± 0.489
SerTyr: 1.933 ± 0.81
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.577 ± 1.388
ThrCys: 0.644 ± 0.489
ThrAsp: 1.933 ± 1.798
ThrGlu: 1.289 ± 0.979
ThrPhe: 1.933 ± 1.073
ThrGly: 3.866 ± 1.188
ThrHis: 1.933 ± 0.81
ThrIle: 0.644 ± 0.497
ThrLys: 2.577 ± 0.575
ThrLeu: 7.088 ± 3.858
ThrMet: 1.933 ± 1.49
ThrAsn: 1.933 ± 1.107
ThrPro: 3.222 ± 1.156
ThrGln: 1.289 ± 0.763
ThrArg: 1.289 ± 1.217
ThrSer: 7.088 ± 0.951
ThrThr: 5.799 ± 1.483
ThrVal: 4.51 ± 2.615
ThrTrp: 0.644 ± 0.762
ThrTyr: 1.933 ± 0.458
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.866 ± 1.99
ValCys: 2.577 ± 2.75
ValAsp: 3.866 ± 1.616
ValGlu: 5.799 ± 3.286
ValPhe: 0.644 ± 0.489
ValGly: 5.799 ± 1.412
ValHis: 1.289 ± 1.217
ValIle: 1.289 ± 1.523
ValLys: 3.866 ± 1.433
ValLeu: 4.51 ± 1.007
ValMet: 1.289 ± 0.694
ValAsn: 5.799 ± 0.711
ValPro: 3.222 ± 0.513
ValGln: 3.866 ± 1.188
ValArg: 3.866 ± 0.822
ValSer: 7.088 ± 3.116
ValThr: 2.577 ± 1.599
ValVal: 7.088 ± 4.011
ValTrp: 0.644 ± 0.497
ValTyr: 0.644 ± 0.497
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.289 ± 0.979
TrpCys: 0.0 ± 0.0
TrpAsp: 1.289 ± 0.461
TrpGlu: 1.289 ± 0.979
TrpPhe: 0.0 ± 0.0
TrpGly: 0.644 ± 0.489
TrpHis: 1.289 ± 0.461
TrpIle: 1.289 ± 0.763
TrpLys: 0.0 ± 0.0
TrpLeu: 1.933 ± 0.458
TrpMet: 0.0 ± 0.0
TrpAsn: 1.289 ± 0.461
TrpPro: 2.577 ± 1.257
TrpGln: 0.644 ± 0.497
TrpArg: 0.644 ± 0.762
TrpSer: 5.155 ± 0.421
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.644 ± 0.762
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.933 ± 1.154
TyrAsp: 2.577 ± 0.789
TyrGlu: 2.577 ± 1.423
TyrPhe: 0.0 ± 0.0
TyrGly: 3.222 ± 0.715
TyrHis: 0.644 ± 0.497
TyrIle: 0.644 ± 0.762
TyrLys: 0.644 ± 0.762
TyrLeu: 1.933 ± 0.824
TyrMet: 1.289 ± 0.498
TyrAsn: 1.289 ± 0.993
TyrPro: 0.644 ± 0.489
TyrGln: 1.933 ± 1.073
TyrArg: 1.933 ± 1.031
TyrSer: 2.577 ± 1.526
TyrThr: 0.644 ± 0.943
TyrVal: 3.866 ± 1.0
TyrTrp: 1.289 ± 0.461
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1553 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
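One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each resample, and the spread of those estimates is reported. Using the standard deviation as the error, the fixed seed, the one-letter codes and all function names are assumptions made for illustration; the exact Proteome-pI procedure is not described on this page.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the reported error is the standard deviation.
    return statistics.stdev(estimates)

# Toy sequences for illustration only, not the virus proteome.
toy = ["MSSA", "SSK", "MKLV"]
err_ss = bootstrap_error(toy, "SS")
err_ww = bootstrap_error(toy, "WW")  # pair absent from every protein
```

A dipeptide that occurs in none of the proteins has zero frequency in every resample, so its error is zero as well, which matches the 0.0 ± 0.0 entries in the table.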

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski