Amino acid dipepetide frequency for Tomato mottle Taino virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.405AlaAla: 1.405 ± 0.755
1.405AlaCys: 1.405 ± 0.755
1.405AlaAsp: 1.405 ± 0.601
1.405AlaGlu: 1.405 ± 0.913
0.703AlaPhe: 0.703 ± 0.63
2.811AlaGly: 2.811 ± 1.31
2.811AlaHis: 2.811 ± 1.111
3.514AlaIle: 3.514 ± 1.894
4.919AlaLys: 4.919 ± 1.728
4.919AlaLeu: 4.919 ± 1.545
1.405AlaMet: 1.405 ± 1.074
1.405AlaAsn: 1.405 ± 0.755
2.811AlaPro: 2.811 ± 0.971
2.811AlaGln: 2.811 ± 0.828
4.919AlaArg: 4.919 ± 1.804
9.838AlaSer: 9.838 ± 3.683
3.514AlaThr: 3.514 ± 1.264
2.811AlaVal: 2.811 ± 0.999
0.0AlaTrp: 0.0 ± 0.0
1.405AlaTyr: 1.405 ± 1.331
0.0AlaXaa: 0.0 ± 0.0
Cys
1.405CysAla: 1.405 ± 0.961
0.0CysCys: 0.0 ± 0.0
0.703CysAsp: 0.703 ± 0.63
0.703CysGlu: 0.703 ± 0.682
0.0CysPhe: 0.0 ± 0.0
0.703CysGly: 0.703 ± 0.922
0.0CysHis: 0.0 ± 0.0
0.703CysIle: 0.703 ± 0.63
2.811CysLys: 2.811 ± 0.529
1.405CysLeu: 1.405 ± 1.235
1.405CysMet: 1.405 ± 0.603
1.405CysAsn: 1.405 ± 0.601
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.703CysArg: 0.703 ± 0.537
2.108CysSer: 2.108 ± 1.846
2.108CysThr: 2.108 ± 1.085
2.108CysVal: 2.108 ± 1.378
1.405CysTrp: 1.405 ± 1.235
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.703AspAla: 0.703 ± 0.63
0.0AspCys: 0.0 ± 0.0
2.108AspAsp: 2.108 ± 1.846
4.216AspGlu: 4.216 ± 0.992
4.216AspPhe: 4.216 ± 1.158
2.811AspGly: 2.811 ± 1.488
0.703AspHis: 0.703 ± 0.63
2.811AspIle: 2.811 ± 0.863
2.108AspLys: 2.108 ± 1.109
4.919AspLeu: 4.919 ± 1.384
0.703AspMet: 0.703 ± 0.617
1.405AspAsn: 1.405 ± 0.711
1.405AspPro: 1.405 ± 1.26
0.703AspGln: 0.703 ± 0.665
4.216AspArg: 4.216 ± 2.132
6.325AspSer: 6.325 ± 0.715
2.811AspThr: 2.811 ± 1.199
6.325AspVal: 6.325 ± 1.571
1.405AspTrp: 1.405 ± 1.074
1.405AspTyr: 1.405 ± 0.818
0.0AspXaa: 0.0 ± 0.0
Glu
1.405GluAla: 1.405 ± 0.755
0.703GluCys: 0.703 ± 0.63
0.703GluAsp: 0.703 ± 0.665
2.811GluGlu: 2.811 ± 1.737
0.703GluPhe: 0.703 ± 0.537
4.919GluGly: 4.919 ± 2.064
0.0GluHis: 0.0 ± 0.0
2.811GluIle: 2.811 ± 1.999
0.703GluLys: 0.703 ± 0.617
4.919GluLeu: 4.919 ± 1.054
0.703GluMet: 0.703 ± 0.537
7.027GluAsn: 7.027 ± 1.633
3.514GluPro: 3.514 ± 1.277
2.108GluGln: 2.108 ± 1.335
2.108GluArg: 2.108 ± 0.72
4.216GluSer: 4.216 ± 3.025
0.703GluThr: 0.703 ± 0.537
0.703GluVal: 0.703 ± 0.63
2.811GluTrp: 2.811 ± 1.31
1.405GluTyr: 1.405 ± 1.26
0.0GluXaa: 0.0 ± 0.0
Phe
2.108PheAla: 2.108 ± 0.808
0.703PheCys: 0.703 ± 0.682
2.108PheAsp: 2.108 ± 1.257
0.703PheGlu: 0.703 ± 0.537
1.405PhePhe: 1.405 ± 0.601
2.108PheGly: 2.108 ± 0.703
2.811PheHis: 2.811 ± 0.968
2.108PheIle: 2.108 ± 1.141
4.216PheLys: 4.216 ± 2.126
2.108PheLeu: 2.108 ± 1.143
0.0PheMet: 0.0 ± 0.0
4.216PheAsn: 4.216 ± 0.894
2.108PhePro: 2.108 ± 1.109
2.108PheGln: 2.108 ± 1.121
1.405PheArg: 1.405 ± 0.913
4.216PheSer: 4.216 ± 1.55
2.811PheThr: 2.811 ± 0.999
2.108PheVal: 2.108 ± 1.257
2.108PheTrp: 2.108 ± 1.583
2.108PheTyr: 2.108 ± 0.85
0.0PheXaa: 0.0 ± 0.0
Gly
4.216GlyAla: 4.216 ± 1.877
2.108GlyCys: 2.108 ± 1.044
1.405GlyAsp: 1.405 ± 1.074
4.919GlyGlu: 4.919 ± 1.404
1.405GlyPhe: 1.405 ± 1.034
3.514GlyGly: 3.514 ± 1.166
1.405GlyHis: 1.405 ± 0.998
2.108GlyIle: 2.108 ± 1.085
8.433GlyLys: 8.433 ± 1.159
2.108GlyLeu: 2.108 ± 1.338
1.405GlyMet: 1.405 ± 1.154
3.514GlyAsn: 3.514 ± 1.626
4.216GlyPro: 4.216 ± 1.171
2.811GlyGln: 2.811 ± 1.223
0.703GlyArg: 0.703 ± 0.537
4.919GlySer: 4.919 ± 0.883
4.919GlyThr: 4.919 ± 1.669
2.811GlyVal: 2.811 ± 1.674
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.405HisAla: 1.405 ± 0.898
1.405HisCys: 1.405 ± 0.961
2.811HisAsp: 2.811 ± 1.232
1.405HisGlu: 1.405 ± 0.744
1.405HisPhe: 1.405 ± 0.601
1.405HisGly: 1.405 ± 1.034
0.703HisHis: 0.703 ± 0.922
0.0HisIle: 0.0 ± 0.0
0.703HisLys: 0.703 ± 0.665
4.216HisLeu: 4.216 ± 1.003
0.703HisMet: 0.703 ± 0.617
3.514HisAsn: 3.514 ± 1.585
1.405HisPro: 1.405 ± 0.601
2.811HisGln: 2.811 ± 1.184
3.514HisArg: 3.514 ± 2.06
2.108HisSer: 2.108 ± 0.991
2.811HisThr: 2.811 ± 1.883
3.514HisVal: 3.514 ± 0.985
0.703HisTrp: 0.703 ± 0.537
1.405HisTyr: 1.405 ± 0.601
0.0HisXaa: 0.0 ± 0.0
Ile
2.108IleAla: 2.108 ± 1.085
0.703IleCys: 0.703 ± 0.537
3.514IleAsp: 3.514 ± 1.168
4.216IleGlu: 4.216 ± 1.88
1.405IlePhe: 1.405 ± 0.998
2.108IleGly: 2.108 ± 1.008
2.108IleHis: 2.108 ± 1.306
0.703IleIle: 0.703 ± 0.63
5.622IleLys: 5.622 ± 1.507
3.514IleLeu: 3.514 ± 1.487
0.703IleMet: 0.703 ± 0.617
2.811IleAsn: 2.811 ± 1.477
2.811IlePro: 2.811 ± 1.135
2.811IleGln: 2.811 ± 1.354
6.325IleArg: 6.325 ± 1.616
2.811IleSer: 2.811 ± 1.184
3.514IleThr: 3.514 ± 0.93
3.514IleVal: 3.514 ± 1.175
1.405IleTrp: 1.405 ± 0.898
2.108IleTyr: 2.108 ± 1.583
0.0IleXaa: 0.0 ± 0.0
Lys
4.919LysAla: 4.919 ± 1.056
0.0LysCys: 0.0 ± 0.0
6.325LysAsp: 6.325 ± 2.236
2.108LysGlu: 2.108 ± 1.611
2.811LysPhe: 2.811 ± 1.352
4.216LysGly: 4.216 ± 1.82
2.108LysHis: 2.108 ± 1.109
6.325LysIle: 6.325 ± 1.423
1.405LysLys: 1.405 ± 0.998
4.216LysLeu: 4.216 ± 1.494
1.405LysMet: 1.405 ± 0.929
3.514LysAsn: 3.514 ± 1.322
3.514LysPro: 3.514 ± 0.926
0.703LysGln: 0.703 ± 0.63
6.325LysArg: 6.325 ± 2.641
2.811LysSer: 2.811 ± 0.863
1.405LysThr: 1.405 ± 1.074
4.216LysVal: 4.216 ± 3.211
0.703LysTrp: 0.703 ± 0.63
2.108LysTyr: 2.108 ± 1.119
0.0LysXaa: 0.0 ± 0.0
Leu
2.108LeuAla: 2.108 ± 1.162
0.703LeuCys: 0.703 ± 0.537
3.514LeuAsp: 3.514 ± 1.405
2.108LeuGlu: 2.108 ± 1.143
2.108LeuPhe: 2.108 ± 1.162
4.216LeuGly: 4.216 ± 0.705
5.622LeuHis: 5.622 ± 1.638
0.703LeuIle: 0.703 ± 0.665
6.325LeuLys: 6.325 ± 1.146
2.811LeuLeu: 2.811 ± 1.37
0.703LeuMet: 0.703 ± 0.617
5.622LeuAsn: 5.622 ± 0.551
1.405LeuPro: 1.405 ± 1.845
2.108LeuGln: 2.108 ± 1.338
3.514LeuArg: 3.514 ± 0.587
9.838LeuSer: 9.838 ± 3.264
2.811LeuThr: 2.811 ± 1.203
5.622LeuVal: 5.622 ± 1.154
0.0LeuTrp: 0.0 ± 0.0
2.811LeuTyr: 2.811 ± 0.808
0.0LeuXaa: 0.0 ± 0.0
Met
1.405MetAla: 1.405 ± 0.755
0.703MetCys: 0.703 ± 0.682
5.622MetAsp: 5.622 ± 1.821
0.0MetGlu: 0.0 ± 0.0
2.108MetPhe: 2.108 ± 0.703
0.703MetGly: 0.703 ± 0.63
0.703MetHis: 0.703 ± 0.682
0.703MetIle: 0.703 ± 0.617
0.703MetLys: 0.703 ± 0.63
0.703MetLeu: 0.703 ± 0.617
0.703MetMet: 0.703 ± 0.682
0.703MetAsn: 0.703 ± 0.617
1.405MetPro: 1.405 ± 0.755
0.703MetGln: 0.703 ± 0.537
1.405MetArg: 1.405 ± 1.102
2.811MetSer: 2.811 ± 1.13
2.811MetThr: 2.811 ± 1.871
0.703MetVal: 0.703 ± 0.617
0.703MetTrp: 0.703 ± 0.537
3.514MetTyr: 3.514 ± 1.636
0.0MetXaa: 0.0 ± 0.0
Asn
4.216AsnAla: 4.216 ± 1.015
3.514AsnCys: 3.514 ± 1.011
2.108AsnAsp: 2.108 ± 1.119
3.514AsnGlu: 3.514 ± 2.086
0.703AsnPhe: 0.703 ± 0.665
2.108AsnGly: 2.108 ± 0.875
4.919AsnHis: 4.919 ± 2.768
4.216AsnIle: 4.216 ± 0.705
0.703AsnLys: 0.703 ± 0.682
2.108AsnLeu: 2.108 ± 1.143
2.811AsnMet: 2.811 ± 1.216
2.108AsnAsn: 2.108 ± 0.875
2.811AsnPro: 2.811 ± 0.808
2.108AsnGln: 2.108 ± 1.048
5.622AsnArg: 5.622 ± 1.371
2.811AsnSer: 2.811 ± 1.144
3.514AsnThr: 3.514 ± 2.268
4.216AsnVal: 4.216 ± 1.35
0.0AsnTrp: 0.0 ± 0.0
3.514AsnTyr: 3.514 ± 0.93
0.0AsnXaa: 0.0 ± 0.0
Pro
0.703ProAla: 0.703 ± 0.537
1.405ProCys: 1.405 ± 0.711
2.108ProAsp: 2.108 ± 0.703
2.108ProGlu: 2.108 ± 1.846
1.405ProPhe: 1.405 ± 0.601
2.811ProGly: 2.811 ± 1.7
2.811ProHis: 2.811 ± 1.621
4.216ProIle: 4.216 ± 3.08
4.216ProLys: 4.216 ± 1.099
2.811ProLeu: 2.811 ± 1.354
2.108ProMet: 2.108 ± 1.335
2.108ProAsn: 2.108 ± 1.141
2.108ProPro: 2.108 ± 0.864
2.108ProGln: 2.108 ± 1.846
2.811ProArg: 2.811 ± 1.512
4.919ProSer: 4.919 ± 2.535
2.108ProThr: 2.108 ± 0.72
2.108ProVal: 2.108 ± 1.119
1.405ProTrp: 1.405 ± 0.601
1.405ProTyr: 1.405 ± 0.826
0.0ProXaa: 0.0 ± 0.0
Gln
2.811GlnAla: 2.811 ± 0.808
1.405GlnCys: 1.405 ± 0.998
1.405GlnAsp: 1.405 ± 0.998
3.514GlnGlu: 3.514 ± 1.326
2.108GlnPhe: 2.108 ± 1.109
2.108GlnGly: 2.108 ± 1.846
0.0GlnHis: 0.0 ± 0.0
3.514GlnIle: 3.514 ± 1.869
0.0GlnLys: 0.0 ± 0.0
2.811GlnLeu: 2.811 ± 2.52
0.703GlnMet: 0.703 ± 0.617
2.108GlnAsn: 2.108 ± 1.121
3.514GlnPro: 3.514 ± 2.573
1.405GlnGln: 1.405 ± 0.601
3.514GlnArg: 3.514 ± 0.927
4.919GlnSer: 4.919 ± 2.054
0.0GlnThr: 0.0 ± 0.0
2.811GlnVal: 2.811 ± 1.459
0.0GlnTrp: 0.0 ± 0.0
2.108GlnTyr: 2.108 ± 0.703
0.0GlnXaa: 0.0 ± 0.0
Arg
4.919ArgAla: 4.919 ± 2.164
2.108ArgCys: 2.108 ± 1.338
4.919ArgAsp: 4.919 ± 1.571
0.703ArgGlu: 0.703 ± 0.537
7.73ArgPhe: 7.73 ± 2.781
5.622ArgGly: 5.622 ± 1.616
2.108ArgHis: 2.108 ± 1.143
3.514ArgIle: 3.514 ± 0.926
2.811ArgLys: 2.811 ± 0.529
2.108ArgLeu: 2.108 ± 1.324
2.108ArgMet: 2.108 ± 0.784
0.703ArgAsn: 0.703 ± 0.682
3.514ArgPro: 3.514 ± 1.293
2.108ArgGln: 2.108 ± 0.864
6.325ArgArg: 6.325 ± 2.927
5.622ArgSer: 5.622 ± 1.229
6.325ArgThr: 6.325 ± 2.075
6.325ArgVal: 6.325 ± 1.756
0.0ArgTrp: 0.0 ± 0.0
2.108ArgTyr: 2.108 ± 1.143
0.0ArgXaa: 0.0 ± 0.0
Ser
7.027SerAla: 7.027 ± 2.375
1.405SerCys: 1.405 ± 0.818
2.811SerAsp: 2.811 ± 0.529
1.405SerGlu: 1.405 ± 0.826
4.216SerPhe: 4.216 ± 1.289
4.216SerGly: 4.216 ± 1.17
3.514SerHis: 3.514 ± 1.793
6.325SerIle: 6.325 ± 2.075
5.622SerLys: 5.622 ± 2.048
5.622SerLeu: 5.622 ± 1.719
2.811SerMet: 2.811 ± 1.346
4.216SerAsn: 4.216 ± 1.557
3.514SerPro: 3.514 ± 2.737
3.514SerGln: 3.514 ± 1.62
6.325SerArg: 6.325 ± 1.337
4.216SerSer: 4.216 ± 1.372
6.325SerThr: 6.325 ± 1.826
5.622SerVal: 5.622 ± 1.389
1.405SerTrp: 1.405 ± 1.26
3.514SerTyr: 3.514 ± 1.277
0.0SerXaa: 0.0 ± 0.0
Thr
7.73ThrAla: 7.73 ± 2.48
0.0ThrCys: 0.0 ± 0.0
1.405ThrAsp: 1.405 ± 0.818
2.108ThrGlu: 2.108 ± 1.301
2.811ThrPhe: 2.811 ± 1.871
3.514ThrGly: 3.514 ± 1.258
3.514ThrHis: 3.514 ± 2.06
1.405ThrIle: 1.405 ± 0.998
2.108ThrLys: 2.108 ± 1.085
3.514ThrLeu: 3.514 ± 1.277
1.405ThrMet: 1.405 ± 0.601
4.919ThrAsn: 4.919 ± 0.616
2.811ThrPro: 2.811 ± 1.511
0.703ThrGln: 0.703 ± 0.537
4.216ThrArg: 4.216 ± 1.341
2.811ThrSer: 2.811 ± 1.155
3.514ThrThr: 3.514 ± 1.319
4.216ThrVal: 4.216 ± 1.921
0.703ThrTrp: 0.703 ± 0.682
2.108ThrTyr: 2.108 ± 1.143
0.0ThrXaa: 0.0 ± 0.0
Val
0.703ValAla: 0.703 ± 0.922
0.703ValCys: 0.703 ± 0.63
4.216ValAsp: 4.216 ± 1.55
4.216ValGlu: 4.216 ± 2.104
2.108ValPhe: 2.108 ± 1.143
3.514ValGly: 3.514 ± 1.487
1.405ValHis: 1.405 ± 0.961
4.216ValIle: 4.216 ± 1.615
4.919ValLys: 4.919 ± 1.052
4.919ValLeu: 4.919 ± 2.196
3.514ValMet: 3.514 ± 1.815
3.514ValAsn: 3.514 ± 1.802
3.514ValPro: 3.514 ± 0.722
5.622ValGln: 5.622 ± 1.558
2.811ValArg: 2.811 ± 1.459
4.216ValSer: 4.216 ± 1.678
1.405ValThr: 1.405 ± 1.364
2.811ValVal: 2.811 ± 1.086
0.703ValTrp: 0.703 ± 0.665
7.027ValTyr: 7.027 ± 1.787
0.0ValXaa: 0.0 ± 0.0
Trp
2.811TrpAla: 2.811 ± 0.971
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.405TrpGlu: 1.405 ± 0.913
0.0TrpPhe: 0.0 ± 0.0
0.703TrpGly: 0.703 ± 0.537
0.0TrpHis: 0.0 ± 0.0
0.703TrpIle: 0.703 ± 0.922
2.108TrpLys: 2.108 ± 0.681
0.703TrpLeu: 0.703 ± 0.682
1.405TrpMet: 1.405 ± 0.711
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.703TrpGln: 0.703 ± 0.537
1.405TrpArg: 1.405 ± 1.102
0.0TrpSer: 0.0 ± 0.0
2.108TrpThr: 2.108 ± 0.808
1.405TrpVal: 1.405 ± 0.755
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.108TyrAla: 2.108 ± 1.335
0.703TyrCys: 0.703 ± 0.617
2.108TyrAsp: 2.108 ± 1.249
1.405TyrGlu: 1.405 ± 1.364
4.216TyrPhe: 4.216 ± 0.983
2.811TyrGly: 2.811 ± 0.529
0.703TyrHis: 0.703 ± 0.665
4.216TyrIle: 4.216 ± 0.894
1.405TyrLys: 1.405 ± 0.601
4.216TyrLeu: 4.216 ± 2.173
1.405TyrMet: 1.405 ± 0.876
2.811TyrAsn: 2.811 ± 1.029
1.405TyrPro: 1.405 ± 0.744
2.811TyrGln: 2.811 ± 0.529
3.514TyrArg: 3.514 ± 1.719
2.108TyrSer: 2.108 ± 0.703
0.0TyrThr: 0.0 ± 0.0
2.108TyrVal: 2.108 ± 1.38
0.0TyrTrp: 0.0 ± 0.0
1.405TyrTyr: 1.405 ± 0.818
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1424 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski