Amino acid dipepetide frequency for Semliki forest virus (SFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.043AlaAla: 10.043 ± 1.477
2.172AlaCys: 2.172 ± 0.116
4.886AlaAsp: 4.886 ± 0.462
7.058AlaGlu: 7.058 ± 2.409
4.072AlaPhe: 4.072 ± 0.866
4.072AlaGly: 4.072 ± 0.866
1.086AlaHis: 1.086 ± 1.089
4.072AlaIle: 4.072 ± 0.371
3.529AlaLys: 3.529 ± 0.173
6.786AlaLeu: 6.786 ± 0.206
2.443AlaMet: 2.443 ± 0.437
2.172AlaAsn: 2.172 ± 0.116
5.429AlaPro: 5.429 ± 0.083
1.629AlaGln: 1.629 ± 0.396
5.157AlaArg: 5.157 ± 0.635
6.786AlaSer: 6.786 ± 0.619
7.6AlaThr: 7.6 ± 1.023
8.686AlaVal: 8.686 ± 0.363
0.271AlaTrp: 0.271 ± 0.272
4.343AlaTyr: 4.343 ± 0.231
0.0AlaXaa: 0.0 ± 0.0
Cys
2.443CysAla: 2.443 ± 0.388
2.172CysCys: 2.172 ± 0.941
1.357CysAsp: 1.357 ± 0.289
1.086CysGlu: 1.086 ± 0.264
1.357CysPhe: 1.357 ± 0.536
2.443CysGly: 2.443 ± 0.025
1.357CysHis: 1.357 ± 0.124
0.814CysIle: 0.814 ± 0.817
1.086CysLys: 1.086 ± 0.264
2.443CysLeu: 2.443 ± 0.388
0.271CysMet: 0.271 ± 0.272
0.814CysAsn: 0.814 ± 0.421
1.357CysPro: 1.357 ± 0.289
0.543CysGln: 0.543 ± 0.132
2.986CysArg: 2.986 ± 0.107
1.629CysSer: 1.629 ± 0.809
2.443CysThr: 2.443 ± 0.388
1.629CysVal: 1.629 ± 0.016
0.271CysTrp: 0.271 ± 0.14
1.357CysTyr: 1.357 ± 0.124
0.0CysXaa: 0.0 ± 0.0
Asp
4.615AspAla: 4.615 ± 1.972
1.357AspCys: 1.357 ± 0.949
1.357AspAsp: 1.357 ± 0.701
2.714AspGlu: 2.714 ± 0.577
1.629AspPhe: 1.629 ± 0.396
2.443AspGly: 2.443 ± 0.025
2.443AspHis: 2.443 ± 0.8
2.714AspIle: 2.714 ± 0.99
3.257AspLys: 3.257 ± 0.033
4.615AspLeu: 4.615 ± 0.322
2.443AspMet: 2.443 ± 0.388
2.986AspAsn: 2.986 ± 0.932
2.443AspPro: 2.443 ± 0.025
0.814AspGln: 0.814 ± 0.008
3.8AspArg: 3.8 ± 0.726
4.343AspSer: 4.343 ± 1.006
3.529AspThr: 3.529 ± 0.998
4.886AspVal: 4.886 ± 1.699
0.271AspTrp: 0.271 ± 0.14
0.814AspTyr: 0.814 ± 0.404
0.0AspXaa: 0.0 ± 0.0
Glu
5.429GluAla: 5.429 ± 0.33
0.814GluCys: 0.814 ± 0.404
2.714GluAsp: 2.714 ± 0.577
3.8GluGlu: 3.8 ± 0.726
1.9GluPhe: 1.9 ± 0.157
3.529GluGly: 3.529 ± 0.239
1.086GluHis: 1.086 ± 0.148
1.9GluIle: 1.9 ± 0.157
3.257GluLys: 3.257 ± 0.033
2.714GluLeu: 2.714 ± 1.402
0.0GluMet: 0.0 ± 0.0
2.986GluAsn: 2.986 ± 0.107
2.172GluPro: 2.172 ± 0.528
1.9GluGln: 1.9 ± 0.569
4.343GluArg: 4.343 ± 1.006
1.357GluSer: 1.357 ± 0.289
2.172GluThr: 2.172 ± 0.116
4.072GluVal: 4.072 ± 0.866
1.357GluTrp: 1.357 ± 0.124
3.257GluTyr: 3.257 ± 0.445
0.0GluXaa: 0.0 ± 0.0
Phe
1.629PheAla: 1.629 ± 0.396
1.086PheCys: 1.086 ± 0.148
4.072PheAsp: 4.072 ± 1.279
0.814PheGlu: 0.814 ± 0.008
0.271PhePhe: 0.271 ± 0.14
2.443PheGly: 2.443 ± 0.388
0.814PheHis: 0.814 ± 0.421
1.629PheIle: 1.629 ± 0.016
2.443PheLys: 2.443 ± 0.437
1.629PheLeu: 1.629 ± 0.016
0.543PheMet: 0.543 ± 0.132
2.443PheAsn: 2.443 ± 0.025
2.172PhePro: 2.172 ± 0.941
1.357PheGln: 1.357 ± 0.124
1.086PheArg: 1.086 ± 0.561
3.529PheSer: 3.529 ± 0.586
1.9PheThr: 1.9 ± 0.256
1.629PheVal: 1.629 ± 0.396
0.271PheTrp: 0.271 ± 0.272
0.271PheTyr: 0.271 ± 0.272
0.0PheXaa: 0.0 ± 0.0
Gly
5.157GlyAla: 5.157 ± 1.46
1.086GlyCys: 1.086 ± 0.148
4.343GlyAsp: 4.343 ± 0.231
2.443GlyGlu: 2.443 ± 0.437
2.172GlyPhe: 2.172 ± 0.297
3.8GlyGly: 3.8 ± 0.924
1.629GlyHis: 1.629 ± 0.809
1.9GlyIle: 1.9 ± 0.157
5.7GlyLys: 5.7 ± 1.18
4.343GlyLeu: 4.343 ± 1.006
1.357GlyMet: 1.357 ± 0.949
1.9GlyAsn: 1.9 ± 1.081
1.629GlyPro: 1.629 ± 0.429
0.814GlyGln: 0.814 ± 0.008
4.615GlyArg: 4.615 ± 0.091
3.529GlySer: 3.529 ± 0.586
3.529GlyThr: 3.529 ± 1.064
4.343GlyVal: 4.343 ± 1.006
0.543GlyTrp: 0.543 ± 0.132
2.443GlyTyr: 2.443 ± 0.025
0.0GlyXaa: 0.0 ± 0.0
His
3.257HisAla: 3.257 ± 0.033
0.271HisCys: 0.271 ± 0.14
1.357HisAsp: 1.357 ± 0.536
1.629HisGlu: 1.629 ± 0.396
1.086HisPhe: 1.086 ± 0.677
1.629HisGly: 1.629 ± 0.809
1.086HisHis: 1.086 ± 0.264
1.086HisIle: 1.086 ± 0.677
0.814HisLys: 0.814 ± 0.008
2.172HisLeu: 2.172 ± 0.709
0.543HisMet: 0.543 ± 0.545
1.086HisAsn: 1.086 ± 0.561
2.172HisPro: 2.172 ± 0.297
1.357HisGln: 1.357 ± 0.289
0.814HisArg: 0.814 ± 0.008
2.443HisSer: 2.443 ± 1.625
2.443HisThr: 2.443 ± 0.85
2.172HisVal: 2.172 ± 0.116
0.543HisTrp: 0.543 ± 0.132
1.086HisTyr: 1.086 ± 0.264
0.0HisXaa: 0.0 ± 0.0
Ile
4.072IleAla: 4.072 ± 1.279
1.086IleCys: 1.086 ± 0.561
2.443IleAsp: 2.443 ± 0.388
2.714IleGlu: 2.714 ± 0.66
1.357IlePhe: 1.357 ± 0.949
2.172IleGly: 2.172 ± 0.528
1.629IleHis: 1.629 ± 0.429
2.443IleIle: 2.443 ± 0.437
1.9IleLys: 1.9 ± 0.569
3.257IleLeu: 3.257 ± 0.033
0.271IleMet: 0.271 ± 0.14
1.086IleAsn: 1.086 ± 0.264
4.343IlePro: 4.343 ± 0.644
2.172IleGln: 2.172 ± 0.528
1.629IleArg: 1.629 ± 0.016
1.9IleSer: 1.9 ± 0.256
3.8IleThr: 3.8 ± 0.099
4.072IleVal: 4.072 ± 0.041
0.0IleTrp: 0.0 ± 0.0
0.543IleTyr: 0.543 ± 0.28
0.0IleXaa: 0.0 ± 0.0
Lys
2.986LysAla: 2.986 ± 0.52
1.629LysCys: 1.629 ± 0.396
1.357LysAsp: 1.357 ± 0.949
1.629LysGlu: 1.629 ± 0.396
1.086LysPhe: 1.086 ± 0.148
3.8LysGly: 3.8 ± 0.313
1.357LysHis: 1.357 ± 0.289
4.343LysIle: 4.343 ± 0.644
6.515LysLys: 6.515 ± 1.997
5.157LysLeu: 5.157 ± 1.84
1.357LysMet: 1.357 ± 0.701
0.543LysAsn: 0.543 ± 0.132
5.429LysPro: 5.429 ± 0.908
2.172LysGln: 2.172 ± 0.709
2.986LysArg: 2.986 ± 0.718
4.343LysSer: 4.343 ± 1.006
4.615LysThr: 4.615 ± 0.503
6.786LysVal: 6.786 ± 0.206
1.086LysTrp: 1.086 ± 0.148
2.172LysTyr: 2.172 ± 0.528
0.0LysXaa: 0.0 ± 0.0
Leu
6.786LeuAla: 6.786 ± 1.031
1.9LeuCys: 1.9 ± 0.256
4.343LeuAsp: 4.343 ± 1.831
3.8LeuGlu: 3.8 ± 0.099
2.714LeuPhe: 2.714 ± 0.577
4.072LeuGly: 4.072 ± 0.041
1.629LeuHis: 1.629 ± 0.396
2.986LeuIle: 2.986 ± 0.52
4.343LeuLys: 4.343 ± 0.181
6.515LeuLeu: 6.515 ± 1.716
0.814LeuMet: 0.814 ± 0.421
2.986LeuAsn: 2.986 ± 0.718
3.529LeuPro: 3.529 ± 1.411
4.886LeuGln: 4.886 ± 0.462
3.529LeuArg: 3.529 ± 0.586
5.7LeuSer: 5.7 ± 0.058
7.058LeuThr: 7.058 ± 0.066
6.786LeuVal: 6.786 ± 1.444
1.357LeuTrp: 1.357 ± 0.124
3.529LeuTyr: 3.529 ± 0.998
0.0LeuXaa: 0.0 ± 0.0
Met
2.172MetAla: 2.172 ± 0.297
1.086MetCys: 1.086 ± 0.264
1.086MetAsp: 1.086 ± 0.561
0.543MetGlu: 0.543 ± 0.132
0.814MetPhe: 0.814 ± 0.421
0.543MetGly: 0.543 ± 0.132
1.086MetHis: 1.086 ± 0.148
0.271MetIle: 0.271 ± 0.14
2.443MetLys: 2.443 ± 0.437
1.357MetLeu: 1.357 ± 0.536
0.814MetMet: 0.814 ± 0.421
1.086MetAsn: 1.086 ± 0.677
0.543MetPro: 0.543 ± 0.545
1.086MetGln: 1.086 ± 0.264
1.9MetArg: 1.9 ± 0.157
1.357MetSer: 1.357 ± 0.124
1.086MetThr: 1.086 ± 0.148
1.357MetVal: 1.357 ± 0.124
0.271MetTrp: 0.271 ± 0.272
1.086MetTyr: 1.086 ± 0.148
0.0MetXaa: 0.0 ± 0.0
Asn
3.257AsnAla: 3.257 ± 1.205
1.086AsnCys: 1.086 ± 0.264
2.172AsnAsp: 2.172 ± 0.116
1.629AsnGlu: 1.629 ± 0.429
1.086AsnPhe: 1.086 ± 0.561
2.172AsnGly: 2.172 ± 0.528
1.086AsnHis: 1.086 ± 0.148
2.714AsnIle: 2.714 ± 0.577
2.443AsnLys: 2.443 ± 0.388
1.357AsnLeu: 1.357 ± 0.124
0.814AsnMet: 0.814 ± 0.421
0.814AsnAsn: 0.814 ± 0.404
1.357AsnPro: 1.357 ± 0.289
1.357AsnGln: 1.357 ± 0.536
1.086AsnArg: 1.086 ± 0.561
0.814AsnSer: 0.814 ± 0.404
2.986AsnThr: 2.986 ± 0.107
4.615AsnVal: 4.615 ± 1.328
0.543AsnTrp: 0.543 ± 0.132
1.357AsnTyr: 1.357 ± 0.289
0.0AsnXaa: 0.0 ± 0.0
Pro
3.8ProAla: 3.8 ± 0.099
2.172ProCys: 2.172 ± 0.528
3.257ProAsp: 3.257 ± 1.205
1.9ProGlu: 1.9 ± 0.157
2.443ProPhe: 2.443 ± 0.8
4.072ProGly: 4.072 ± 1.196
1.086ProHis: 1.086 ± 0.677
2.172ProIle: 2.172 ± 0.709
4.072ProLys: 4.072 ± 0.371
4.886ProLeu: 4.886 ± 0.462
1.086ProMet: 1.086 ± 0.148
1.357ProAsn: 1.357 ± 0.289
3.529ProPro: 3.529 ± 0.652
1.086ProGln: 1.086 ± 0.264
3.8ProArg: 3.8 ± 0.726
5.157ProSer: 5.157 ± 1.015
4.615ProThr: 4.615 ± 0.091
7.329ProVal: 7.329 ± 0.751
0.814ProTrp: 0.814 ± 0.404
2.443ProTyr: 2.443 ± 1.625
0.0ProXaa: 0.0 ± 0.0
Gln
3.257GlnAla: 3.257 ± 0.38
1.357GlnCys: 1.357 ± 0.124
1.357GlnAsp: 1.357 ± 0.124
2.172GlnGlu: 2.172 ± 0.709
0.814GlnPhe: 0.814 ± 0.817
0.543GlnGly: 0.543 ± 0.28
0.543GlnHis: 0.543 ± 0.132
1.629GlnIle: 1.629 ± 0.809
1.9GlnLys: 1.9 ± 0.157
2.172GlnLeu: 2.172 ± 0.116
1.9GlnMet: 1.9 ± 0.157
1.357GlnAsn: 1.357 ± 0.124
2.172GlnPro: 2.172 ± 0.528
2.443GlnGln: 2.443 ± 1.213
0.814GlnArg: 0.814 ± 0.421
1.629GlnSer: 1.629 ± 0.016
2.443GlnThr: 2.443 ± 0.388
2.172GlnVal: 2.172 ± 0.297
0.271GlnTrp: 0.271 ± 0.14
1.357GlnTyr: 1.357 ± 0.949
0.0GlnXaa: 0.0 ± 0.0
Arg
5.429ArgAla: 5.429 ± 0.33
1.357ArgCys: 1.357 ± 0.701
1.357ArgAsp: 1.357 ± 0.701
4.072ArgGlu: 4.072 ± 0.454
1.629ArgPhe: 1.629 ± 0.016
1.9ArgGly: 1.9 ± 0.982
0.814ArgHis: 0.814 ± 0.404
2.172ArgIle: 2.172 ± 0.297
3.257ArgLys: 3.257 ± 1.27
5.429ArgLeu: 5.429 ± 2.392
1.357ArgMet: 1.357 ± 0.33
2.172ArgAsn: 2.172 ± 0.116
5.429ArgPro: 5.429 ± 1.32
0.814ArgGln: 0.814 ± 0.008
3.257ArgArg: 3.257 ± 0.445
4.343ArgSer: 4.343 ± 0.231
4.072ArgThr: 4.072 ± 0.041
3.529ArgVal: 3.529 ± 0.173
0.271ArgTrp: 0.271 ± 0.272
1.9ArgTyr: 1.9 ± 0.569
0.0ArgXaa: 0.0 ± 0.0
Ser
7.329SerAla: 7.329 ± 0.338
1.9SerCys: 1.9 ± 0.256
3.529SerAsp: 3.529 ± 0.239
2.986SerGlu: 2.986 ± 0.52
2.172SerPhe: 2.172 ± 0.297
5.429SerGly: 5.429 ± 0.495
1.357SerHis: 1.357 ± 0.124
1.9SerIle: 1.9 ± 0.157
2.172SerLys: 2.172 ± 0.116
7.6SerLeu: 7.6 ± 1.039
0.543SerMet: 0.543 ± 0.132
1.629SerAsn: 1.629 ± 0.396
4.072SerPro: 4.072 ± 0.371
1.357SerGln: 1.357 ± 0.536
3.8SerArg: 3.8 ± 1.138
4.615SerSer: 4.615 ± 0.734
4.072SerThr: 4.072 ± 0.866
3.257SerVal: 3.257 ± 0.38
1.086SerTrp: 1.086 ± 0.148
1.357SerTyr: 1.357 ± 0.289
0.0SerXaa: 0.0 ± 0.0
Thr
8.686ThrAla: 8.686 ± 0.462
3.8ThrCys: 3.8 ± 1.749
4.615ThrAsp: 4.615 ± 1.559
3.257ThrGlu: 3.257 ± 1.27
1.9ThrPhe: 1.9 ± 0.157
4.343ThrGly: 4.343 ± 0.594
1.357ThrHis: 1.357 ± 0.124
3.529ThrIle: 3.529 ± 0.652
3.8ThrLys: 3.8 ± 0.313
6.243ThrLeu: 6.243 ± 1.312
2.714ThrMet: 2.714 ± 0.165
1.9ThrAsn: 1.9 ± 0.256
4.343ThrPro: 4.343 ± 0.644
2.714ThrGln: 2.714 ± 1.073
2.986ThrArg: 2.986 ± 0.52
1.9ThrSer: 1.9 ± 0.157
4.615ThrThr: 4.615 ± 0.322
7.329ThrVal: 7.329 ± 0.074
0.543ThrTrp: 0.543 ± 0.132
2.172ThrTyr: 2.172 ± 0.116
0.0ThrXaa: 0.0 ± 0.0
Val
5.7ValAla: 5.7 ± 0.767
2.443ValCys: 2.443 ± 0.85
5.972ValAsp: 5.972 ± 1.435
4.072ValGlu: 4.072 ± 0.041
2.714ValPhe: 2.714 ± 0.577
4.343ValGly: 4.343 ± 0.644
3.8ValHis: 3.8 ± 0.099
2.986ValIle: 2.986 ± 0.718
5.429ValLys: 5.429 ± 0.33
7.6ValLeu: 7.6 ± 1.452
2.172ValMet: 2.172 ± 0.116
4.072ValAsn: 4.072 ± 0.866
5.7ValPro: 5.7 ± 0.767
1.357ValGln: 1.357 ± 0.536
4.886ValArg: 4.886 ± 0.049
4.072ValSer: 4.072 ± 0.371
7.329ValThr: 7.329 ± 0.338
7.058ValVal: 7.058 ± 1.304
0.271ValTrp: 0.271 ± 0.14
3.257ValTyr: 3.257 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
1.086TrpAla: 1.086 ± 0.561
0.0TrpCys: 0.0 ± 0.0
0.543TrpAsp: 0.543 ± 0.132
0.543TrpGlu: 0.543 ± 0.28
0.271TrpPhe: 0.271 ± 0.14
0.543TrpGly: 0.543 ± 0.545
0.814TrpHis: 0.814 ± 0.008
0.543TrpIle: 0.543 ± 0.28
0.271TrpLys: 0.271 ± 0.14
1.086TrpLeu: 1.086 ± 0.264
0.0TrpMet: 0.0 ± 0.0
0.271TrpAsn: 0.271 ± 0.272
1.086TrpPro: 1.086 ± 0.264
0.543TrpGln: 0.543 ± 0.132
0.271TrpArg: 0.271 ± 0.272
1.086TrpSer: 1.086 ± 0.264
0.814TrpThr: 0.814 ± 0.404
1.086TrpVal: 1.086 ± 0.264
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.157TyrAla: 5.157 ± 0.602
1.086TyrCys: 1.086 ± 0.677
1.357TyrAsp: 1.357 ± 0.124
1.9TyrGlu: 1.9 ± 0.157
0.543TyrPhe: 0.543 ± 0.28
2.986TyrGly: 2.986 ± 0.52
2.986TyrHis: 2.986 ± 0.305
1.086TyrIle: 1.086 ± 0.677
2.714TyrLys: 2.714 ± 1.073
1.9TyrLeu: 1.9 ± 0.569
0.271TyrMet: 0.271 ± 0.272
1.086TyrAsn: 1.086 ± 0.264
2.172TyrPro: 2.172 ± 0.297
1.9TyrGln: 1.9 ± 0.157
1.086TyrArg: 1.086 ± 0.148
1.629TyrSer: 1.629 ± 0.016
1.629TyrThr: 1.629 ± 0.809
2.443TyrVal: 2.443 ± 0.025
0.814TyrTrp: 0.814 ± 0.008
1.357TyrTyr: 1.357 ± 0.536
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3685 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski