Amino acid dipeptide frequency for Shuangao toti-like virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
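As an illustration of the calculation described above, the following minimal Python sketch counts overlapping dipeptides and compares them with the uniform expectation of 1/400. The function name `dipeptide_permille`, the toy sequences, and the choice not to count pairs that span two proteins are assumptions made for this example; the page does not describe how the database computes its values.

```python
from collections import Counter

# The 20 standard residues (one-letter codes); anything else is mapped to X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping windows of length 2; pairs never span two proteins (assumption).
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

toy = ["MKKLAGG", "GGSAKK"]  # hypothetical sequences, not the viral proteins
freqs = dipeptide_permille(toy)
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PERMILLE)
```

With sequences this short every observed dipeptide exceeds 2.5‰, so the comparison only becomes informative for proteome-sized inputs.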

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
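A short worked conversion may help here. The sketch below uses the GluGlu value from the table; the helper name `permille_to_fraction` is hypothetical and only illustrates the multiplication by 10^-3.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

glu_glu = 8.756                           # GluGlu entry from the table, in per mille
fraction = permille_to_fraction(glu_glu)  # about 0.008756
percent = fraction * 100                  # about 0.8756 %
uniform = permille_to_fraction(2.5)       # 0.0025, i.e. the 0.25% uniform expectation
ratio = glu_glu / 2.5                     # fold difference versus the uniform expectation
```

By the same arithmetic, the smallest non-zero entry in the table, 0.461‰, corresponds to roughly one dipeptide in 2170, which appears consistent with a single occurrence in a proteome of the size reported under the table.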

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Second residues are listed in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.687 ± 0.09
AlaCys: 0.461 ± 0.471
AlaAsp: 0.922 ± 0.206
AlaGlu: 3.687 ± 0.09
AlaPhe: 0.0 ± 0.0
AlaGly: 5.069 ± 0.703
AlaHis: 0.461 ± 0.264
AlaIle: 1.382 ± 0.793
AlaLys: 4.608 ± 0.296
AlaLeu: 4.147 ± 0.561
AlaMet: 0.461 ± 0.264
AlaAsn: 1.382 ± 0.793
AlaPro: 0.0 ± 0.0
AlaGln: 1.843 ± 1.147
AlaArg: 3.687 ± 0.09
AlaSer: 4.608 ± 2.643
AlaThr: 2.304 ± 1.322
AlaVal: 4.147 ± 0.174
AlaTrp: 0.922 ± 0.206
AlaTyr: 1.382 ± 0.058
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.461 ± 0.471
CysAsp: 1.382 ± 0.677
CysGlu: 0.922 ± 0.941
CysPhe: 0.0 ± 0.0
CysGly: 0.922 ± 0.529
CysHis: 0.461 ± 0.471
CysIle: 1.382 ± 0.793
CysLys: 3.226 ± 0.354
CysLeu: 2.304 ± 0.148
CysMet: 0.922 ± 0.529
CysAsn: 0.461 ± 0.264
CysPro: 1.382 ± 0.677
CysGln: 0.922 ± 0.529
CysArg: 0.461 ± 0.471
CysSer: 0.461 ± 0.471
CysThr: 0.461 ± 0.264
CysVal: 1.382 ± 0.058
CysTrp: 0.0 ± 0.0
CysTyr: 0.461 ± 0.264
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.765 ± 0.116
AspCys: 1.382 ± 0.793
AspAsp: 3.226 ± 0.381
AspGlu: 3.687 ± 0.645
AspPhe: 1.382 ± 0.677
AspGly: 2.304 ± 0.587
AspHis: 0.922 ± 0.529
AspIle: 3.687 ± 1.56
AspLys: 3.226 ± 1.115
AspLeu: 1.382 ± 0.793
AspMet: 0.922 ± 0.206
AspAsn: 3.226 ± 1.115
AspPro: 3.226 ± 1.115
AspGln: 2.304 ± 0.148
AspArg: 5.53 ± 0.232
AspSer: 3.226 ± 1.85
AspThr: 5.53 ± 0.232
AspVal: 1.843 ± 0.322
AspTrp: 3.226 ± 0.354
AspTyr: 1.843 ± 0.413
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.147 ± 1.644
GluCys: 1.843 ± 1.057
GluAsp: 5.53 ± 0.967
GluGlu: 8.756 ± 0.122
GluPhe: 1.382 ± 0.677
GluGly: 5.991 ± 0.973
GluHis: 1.382 ± 0.058
GluIle: 5.53 ± 0.503
GluLys: 3.687 ± 0.645
GluLeu: 5.069 ± 1.502
GluMet: 3.687 ± 0.09
GluAsn: 2.304 ± 0.587
GluPro: 1.843 ± 1.057
GluGln: 0.461 ± 0.471
GluArg: 4.147 ± 0.909
GluSer: 5.53 ± 0.232
GluThr: 4.147 ± 0.174
GluVal: 7.834 ± 1.386
GluTrp: 3.687 ± 0.825
GluTyr: 2.765 ± 0.116
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.843 ± 0.413
PheCys: 0.0 ± 0.0
PheAsp: 1.382 ± 0.058
PheGlu: 1.382 ± 0.058
PhePhe: 0.0 ± 0.0
PheGly: 1.382 ± 0.058
PheHis: 0.922 ± 0.206
PheIle: 0.0 ± 0.0
PheLys: 0.0 ± 0.0
PheLeu: 2.765 ± 0.619
PheMet: 0.461 ± 0.471
PheAsn: 0.461 ± 0.471
PhePro: 0.922 ± 0.206
PheGln: 1.843 ± 1.147
PheArg: 1.843 ± 0.413
PheSer: 1.382 ± 0.058
PheThr: 1.382 ± 0.058
PheVal: 3.226 ± 0.381
PheTrp: 0.0 ± 0.0
PheTyr: 0.461 ± 0.471
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.304 ± 0.148
GlyCys: 0.922 ± 0.941
GlyAsp: 3.687 ± 0.09
GlyGlu: 3.226 ± 0.354
GlyPhe: 2.304 ± 0.883
GlyGly: 8.756 ± 0.122
GlyHis: 0.461 ± 0.471
GlyIle: 5.069 ± 2.173
GlyLys: 5.069 ± 2.237
GlyLeu: 6.912 ± 1.025
GlyMet: 3.687 ± 1.56
GlyAsn: 5.53 ± 0.232
GlyPro: 5.069 ± 0.032
GlyGln: 1.382 ± 0.058
GlyArg: 4.608 ± 0.439
GlySer: 9.217 ± 0.877
GlyThr: 2.304 ± 0.587
GlyVal: 7.373 ± 1.65
GlyTrp: 1.843 ± 0.322
GlyTyr: 3.687 ± 0.825
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.304 ± 0.883
HisCys: 0.461 ± 0.264
HisAsp: 0.461 ± 0.264
HisGlu: 0.922 ± 0.529
HisPhe: 0.461 ± 0.471
HisGly: 2.304 ± 0.883
HisHis: 0.461 ± 0.264
HisIle: 0.461 ± 0.471
HisLys: 1.382 ± 0.058
HisLeu: 3.687 ± 1.56
HisMet: 0.0 ± 0.0
HisAsn: 1.382 ± 0.058
HisPro: 2.304 ± 0.148
HisGln: 1.843 ± 0.322
HisArg: 0.922 ± 0.529
HisSer: 0.461 ± 0.264
HisThr: 0.0 ± 0.0
HisVal: 1.382 ± 0.058
HisTrp: 0.922 ± 0.206
HisTyr: 0.922 ± 0.206
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.226 ± 0.381
IleCys: 1.382 ± 0.058
IleAsp: 3.687 ± 0.645
IleGlu: 2.765 ± 0.619
IlePhe: 0.922 ± 0.206
IleGly: 6.912 ± 0.445
IleHis: 1.843 ± 0.413
IleIle: 2.304 ± 0.148
IleLys: 5.991 ± 3.178
IleLeu: 3.226 ± 1.115
IleMet: 0.922 ± 0.206
IleAsn: 3.226 ± 0.381
IlePro: 4.147 ± 1.296
IleGln: 0.922 ± 0.941
IleArg: 6.912 ± 0.29
IleSer: 3.226 ± 0.381
IleThr: 0.922 ± 0.206
IleVal: 2.765 ± 0.851
IleTrp: 2.304 ± 0.148
IleTyr: 1.382 ± 0.793
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.304 ± 0.587
LysCys: 1.843 ± 0.413
LysAsp: 3.226 ± 0.354
LysGlu: 8.295 ± 1.856
LysPhe: 1.382 ± 0.058
LysGly: 3.687 ± 0.645
LysHis: 1.843 ± 0.322
LysIle: 6.912 ± 1.914
LysLys: 4.608 ± 1.908
LysLeu: 4.608 ± 0.439
LysMet: 2.304 ± 0.883
LysAsn: 1.843 ± 1.147
LysPro: 3.226 ± 1.824
LysGln: 1.843 ± 0.413
LysArg: 3.687 ± 2.295
LysSer: 4.147 ± 0.174
LysThr: 5.069 ± 0.703
LysVal: 5.991 ± 0.973
LysTrp: 1.843 ± 0.413
LysTyr: 3.687 ± 0.645
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.687 ± 0.09
LeuCys: 1.382 ± 0.677
LeuAsp: 5.53 ± 0.232
LeuGlu: 7.834 ± 0.819
LeuPhe: 1.843 ± 1.147
LeuGly: 7.373 ± 2.385
LeuHis: 2.304 ± 0.883
LeuIle: 3.687 ± 1.38
LeuLys: 4.608 ± 0.296
LeuLeu: 4.147 ± 0.561
LeuMet: 2.304 ± 0.417
LeuAsn: 6.452 ± 1.496
LeuPro: 4.147 ± 1.644
LeuGln: 0.461 ± 0.471
LeuArg: 6.452 ± 0.709
LeuSer: 3.226 ± 1.089
LeuThr: 4.608 ± 0.296
LeuVal: 2.304 ± 0.148
LeuTrp: 1.843 ± 0.413
LeuTyr: 1.843 ± 0.413
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.382 ± 0.793
MetCys: 0.461 ± 0.471
MetAsp: 1.382 ± 0.677
MetGlu: 1.382 ± 0.058
MetPhe: 0.461 ± 0.264
MetGly: 1.382 ± 0.058
MetHis: 0.922 ± 0.206
MetIle: 0.461 ± 0.264
MetLys: 3.687 ± 3.765
MetLeu: 1.382 ± 0.058
MetMet: 1.382 ± 0.677
MetAsn: 0.922 ± 0.206
MetPro: 1.843 ± 0.413
MetGln: 0.922 ± 0.529
MetArg: 0.461 ± 0.471
MetSer: 5.069 ± 2.173
MetThr: 1.382 ± 0.793
MetVal: 1.843 ± 0.413
MetTrp: 0.922 ± 0.529
MetTyr: 0.461 ± 0.471
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 0.922 ± 0.941
AsnCys: 0.922 ± 0.941
AsnAsp: 2.304 ± 0.587
AsnGlu: 4.147 ± 0.561
AsnPhe: 0.922 ± 0.206
AsnGly: 2.304 ± 0.587
AsnHis: 2.304 ± 0.587
AsnIle: 4.608 ± 0.296
AsnLys: 4.608 ± 1.174
AsnLeu: 3.687 ± 0.645
AsnMet: 1.382 ± 0.175
AsnAsn: 0.461 ± 0.264
AsnPro: 1.382 ± 0.058
AsnGln: 0.0 ± 0.0
AsnArg: 5.53 ± 1.702
AsnSer: 5.069 ± 0.703
AsnThr: 0.461 ± 0.264
AsnVal: 2.765 ± 0.116
AsnTrp: 1.382 ± 0.793
AsnTyr: 1.843 ± 0.322
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.382 ± 0.793
ProCys: 0.0 ± 0.0
ProAsp: 1.843 ± 0.413
ProGlu: 3.687 ± 2.115
ProPhe: 0.922 ± 0.206
ProGly: 2.765 ± 1.354
ProHis: 0.922 ± 0.941
ProIle: 4.147 ± 2.766
ProLys: 1.382 ± 0.058
ProLeu: 3.687 ± 0.825
ProMet: 0.0 ± 0.0
ProAsn: 3.687 ± 0.825
ProPro: 0.922 ± 0.529
ProGln: 1.382 ± 0.793
ProArg: 1.843 ± 1.057
ProSer: 4.147 ± 0.174
ProThr: 4.147 ± 2.379
ProVal: 3.687 ± 0.09
ProTrp: 0.461 ± 0.471
ProTyr: 0.922 ± 0.206
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.304 ± 0.148
GlnCys: 0.922 ± 0.206
GlnAsp: 1.843 ± 1.057
GlnGlu: 1.382 ± 0.677
GlnPhe: 0.0 ± 0.0
GlnGly: 1.843 ± 0.413
GlnHis: 0.461 ± 0.264
GlnIle: 0.461 ± 0.264
GlnLys: 0.922 ± 0.941
GlnLeu: 1.382 ± 0.793
GlnMet: 0.922 ± 0.529
GlnAsn: 0.922 ± 0.529
GlnPro: 1.382 ± 0.058
GlnGln: 0.461 ± 0.264
GlnArg: 1.843 ± 0.413
GlnSer: 2.304 ± 0.148
GlnThr: 0.922 ± 0.206
GlnVal: 0.922 ± 0.941
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.922 ± 0.941
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.608 ± 0.296
ArgCys: 0.461 ± 0.264
ArgAsp: 4.147 ± 0.174
ArgGlu: 6.912 ± 1.025
ArgPhe: 1.382 ± 0.058
ArgGly: 4.608 ± 0.296
ArgHis: 0.922 ± 0.206
ArgIle: 5.991 ± 0.497
ArgLys: 4.608 ± 0.296
ArgLeu: 5.069 ± 2.972
ArgMet: 3.226 ± 0.354
ArgAsn: 3.226 ± 0.354
ArgPro: 1.382 ± 0.058
ArgGln: 1.843 ± 0.413
ArgArg: 4.147 ± 0.174
ArgSer: 3.687 ± 1.38
ArgThr: 5.069 ± 1.438
ArgVal: 6.452 ± 0.761
ArgTrp: 2.304 ± 0.883
ArgTyr: 2.304 ± 0.883
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.304 ± 1.322
SerCys: 0.0 ± 0.0
SerAsp: 4.608 ± 1.908
SerGlu: 5.991 ± 0.973
SerPhe: 2.304 ± 0.148
SerGly: 7.373 ± 1.29
SerHis: 1.382 ± 0.793
SerIle: 3.687 ± 0.09
SerLys: 4.608 ± 1.174
SerLeu: 5.069 ± 0.703
SerMet: 0.922 ± 0.529
SerAsn: 4.147 ± 2.379
SerPro: 1.382 ± 0.058
SerGln: 1.843 ± 1.057
SerArg: 6.452 ± 0.026
SerSer: 4.147 ± 1.644
SerThr: 3.226 ± 1.115
SerVal: 3.687 ± 0.09
SerTrp: 2.304 ± 0.148
SerTyr: 3.226 ± 0.354
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.382 ± 0.677
ThrCys: 0.922 ± 0.206
ThrAsp: 1.843 ± 1.057
ThrGlu: 4.608 ± 1.908
ThrPhe: 2.765 ± 0.116
ThrGly: 5.991 ± 0.497
ThrHis: 0.0 ± 0.0
ThrIle: 0.922 ± 0.206
ThrLys: 4.608 ± 1.174
ThrLeu: 6.912 ± 1.76
ThrMet: 0.922 ± 0.206
ThrAsn: 0.922 ± 0.941
ThrPro: 2.304 ± 0.587
ThrGln: 0.922 ± 0.529
ThrArg: 4.608 ± 0.296
ThrSer: 2.765 ± 0.116
ThrThr: 1.382 ± 0.793
ThrVal: 4.608 ± 1.908
ThrTrp: 2.304 ± 1.322
ThrTyr: 2.765 ± 0.619
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.304 ± 1.322
ValCys: 2.304 ± 0.148
ValAsp: 5.53 ± 0.967
ValGlu: 4.608 ± 1.174
ValPhe: 1.843 ± 0.413
ValGly: 6.912 ± 1.025
ValHis: 0.922 ± 0.941
ValIle: 3.687 ± 0.825
ValLys: 6.452 ± 1.444
ValLeu: 6.912 ± 1.914
ValMet: 1.843 ± 1.057
ValAsn: 2.765 ± 0.851
ValPro: 3.226 ± 0.354
ValGln: 0.461 ± 0.264
ValArg: 4.147 ± 1.296
ValSer: 2.304 ± 0.148
ValThr: 4.608 ± 0.439
ValVal: 2.304 ± 1.618
ValTrp: 1.382 ± 1.412
ValTyr: 3.226 ± 1.115
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.922 ± 0.206
TrpCys: 0.461 ± 0.264
TrpAsp: 0.922 ± 0.529
TrpGlu: 4.608 ± 1.031
TrpPhe: 0.461 ± 0.264
TrpGly: 1.382 ± 0.058
TrpHis: 1.382 ± 0.677
TrpIle: 2.765 ± 0.116
TrpLys: 2.765 ± 0.619
TrpLeu: 0.922 ± 0.941
TrpMet: 0.922 ± 0.206
TrpAsn: 1.382 ± 1.412
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 3.226 ± 0.381
TrpSer: 1.843 ± 1.057
TrpThr: 2.765 ± 0.619
TrpVal: 1.382 ± 0.793
TrpTrp: 0.922 ± 0.529
TrpTyr: 0.922 ± 0.206
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.382 ± 0.793
TyrCys: 1.382 ± 0.793
TyrAsp: 1.382 ± 0.058
TyrGlu: 0.922 ± 0.206
TyrPhe: 0.922 ± 0.529
TyrGly: 4.147 ± 2.766
TyrHis: 2.765 ± 0.116
TyrIle: 1.843 ± 0.413
TyrLys: 2.304 ± 0.883
TyrLeu: 3.226 ± 0.354
TyrMet: 0.461 ± 0.264
TyrAsn: 2.304 ± 1.322
TyrPro: 1.843 ± 0.413
TyrGln: 0.461 ± 0.471
TyrArg: 1.843 ± 0.413
TyrSer: 1.843 ± 0.322
TyrThr: 2.765 ± 0.619
TyrVal: 1.843 ± 0.413
TyrTrp: 1.382 ± 0.058
TyrTyr: 0.461 ± 0.264
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

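
For readers who want to work with the listing programmatically, the following Python sketch parses entries of the form `AlaAla: 3.687 ± 0.09` and labels each value relative to the uniform expectation. The names `parse_entries` and `classify` are hypothetical, and the 2.5‰ threshold is an assumption taken from the 0.25% figure in the introduction; the page does not state the exact cut-off used for its red and blue marking.

```python
import re

# Matches entries such as "AlaAla: 3.687 ± 0.09"; any text before the
# dipeptide label on the same line is ignored.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)

def parse_entries(text):
    """Return {(first, second): (frequency, error)}, both in per mille."""
    table = {}
    for first, second, freq, err in ENTRY.findall(text):
        table[(first, second)] = (float(freq), float(err))
    return table

def classify(freq_permille, expected=2.5):
    """Label a value relative to the assumed uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

sample = """AlaAla: 3.687 ± 0.09
AlaCys: 0.461 ± 0.471
GlySer: 9.217 ± 0.877"""
table = parse_entries(sample)
labels = {pair: classify(freq) for pair, (freq, _) in table.items()}
```

In this sample, AlaAla and GlySer fall above the assumed threshold and AlaCys below it.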
Statistics based on 2 proteins (2171 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
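
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The function names, the toy sequences, and the use of the standard deviation are assumptions; the source does not specify these details.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(resample, dipeptide))
    return statistics.pstdev(estimates)

toy = ["MKKLAGGKK", "GGSAKLLG"]  # hypothetical one-letter sequences
err = bootstrap_error(toy, "KK")
```

With only two proteins, as here, each replicate can take just three distinct compositions, so the estimated errors are necessarily coarse.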

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski