Amino acid dipeptide frequency for Wuhan house centipede virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
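As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the database's own code: the function name, the one-letter toy sequences, and the choice to count pairs within each protein and pool them over the proteome are all assumptions made for this example.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: dipeptides are overlapping pairs of adjacent residues,
    counted within each protein and pooled over the whole proteome.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent residue pair in the sequence
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences in one-letter code (hypothetical, not taken from this proteome)
toy = ["MKKLA", "AAK"]
freqs = dipeptide_frequencies(toy)
```

The frequencies of all observed dipeptides sum to 1000‰, and pairs that never occur are simply absent from the result.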

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
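The uniform expectation and the resulting over/under labelling can be sketched as follows. The helper names are hypothetical, and the sketch assumes that the cut-off is exactly the uniform expectation with no tolerance band, which the page does not state explicitly.

```python
STANDARD = 20   # standard amino acids
WITH_XAA = 21   # plus Xaa for non-standard or unknown residues

def uniform_expectation_permille(alphabet_size):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed cut-off)."""
    if freq_permille > expected_permille:
        return "over"    # corresponds to red in the original table
    if freq_permille < expected_permille:
        return "under"   # corresponds to blue in the original table
    return "expected"

expected = uniform_expectation_permille(STANDARD)   # 1000 / 400 = 2.5 per mille

# Two values from the table: SerLeu (12.801) and AlaCys (0.753)
ser_leu = classify(12.801, expected)
ala_cys = classify(0.753, expected)
```

With 21 symbols the expectation drops to 1000 / 441, roughly 2.27‰, so the choice of alphabet shifts the boundary slightly.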

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
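For readers who prefer fractions or percentages, the conversion is a single multiplication. The two helper names below are made up for this example; the input value is the LeuArg entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0

leu_arg = 9.789                                 # LeuArg frequency in per mille
leu_arg_fraction = permille_to_fraction(leu_arg)  # about 0.009789
leu_arg_percent = permille_to_percent(leu_arg)    # about 0.9789 %
```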

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is frequency ± error, in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.518 ± 0.759 | 0.753 ± 0.569 | 3.012 ± 1.394 | 3.765 ± 1.039 | 3.765 ± 0.786 | 2.259 ± 0.701 | 0.753 ± 0.494 | 3.012 ± 1.144 | 3.765 ± 1.617 | 6.024 ± 1.406 | 2.259 ± 0.61 | 1.506 ± 0.828 | 3.765 ± 1.208 | 5.271 ± 0.98 | 8.283 ± 2.023 | 3.765 ± 0.786 | 3.765 ± 1.187 | 6.024 ± 3.819 | 1.506 ± 0.726 | 3.765 ± 1.95 | 0.753 ± 0.569
Cys | 0.753 ± 0.494 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.753 ± 0.796 | 0.753 ± 0.796 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.506 ± 1.592 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.506 ± 0.404 | 0.0 ± 0.0 | 0.753 ± 0.796 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.012 ± 0.684 | 0.0 ± 0.0
Asp | 3.012 ± 0.684 | 0.753 ± 0.796 | 5.271 ± 0.637 | 5.271 ± 0.482 | 3.012 ± 0.808 | 3.012 ± 1.144 | 0.0 ± 0.0 | 0.753 ± 0.569 | 4.518 ± 0.834 | 3.765 ± 2.472 | 0.753 ± 0.64 | 1.506 ± 0.989 | 1.506 ± 0.828 | 0.753 ± 0.494 | 2.259 ± 0.954 | 4.518 ± 1.464 | 3.765 ± 1.621 | 5.271 ± 0.482 | 3.765 ± 1.214 | 3.765 ± 1.617 | 0.0 ± 0.0
Glu | 4.518 ± 0.759 | 0.0 ± 0.0 | 4.518 ± 1.254 | 3.765 ± 0.201 | 4.518 ± 1.093 | 5.271 ± 1.832 | 0.0 ± 0.0 | 1.506 ± 1.138 | 1.506 ± 0.726 | 6.777 ± 0.867 | 3.012 ± 1.334 | 0.0 ± 0.0 | 3.012 ± 1.144 | 3.765 ± 0.786 | 2.259 ± 0.854 | 5.271 ± 2.324 | 3.765 ± 1.772 | 3.012 ± 0.389 | 0.753 ± 0.569 | 0.753 ± 0.796 | 0.0 ± 0.0
Phe | 0.753 ± 0.494 | 0.753 ± 0.796 | 2.259 ± 1.483 | 6.024 ± 0.515 | 0.0 ± 0.0 | 3.765 ± 0.201 | 0.0 ± 0.0 | 1.506 ± 0.726 | 0.0 ± 0.0 | 3.765 ± 0.786 | 0.753 ± 0.494 | 0.753 ± 0.796 | 0.753 ± 0.796 | 2.259 ± 0.854 | 2.259 ± 0.854 | 6.777 ± 1.786 | 1.506 ± 0.989 | 3.012 ± 1.452 | 1.506 ± 0.726 | 1.506 ± 0.726 | 0.0 ± 0.0
Gly | 3.012 ± 2.275 | 2.259 ± 1.521 | 3.765 ± 0.786 | 3.765 ± 1.187 | 3.765 ± 0.786 | 5.271 ± 0.637 | 0.0 ± 0.0 | 2.259 ± 0.701 | 2.259 ± 0.854 | 6.024 ± 1.814 | 3.012 ± 1.144 | 0.753 ± 0.494 | 3.765 ± 1.214 | 3.012 ± 0.808 | 3.012 ± 0.808 | 6.024 ± 1.717 | 6.024 ± 1.615 | 3.765 ± 0.786 | 1.506 ± 0.726 | 3.012 ± 0.684 | 0.0 ± 0.0
His | 0.753 ± 0.569 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.753 ± 0.494 | 0.0 ± 0.0 | 2.259 ± 1.483 | 0.753 ± 0.494 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.753 ± 0.494 | 2.259 ± 0.417 | 1.506 ± 0.828 | 0.753 ± 0.569 | 1.506 ± 0.404 | 4.518 ± 1.093 | 0.753 ± 0.494 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 4.518 ± 1.212 | 0.753 ± 0.494 | 2.259 ± 0.701 | 0.753 ± 0.494 | 2.259 ± 0.854 | 2.259 ± 0.701 | 3.012 ± 1.144 | 2.259 ± 0.854 | 2.259 ± 0.954 | 3.765 ± 1.032 | 0.753 ± 0.796 | 0.0 ± 0.0 | 3.012 ± 0.389 | 2.259 ± 0.701 | 3.012 ± 0.808 | 4.518 ± 1.212 | 0.753 ± 0.494 | 0.753 ± 0.569 | 1.506 ± 1.138 | 1.506 ± 1.592 | 0.0 ± 0.0
Lys | 4.518 ± 0.759 | 0.0 ± 0.0 | 2.259 ± 1.441 | 0.0 ± 0.0 | 3.765 ± 1.039 | 2.259 ± 0.854 | 2.259 ± 0.954 | 3.012 ± 1.144 | 7.53 ± 1.461 | 2.259 ± 1.441 | 1.506 ± 0.828 | 1.506 ± 0.989 | 3.012 ± 0.808 | 3.012 ± 0.389 | 3.012 ± 0.808 | 4.518 ± 0.834 | 1.506 ± 0.404 | 2.259 ± 0.417 | 0.753 ± 0.494 | 0.0 ± 0.0 | 0.0 ± 0.0
Leu | 7.53 ± 4.278 | 0.0 ± 0.0 | 3.765 ± 0.201 | 4.518 ± 1.254 | 1.506 ± 0.726 | 6.024 ± 0.778 | 0.753 ± 0.494 | 6.024 ± 0.615 | 3.765 ± 1.214 | 6.777 ± 1.351 | 0.753 ± 0.494 | 4.518 ± 0.319 | 6.024 ± 1.368 | 4.518 ± 1.708 | 9.789 ± 2.396 | 5.271 ± 0.637 | 6.024 ± 3.304 | 6.024 ± 2.278 | 3.012 ± 2.283 | 5.271 ± 2.595 | 0.0 ± 0.0
Met | 1.506 ± 0.989 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.012 ± 1.334 | 0.0 ± 0.0 | 1.506 ± 0.726 | 1.506 ± 0.989 | 1.506 ± 0.404 | 3.765 ± 1.999 | 3.012 ± 0.389 | 0.753 ± 0.494 | 0.753 ± 0.569 | 2.259 ± 0.417 | 1.506 ± 0.989 | 0.0 ± 0.0 | 0.753 ± 0.569 | 2.259 ± 0.854 | 1.506 ± 0.404 | 0.0 ± 0.0 | 2.259 ± 0.954 | 0.0 ± 0.0
Asn | 1.506 ± 0.404 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.506 ± 0.726 | 3.012 ± 1.144 | 0.753 ± 0.796 | 0.0 ± 0.0 | 0.753 ± 0.494 | 0.753 ± 0.494 | 2.259 ± 0.701 | 0.753 ± 0.494 | 0.753 ± 0.569 | 0.0 ± 0.0 | 0.753 ± 0.494 | 1.506 ± 0.989 | 3.765 ± 1.214 | 0.753 ± 0.494 | 2.259 ± 0.854 | 1.506 ± 0.404 | 0.0 ± 0.0 | 0.0 ± 0.0
Pro | 6.024 ± 2.329 | 0.753 ± 0.796 | 3.765 ± 1.039 | 3.012 ± 1.334 | 0.0 ± 0.0 | 3.012 ± 0.684 | 0.753 ± 0.796 | 1.506 ± 0.989 | 0.753 ± 0.494 | 6.024 ± 1.616 | 3.012 ± 0.808 | 3.012 ± 0.389 | 2.259 ± 0.701 | 0.753 ± 0.569 | 3.012 ± 1.394 | 4.518 ± 0.319 | 3.765 ± 1.032 | 3.765 ± 1.208 | 1.506 ± 1.138 | 1.506 ± 0.404 | 0.0 ± 0.0
Gln | 3.012 ± 0.808 | 0.0 ± 0.0 | 2.259 ± 0.417 | 3.012 ± 1.334 | 1.506 ± 1.592 | 3.012 ± 0.808 | 2.259 ± 0.417 | 3.012 ± 0.808 | 1.506 ± 0.726 | 4.518 ± 1.731 | 2.259 ± 0.607 | 0.753 ± 0.494 | 0.753 ± 0.569 | 3.012 ± 1.452 | 2.259 ± 0.417 | 3.765 ± 1.208 | 0.0 ± 0.0 | 3.765 ± 1.208 | 1.506 ± 1.138 | 1.506 ± 0.989 | 0.0 ± 0.0
Arg | 5.271 ± 2.287 | 0.753 ± 0.796 | 5.271 ± 1.737 | 4.518 ± 0.319 | 3.765 ± 1.214 | 3.012 ± 0.684 | 1.506 ± 0.726 | 4.518 ± 1.402 | 1.506 ± 1.138 | 6.024 ± 2.723 | 0.0 ± 0.0 | 3.765 ± 1.617 | 6.024 ± 1.814 | 1.506 ± 0.726 | 7.53 ± 2.417 | 2.259 ± 0.854 | 3.765 ± 1.032 | 5.271 ± 1.832 | 2.259 ± 1.441 | 3.012 ± 1.144 | 0.0 ± 0.0
Ser | 9.789 ± 2.145 | 1.506 ± 0.404 | 3.765 ± 1.617 | 2.259 ± 1.176 | 2.259 ± 0.701 | 8.283 ± 0.97 | 0.753 ± 0.569 | 1.506 ± 0.828 | 3.012 ± 0.684 | 12.801 ± 1.323 | 0.753 ± 0.494 | 0.753 ± 0.494 | 5.271 ± 0.482 | 0.753 ± 0.494 | 3.765 ± 0.201 | 5.271 ± 1.401 | 6.024 ± 1.006 | 6.024 ± 2.053 | 1.506 ± 0.726 | 0.0 ± 0.0 | 0.0 ± 0.0
Thr | 3.012 ± 0.808 | 0.753 ± 0.494 | 3.765 ± 1.187 | 1.506 ± 0.726 | 0.753 ± 0.494 | 3.765 ± 0.201 | 0.753 ± 0.494 | 1.506 ± 0.404 | 3.012 ± 1.144 | 5.271 ± 2.036 | 1.506 ± 0.726 | 0.0 ± 0.0 | 4.518 ± 1.708 | 1.506 ± 0.726 | 6.024 ± 1.3 | 8.283 ± 2.177 | 4.518 ± 1.212 | 5.271 ± 3.076 | 2.259 ± 0.701 | 2.259 ± 1.707 | 0.0 ± 0.0
Val | 3.012 ± 1.394 | 0.0 ± 0.0 | 6.777 ± 1.803 | 5.271 ± 1.587 | 3.765 ± 1.032 | 6.777 ± 1.975 | 0.0 ± 0.0 | 1.506 ± 0.828 | 4.518 ± 0.834 | 7.53 ± 2.987 | 4.518 ± 1.254 | 0.753 ± 0.569 | 3.765 ± 1.95 | 3.012 ± 2.275 | 5.271 ± 1.618 | 3.012 ± 0.808 | 5.271 ± 1.401 | 4.518 ± 2.234 | 3.012 ± 0.389 | 1.506 ± 0.404 | 0.0 ± 0.0
Trp | 3.765 ± 2.316 | 0.0 ± 0.0 | 3.765 ± 1.214 | 1.506 ± 0.989 | 0.0 ± 0.0 | 1.506 ± 0.828 | 0.0 ± 0.0 | 1.506 ± 0.404 | 1.506 ± 0.404 | 2.259 ± 0.417 | 0.0 ± 0.0 | 0.753 ± 0.569 | 0.753 ± 0.494 | 1.506 ± 1.138 | 3.765 ± 1.621 | 1.506 ± 0.726 | 3.012 ± 1.652 | 3.765 ± 2.139 | 0.753 ± 0.494 | 0.753 ± 0.569 | 0.0 ± 0.0
Tyr | 2.259 ± 1.441 | 0.0 ± 0.0 | 0.753 ± 0.494 | 4.518 ± 2.966 | 1.506 ± 0.989 | 3.012 ± 1.394 | 2.259 ± 1.707 | 2.259 ± 1.176 | 2.259 ± 0.701 | 1.506 ± 0.828 | 0.753 ± 0.569 | 0.753 ± 0.494 | 1.506 ± 0.726 | 1.506 ± 1.592 | 1.506 ± 0.989 | 1.506 ± 0.828 | 1.506 ± 1.138 | 3.012 ± 0.389 | 3.012 ± 1.656 | 0.753 ± 0.494 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.753 ± 0.569 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 3 proteins (1329 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
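A protein-level bootstrap of this kind might look roughly like the sketch below. The exact procedure behind the table is not described here, so several points are assumptions: whole proteins are resampled with replacement, each resample has as many proteins as the original proteome, and the error is the sample standard deviation over 100 rounds. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(proteins):
    """Count overlapping adjacent residue pairs over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Std. dev. of one dipeptide's per-mille frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        counts = pair_counts(sample)
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy proteome in one-letter code (hypothetical sequences)
toy = ["MKKLA", "AAK", "KKKK"]
kk_error = bootstrap_error(toy, "KK")
```

With only three proteins, as in this proteome, each resample can differ a lot from the original, which would be consistent with the fairly large errors in the table.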

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski