Amino acid dipepetide frequency for Wuhan arthropod virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.028AlaAla: 2.028 ± 1.303
1.014AlaCys: 1.014 ± 0.089
2.366AlaAsp: 2.366 ± 1.108
2.028AlaGlu: 2.028 ± 0.179
3.381AlaPhe: 3.381 ± 0.031
4.057AlaGly: 4.057 ± 1.618
1.014AlaHis: 1.014 ± 0.404
2.366AlaIle: 2.366 ± 0.867
3.719AlaLys: 3.719 ± 0.657
4.057AlaLeu: 4.057 ± 0.63
1.014AlaMet: 1.014 ± 0.404
1.69AlaAsn: 1.69 ± 1.003
3.719AlaPro: 3.719 ± 1.812
3.381AlaGln: 3.381 ± 0.525
2.366AlaArg: 2.366 ± 0.121
4.395AlaSer: 4.395 ± 0.93
2.366AlaThr: 2.366 ± 0.373
3.381AlaVal: 3.381 ± 0.031
0.338AlaTrp: 0.338 ± 0.299
3.381AlaTyr: 3.381 ± 0.031
0.0AlaXaa: 0.0 ± 0.0
Cys
1.69CysAla: 1.69 ± 0.478
0.0CysCys: 0.0 ± 0.0
0.676CysAsp: 0.676 ± 0.105
2.028CysGlu: 2.028 ± 0.673
3.043CysPhe: 3.043 ± 1.213
2.028CysGly: 2.028 ± 1.167
0.676CysHis: 0.676 ± 0.105
2.028CysIle: 2.028 ± 0.179
1.014CysLys: 1.014 ± 0.089
1.69CysLeu: 1.69 ± 0.478
1.352CysMet: 1.352 ± 0.566
0.676CysAsn: 0.676 ± 0.389
1.69CysPro: 1.69 ± 0.51
0.676CysGln: 0.676 ± 0.105
0.338CysArg: 0.338 ± 0.299
1.69CysSer: 1.69 ± 0.51
0.338CysThr: 0.338 ± 0.299
1.352CysVal: 1.352 ± 0.778
0.0CysTrp: 0.0 ± 0.0
0.338CysTyr: 0.338 ± 0.299
0.0CysXaa: 0.0 ± 0.0
Asp
2.366AspAla: 2.366 ± 1.361
1.352AspCys: 1.352 ± 0.778
3.719AspAsp: 3.719 ± 0.163
4.733AspGlu: 4.733 ± 0.253
3.719AspPhe: 3.719 ± 0.331
1.69AspGly: 1.69 ± 0.016
1.69AspHis: 1.69 ± 0.016
5.071AspIle: 5.071 ± 0.047
2.366AspLys: 2.366 ± 0.615
6.423AspLeu: 6.423 ± 1.739
1.014AspMet: 1.014 ± 0.583
2.366AspAsn: 2.366 ± 0.615
2.028AspPro: 2.028 ± 0.179
1.352AspGln: 1.352 ± 0.284
2.028AspArg: 2.028 ± 0.179
2.366AspSer: 2.366 ± 0.615
3.381AspThr: 3.381 ± 0.463
3.043AspVal: 3.043 ± 0.72
1.69AspTrp: 1.69 ± 0.478
2.028AspTyr: 2.028 ± 0.179
0.0AspXaa: 0.0 ± 0.0
Glu
3.719GluAla: 3.719 ± 0.657
2.366GluCys: 2.366 ± 0.867
3.043GluAsp: 3.043 ± 0.268
5.409GluGlu: 5.409 ± 1.629
2.705GluPhe: 2.705 ± 1.062
2.366GluGly: 2.366 ± 0.867
0.338GluHis: 0.338 ± 0.299
6.761GluIle: 6.761 ± 1.913
3.381GluLys: 3.381 ± 0.957
4.057GluLeu: 4.057 ± 0.136
0.676GluMet: 0.676 ± 0.389
3.719GluAsn: 3.719 ± 2.139
0.676GluPro: 0.676 ± 0.389
3.381GluGln: 3.381 ± 0.031
3.719GluArg: 3.719 ± 0.657
4.733GluSer: 4.733 ± 1.734
2.028GluThr: 2.028 ± 0.179
2.366GluVal: 2.366 ± 0.121
0.676GluTrp: 0.676 ± 0.105
2.705GluTyr: 2.705 ± 0.568
0.0GluXaa: 0.0 ± 0.0
Phe
4.733PheAla: 4.733 ± 3.205
0.338PheCys: 0.338 ± 0.299
4.395PheAsp: 4.395 ± 1.046
3.719PheGlu: 3.719 ± 1.151
0.676PhePhe: 0.676 ± 0.389
2.028PheGly: 2.028 ± 0.179
1.352PheHis: 1.352 ± 0.21
4.733PheIle: 4.733 ± 1.229
2.705PheLys: 2.705 ± 0.42
3.719PheLeu: 3.719 ± 0.163
1.69PheMet: 1.69 ± 0.478
3.719PheAsn: 3.719 ± 0.163
1.69PhePro: 1.69 ± 0.972
1.69PheGln: 1.69 ± 1.497
3.719PheArg: 3.719 ± 0.163
5.747PheSer: 5.747 ± 0.836
3.381PheThr: 3.381 ± 1.019
3.381PheVal: 3.381 ± 0.463
0.676PheTrp: 0.676 ± 0.389
1.69PheTyr: 1.69 ± 0.478
0.0PheXaa: 0.0 ± 0.0
Gly
1.69GlyAla: 1.69 ± 1.003
1.352GlyCys: 1.352 ± 0.21
3.043GlyAsp: 3.043 ± 0.72
1.69GlyGlu: 1.69 ± 1.003
3.381GlyPhe: 3.381 ± 0.463
2.705GlyGly: 2.705 ± 0.42
1.69GlyHis: 1.69 ± 0.972
3.381GlyIle: 3.381 ± 0.463
3.719GlyLys: 3.719 ± 0.657
3.719GlyLeu: 3.719 ± 0.657
1.014GlyMet: 1.014 ± 0.089
4.057GlyAsn: 4.057 ± 1.618
1.014GlyPro: 1.014 ± 0.089
2.366GlyGln: 2.366 ± 0.121
2.705GlyArg: 2.705 ± 0.42
3.043GlySer: 3.043 ± 1.213
2.028GlyThr: 2.028 ± 0.315
4.057GlyVal: 4.057 ± 0.358
0.676GlyTrp: 0.676 ± 0.105
1.69GlyTyr: 1.69 ± 0.016
0.0GlyXaa: 0.0 ± 0.0
His
0.338HisAla: 0.338 ± 0.299
0.338HisCys: 0.338 ± 0.194
0.676HisAsp: 0.676 ± 0.389
0.0HisGlu: 0.0 ± 0.0
1.014HisPhe: 1.014 ± 0.404
0.676HisGly: 0.676 ± 0.105
0.338HisHis: 0.338 ± 0.194
2.705HisIle: 2.705 ± 0.568
1.352HisLys: 1.352 ± 0.778
1.014HisLeu: 1.014 ± 0.583
0.676HisMet: 0.676 ± 0.389
0.676HisAsn: 0.676 ± 0.389
1.014HisPro: 1.014 ± 0.404
0.338HisGln: 0.338 ± 0.194
0.0HisArg: 0.0 ± 0.0
1.69HisSer: 1.69 ± 0.016
1.014HisThr: 1.014 ± 0.089
2.366HisVal: 2.366 ± 0.615
0.338HisTrp: 0.338 ± 0.299
0.676HisTyr: 0.676 ± 0.389
0.0HisXaa: 0.0 ± 0.0
Ile
4.395IleAla: 4.395 ± 1.54
2.705IleCys: 2.705 ± 0.568
7.437IleAsp: 7.437 ± 2.302
6.085IleGlu: 6.085 ± 1.524
3.719IlePhe: 3.719 ± 0.163
3.719IleGly: 3.719 ± 0.825
0.676IleHis: 0.676 ± 0.389
5.071IleIle: 5.071 ± 1.435
3.381IleLys: 3.381 ± 0.463
4.395IleLeu: 4.395 ± 0.552
1.69IleMet: 1.69 ± 1.075
5.409IleAsn: 5.409 ± 0.346
4.733IlePro: 4.733 ± 0.735
2.028IleGln: 2.028 ± 0.179
3.719IleArg: 3.719 ± 0.657
6.085IleSer: 6.085 ± 0.945
2.705IleThr: 2.705 ± 1.408
5.409IleVal: 5.409 ± 0.642
1.352IleTrp: 1.352 ± 0.284
2.366IleTyr: 2.366 ± 0.373
0.0IleXaa: 0.0 ± 0.0
Lys
3.043LysAla: 3.043 ± 0.762
1.352LysCys: 1.352 ± 0.284
5.071LysAsp: 5.071 ± 1.435
4.057LysGlu: 4.057 ± 1.345
2.028LysPhe: 2.028 ± 0.673
2.366LysGly: 2.366 ± 1.361
1.69LysHis: 1.69 ± 0.016
6.761LysIle: 6.761 ± 1.419
3.719LysLys: 3.719 ± 0.657
3.043LysLeu: 3.043 ± 0.268
1.69LysMet: 1.69 ± 0.972
4.733LysAsn: 4.733 ± 2.228
2.366LysPro: 2.366 ± 0.373
3.043LysGln: 3.043 ± 0.226
3.719LysArg: 3.719 ± 0.657
3.719LysSer: 3.719 ± 1.151
4.395LysThr: 4.395 ± 0.93
3.043LysVal: 3.043 ± 0.226
1.014LysTrp: 1.014 ± 0.089
2.028LysTyr: 2.028 ± 0.315
0.0LysXaa: 0.0 ± 0.0
Leu
3.719LeuAla: 3.719 ± 0.331
1.014LeuCys: 1.014 ± 0.404
3.381LeuAsp: 3.381 ± 0.525
6.085LeuGlu: 6.085 ± 0.536
3.719LeuPhe: 3.719 ± 0.657
4.733LeuGly: 4.733 ± 0.253
2.705LeuHis: 2.705 ± 0.568
6.085LeuIle: 6.085 ± 1.524
4.733LeuLys: 4.733 ± 1.734
5.071LeuLeu: 5.071 ± 1.929
1.014LeuMet: 1.014 ± 0.089
3.043LeuAsn: 3.043 ± 0.268
2.705LeuPro: 2.705 ± 0.914
3.719LeuGln: 3.719 ± 0.825
2.705LeuArg: 2.705 ± 0.074
7.437LeuSer: 7.437 ± 0.82
5.747LeuThr: 5.747 ± 0.646
4.395LeuVal: 4.395 ± 0.436
0.676LeuTrp: 0.676 ± 0.599
4.057LeuTyr: 4.057 ± 0.358
0.0LeuXaa: 0.0 ± 0.0
Met
2.028MetAla: 2.028 ± 0.809
0.676MetCys: 0.676 ± 0.389
1.352MetAsp: 1.352 ± 0.284
0.676MetGlu: 0.676 ± 0.389
1.69MetPhe: 1.69 ± 0.016
0.0MetGly: 0.0 ± 0.0
0.338MetHis: 0.338 ± 0.194
1.014MetIle: 1.014 ± 0.583
2.028MetLys: 2.028 ± 1.167
1.69MetLeu: 1.69 ± 0.972
0.338MetMet: 0.338 ± 0.194
1.014MetAsn: 1.014 ± 0.089
1.352MetPro: 1.352 ± 0.21
1.352MetGln: 1.352 ± 0.21
1.69MetArg: 1.69 ± 0.478
1.014MetSer: 1.014 ± 0.898
1.014MetThr: 1.014 ± 0.089
0.338MetVal: 0.338 ± 0.299
0.676MetTrp: 0.676 ± 0.389
2.366MetTyr: 2.366 ± 0.373
0.0MetXaa: 0.0 ± 0.0
Asn
2.366AsnAla: 2.366 ± 0.373
1.352AsnCys: 1.352 ± 0.778
2.705AsnAsp: 2.705 ± 0.074
2.366AsnGlu: 2.366 ± 0.867
4.395AsnPhe: 4.395 ± 0.552
1.352AsnGly: 1.352 ± 0.704
1.352AsnHis: 1.352 ± 0.284
5.071AsnIle: 5.071 ± 0.047
5.071AsnLys: 5.071 ± 1.435
4.057AsnLeu: 4.057 ± 0.852
1.014AsnMet: 1.014 ± 0.404
3.043AsnAsn: 3.043 ± 0.268
3.043AsnPro: 3.043 ± 1.213
2.705AsnGln: 2.705 ± 0.074
1.352AsnArg: 1.352 ± 0.284
5.409AsnSer: 5.409 ± 0.346
3.043AsnThr: 3.043 ± 0.226
3.381AsnVal: 3.381 ± 0.031
0.338AsnTrp: 0.338 ± 0.299
4.395AsnTyr: 4.395 ± 0.058
0.0AsnXaa: 0.0 ± 0.0
Pro
1.352ProAla: 1.352 ± 0.704
2.366ProCys: 2.366 ± 1.108
2.028ProAsp: 2.028 ± 0.809
2.705ProGlu: 2.705 ± 0.568
4.733ProPhe: 4.733 ± 0.735
1.69ProGly: 1.69 ± 0.016
0.0ProHis: 0.0 ± 0.0
1.69ProIle: 1.69 ± 0.478
2.705ProLys: 2.705 ± 0.568
5.409ProLeu: 5.409 ± 1.334
1.352ProMet: 1.352 ± 0.704
1.69ProAsn: 1.69 ± 0.51
2.705ProPro: 2.705 ± 1.408
3.043ProGln: 3.043 ± 0.72
2.028ProArg: 2.028 ± 0.809
4.057ProSer: 4.057 ± 1.124
4.733ProThr: 4.733 ± 2.711
2.028ProVal: 2.028 ± 0.179
0.338ProTrp: 0.338 ± 0.299
1.69ProTyr: 1.69 ± 1.003
0.0ProXaa: 0.0 ± 0.0
Gln
1.014GlnAla: 1.014 ± 0.404
1.352GlnCys: 1.352 ± 0.21
1.69GlnAsp: 1.69 ± 0.016
3.043GlnGlu: 3.043 ± 0.762
3.719GlnPhe: 3.719 ± 1.319
4.057GlnGly: 4.057 ± 0.136
0.338GlnHis: 0.338 ± 0.194
4.057GlnIle: 4.057 ± 0.136
2.028GlnLys: 2.028 ± 0.179
2.028GlnLeu: 2.028 ± 0.179
1.352GlnMet: 1.352 ± 1.198
3.381GlnAsn: 3.381 ± 0.463
1.352GlnPro: 1.352 ± 0.284
2.705GlnGln: 2.705 ± 0.568
2.028GlnArg: 2.028 ± 0.179
4.733GlnSer: 4.733 ± 1.723
1.014GlnThr: 1.014 ± 0.404
2.028GlnVal: 2.028 ± 0.809
1.014GlnTrp: 1.014 ± 0.089
2.705GlnTyr: 2.705 ± 1.408
0.0GlnXaa: 0.0 ± 0.0
Arg
2.705ArgAla: 2.705 ± 1.902
0.676ArgCys: 0.676 ± 0.105
2.028ArgAsp: 2.028 ± 0.179
2.366ArgGlu: 2.366 ± 1.361
2.366ArgPhe: 2.366 ± 0.121
1.69ArgGly: 1.69 ± 0.51
0.338ArgHis: 0.338 ± 0.194
3.381ArgIle: 3.381 ± 0.957
3.381ArgLys: 3.381 ± 1.451
5.071ArgLeu: 5.071 ± 0.047
0.338ArgMet: 0.338 ± 0.194
3.719ArgAsn: 3.719 ± 0.657
2.028ArgPro: 2.028 ± 0.809
1.352ArgGln: 1.352 ± 0.704
1.352ArgArg: 1.352 ± 0.284
3.043ArgSer: 3.043 ± 1.256
2.366ArgThr: 2.366 ± 0.121
4.395ArgVal: 4.395 ± 0.436
0.338ArgTrp: 0.338 ± 0.194
1.69ArgTyr: 1.69 ± 0.478
0.0ArgXaa: 0.0 ± 0.0
Ser
2.366SerAla: 2.366 ± 1.602
2.028SerCys: 2.028 ± 0.809
2.366SerAsp: 2.366 ± 0.615
2.366SerGlu: 2.366 ± 0.867
4.057SerPhe: 4.057 ± 1.124
4.733SerGly: 4.733 ± 2.217
0.676SerHis: 0.676 ± 0.105
6.423SerIle: 6.423 ± 0.237
4.733SerLys: 4.733 ± 0.747
7.437SerLeu: 7.437 ± 0.82
2.366SerMet: 2.366 ± 0.867
5.409SerAsn: 5.409 ± 0.346
5.747SerPro: 5.747 ± 2.128
5.747SerGln: 5.747 ± 0.342
2.366SerArg: 2.366 ± 0.373
5.071SerSer: 5.071 ± 0.047
4.057SerThr: 4.057 ± 0.63
4.057SerVal: 4.057 ± 1.124
1.014SerTrp: 1.014 ± 0.089
4.057SerTyr: 4.057 ± 0.358
0.0SerXaa: 0.0 ± 0.0
Thr
3.043ThrAla: 3.043 ± 1.707
1.352ThrCys: 1.352 ± 0.284
2.705ThrAsp: 2.705 ± 1.902
3.043ThrGlu: 3.043 ± 0.762
2.028ThrPhe: 2.028 ± 0.179
4.057ThrGly: 4.057 ± 1.124
0.676ThrHis: 0.676 ± 0.105
3.043ThrIle: 3.043 ± 0.72
3.043ThrLys: 3.043 ± 0.226
4.057ThrLeu: 4.057 ± 0.136
1.014ThrMet: 1.014 ± 0.583
3.381ThrAsn: 3.381 ± 0.525
4.733ThrPro: 4.733 ± 0.735
2.366ThrGln: 2.366 ± 0.615
3.043ThrArg: 3.043 ± 0.226
3.381ThrSer: 3.381 ± 0.525
4.057ThrThr: 4.057 ± 1.124
5.747ThrVal: 5.747 ± 0.152
0.338ThrTrp: 0.338 ± 0.299
1.014ThrTyr: 1.014 ± 0.089
0.0ThrXaa: 0.0 ± 0.0
Val
6.423ValAla: 6.423 ± 0.257
0.338ValCys: 0.338 ± 0.299
3.719ValAsp: 3.719 ± 0.657
3.719ValGlu: 3.719 ± 0.657
1.69ValPhe: 1.69 ± 0.51
3.043ValGly: 3.043 ± 0.268
0.676ValHis: 0.676 ± 0.105
2.705ValIle: 2.705 ± 0.074
5.071ValLys: 5.071 ± 0.447
5.071ValLeu: 5.071 ± 0.941
2.028ValMet: 2.028 ± 0.179
2.366ValAsn: 2.366 ± 0.373
3.381ValPro: 3.381 ± 2.501
2.366ValGln: 2.366 ± 1.108
3.043ValArg: 3.043 ± 0.226
4.395ValSer: 4.395 ± 0.436
4.057ValThr: 4.057 ± 0.358
4.395ValVal: 4.395 ± 2.411
0.338ValTrp: 0.338 ± 0.194
3.719ValTyr: 3.719 ± 0.331
0.0ValXaa: 0.0 ± 0.0
Trp
1.352TrpAla: 1.352 ± 0.284
0.338TrpCys: 0.338 ± 0.194
0.338TrpAsp: 0.338 ± 0.194
0.338TrpGlu: 0.338 ± 0.299
0.338TrpPhe: 0.338 ± 0.299
0.0TrpGly: 0.0 ± 0.0
0.338TrpHis: 0.338 ± 0.194
0.338TrpIle: 0.338 ± 0.299
1.69TrpLys: 1.69 ± 0.478
0.676TrpLeu: 0.676 ± 0.105
0.0TrpMet: 0.0 ± 0.0
1.352TrpAsn: 1.352 ± 0.284
0.338TrpPro: 0.338 ± 0.299
1.014TrpGln: 1.014 ± 0.089
0.0TrpArg: 0.0 ± 0.0
1.014TrpSer: 1.014 ± 0.898
1.69TrpThr: 1.69 ± 0.016
0.676TrpVal: 0.676 ± 0.105
0.0TrpTrp: 0.0 ± 0.0
0.676TrpTyr: 0.676 ± 0.389
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.69TyrAla: 1.69 ± 0.016
1.014TyrCys: 1.014 ± 0.898
1.69TyrAsp: 1.69 ± 1.003
2.366TyrGlu: 2.366 ± 1.361
3.381TyrPhe: 3.381 ± 0.957
2.028TyrGly: 2.028 ± 0.179
0.338TyrHis: 0.338 ± 0.194
4.733TyrIle: 4.733 ± 0.241
3.043TyrLys: 3.043 ± 0.762
4.057TyrLeu: 4.057 ± 0.136
0.676TyrMet: 0.676 ± 0.389
2.366TyrAsn: 2.366 ± 0.121
2.028TyrPro: 2.028 ± 0.809
1.014TyrGln: 1.014 ± 0.089
2.705TyrArg: 2.705 ± 0.568
4.057TyrSer: 4.057 ± 1.618
2.705TyrThr: 2.705 ± 0.074
2.705TyrVal: 2.705 ± 1.062
0.676TyrTrp: 0.676 ± 0.105
2.028TyrTyr: 2.028 ± 0.809
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2959 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski