Amino acid dipeptide frequency for Wenzhou qinvirus-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
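As an illustration of the definitions above, the following minimal Python sketch counts overlapping dipeptides within each protein and reports their frequencies in per mille, next to the uniform expectation of 2.5‰. The function name `dipeptide_permille`, the one-letter toy sequences, and the choice to pool counts over all proteins are assumptions made for this example; the exact procedure used to build the table is not described here.

```python
from collections import Counter

# Uniform expectation for the 400 standard dipeptides, in per mille.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Dipeptides are counted as overlapping pairs inside each sequence;
    pairs spanning two different proteins are not counted.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (hypothetical sequences, one-letter amino acid codes).
freqs = dipeptide_permille(["MAAAK", "AAL"])
for dp, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_EXPECTATION else "under"
    print(f"{dp}: {value:.3f} per mille ({label}represented vs. uniform)")
```

Multiplying any reported value by 10^-3 gives the plain fraction, for example 14.248‰ corresponds to 0.014248.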

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
14.248AlaAla: 14.248 ± 6.015
2.226AlaCys: 2.226 ± 0.997
8.014AlaAsp: 8.014 ± 1.576
6.679AlaGlu: 6.679 ± 0.924
3.117AlaPhe: 3.117 ± 0.363
5.788AlaGly: 5.788 ± 4.638
4.007AlaHis: 4.007 ± 0.761
5.788AlaIle: 5.788 ± 0.526
5.788AlaLys: 5.788 ± 1.559
8.014AlaLeu: 8.014 ± 4.674
4.452AlaMet: 4.452 ± 1.993
4.452AlaAsn: 4.452 ± 1.105
6.233AlaPro: 6.233 ± 1.758
3.117AlaGln: 3.117 ± 1.703
9.795AlaArg: 9.795 ± 0.778
6.233AlaSer: 6.233 ± 1.34
6.233AlaThr: 6.233 ± 1.34
9.795AlaVal: 9.795 ± 1.287
1.781AlaTrp: 1.781 ± 0.797
6.233AlaTyr: 6.233 ± 0.308
0.0AlaXaa: 0.0 ± 0.0
Cys
1.781CysAla: 1.781 ± 0.797
0.0CysCys: 0.0 ± 0.0
0.89CysAsp: 0.89 ± 0.399
0.0CysGlu: 0.0 ± 0.0
0.89CysPhe: 0.89 ± 0.399
1.781CysGly: 1.781 ± 0.235
0.0CysHis: 0.0 ± 0.0
1.781CysIle: 1.781 ± 0.797
1.781CysLys: 1.781 ± 0.235
0.445CysLeu: 0.445 ± 0.833
0.445CysMet: 0.445 ± 0.199
0.89CysAsn: 0.89 ± 0.399
1.781CysPro: 1.781 ± 0.235
0.445CysGln: 0.445 ± 0.199
0.89CysArg: 0.89 ± 0.399
1.781CysSer: 1.781 ± 0.797
0.89CysThr: 0.89 ± 0.634
0.89CysVal: 0.89 ± 0.634
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
9.35AspAla: 9.35 ± 3.043
2.226AspCys: 2.226 ± 1.069
4.898AspAsp: 4.898 ± 2.971
4.898AspGlu: 4.898 ± 2.193
0.0AspPhe: 0.0 ± 0.0
4.898AspGly: 4.898 ± 0.906
0.89AspHis: 0.89 ± 0.399
3.117AspIle: 3.117 ± 0.363
0.89AspLys: 0.89 ± 0.399
3.562AspLeu: 3.562 ± 0.562
2.671AspMet: 2.671 ± 0.163
1.336AspAsn: 1.336 ± 0.598
4.452AspPro: 4.452 ± 0.961
2.671AspGln: 2.671 ± 0.87
4.452AspArg: 4.452 ± 1.993
2.671AspSer: 2.671 ± 0.163
3.117AspThr: 3.117 ± 1.395
6.233AspVal: 6.233 ± 1.758
0.445AspTrp: 0.445 ± 0.833
2.671AspTyr: 2.671 ± 1.902
0.0AspXaa: 0.0 ± 0.0
Glu
5.343GluAla: 5.343 ± 1.359
0.0GluCys: 0.0 ± 0.0
1.781GluAsp: 1.781 ± 0.235
4.898GluGlu: 4.898 ± 0.127
2.226GluPhe: 2.226 ± 0.997
3.117GluGly: 3.117 ± 1.395
3.117GluHis: 3.117 ± 1.395
3.562GluIle: 3.562 ± 1.595
0.0GluLys: 0.0 ± 0.0
5.343GluLeu: 5.343 ± 0.326
0.89GluMet: 0.89 ± 0.278
0.89GluAsn: 0.89 ± 0.634
2.226GluPro: 2.226 ± 0.036
1.781GluGln: 1.781 ± 0.235
8.459GluArg: 8.459 ± 2.755
3.117GluSer: 3.117 ± 0.363
3.117GluThr: 3.117 ± 0.363
5.343GluVal: 5.343 ± 1.739
0.89GluTrp: 0.89 ± 0.399
2.226GluTyr: 2.226 ± 0.997
0.0GluXaa: 0.0 ± 0.0
Phe
3.562PheAla: 3.562 ± 1.595
0.89PheCys: 0.89 ± 0.399
2.671PheAsp: 2.671 ± 0.163
1.781PheGlu: 1.781 ± 1.268
0.445PhePhe: 0.445 ± 0.199
0.445PheGly: 0.445 ± 0.199
0.89PheHis: 0.89 ± 0.399
1.336PheIle: 1.336 ± 0.598
0.89PheLys: 0.89 ± 0.399
2.226PheLeu: 2.226 ± 0.997
2.226PheMet: 2.226 ± 0.036
1.336PheAsn: 1.336 ± 0.598
0.0PhePro: 0.0 ± 0.0
2.226PheGln: 2.226 ± 0.997
2.671PheArg: 2.671 ± 1.196
0.89PheSer: 0.89 ± 0.399
1.336PheThr: 1.336 ± 0.598
0.0PheVal: 0.0 ± 0.0
0.89PheTrp: 0.89 ± 0.634
2.226PheTyr: 2.226 ± 0.036
0.0PheXaa: 0.0 ± 0.0
Gly
8.459GlyAla: 8.459 ± 0.344
0.0GlyCys: 0.0 ± 0.0
4.898GlyAsp: 4.898 ± 2.971
2.671GlyGlu: 2.671 ± 0.87
2.671GlyPhe: 2.671 ± 0.163
2.671GlyGly: 2.671 ± 0.87
1.781GlyHis: 1.781 ± 0.235
3.562GlyIle: 3.562 ± 1.504
2.671GlyLys: 2.671 ± 0.87
4.007GlyLeu: 4.007 ± 1.304
2.671GlyMet: 2.671 ± 1.196
0.89GlyAsn: 0.89 ± 0.634
1.781GlyPro: 1.781 ± 0.797
4.007GlyGln: 4.007 ± 2.337
3.562GlyArg: 3.562 ± 0.471
2.226GlySer: 2.226 ± 0.036
1.336GlyThr: 1.336 ± 1.468
3.562GlyVal: 3.562 ± 0.471
1.336GlyTrp: 1.336 ± 0.598
2.671GlyTyr: 2.671 ± 0.163
0.0GlyXaa: 0.0 ± 0.0
His
5.343HisAla: 5.343 ± 0.326
0.445HisCys: 0.445 ± 0.199
1.781HisAsp: 1.781 ± 0.235
3.117HisGlu: 3.117 ± 1.395
0.89HisPhe: 0.89 ± 0.399
0.445HisGly: 0.445 ± 0.199
0.445HisHis: 0.445 ± 0.833
2.226HisIle: 2.226 ± 0.997
0.0HisLys: 0.0 ± 0.0
1.336HisLeu: 1.336 ± 0.598
2.226HisMet: 2.226 ± 0.036
2.226HisAsn: 2.226 ± 0.036
2.226HisPro: 2.226 ± 0.036
0.89HisGln: 0.89 ± 0.634
2.671HisArg: 2.671 ± 0.163
0.89HisSer: 0.89 ± 0.399
4.007HisThr: 4.007 ± 1.304
2.226HisVal: 2.226 ± 0.036
0.89HisTrp: 0.89 ± 0.634
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.007IleAla: 4.007 ± 0.272
0.0IleCys: 0.0 ± 0.0
2.226IleAsp: 2.226 ± 0.997
4.007IleGlu: 4.007 ± 0.761
1.336IlePhe: 1.336 ± 0.598
3.117IleGly: 3.117 ± 2.736
2.226IleHis: 2.226 ± 0.036
1.336IleIle: 1.336 ± 0.598
0.445IleLys: 0.445 ± 0.199
5.788IleLeu: 5.788 ± 0.526
2.226IleMet: 2.226 ± 0.178
0.89IleAsn: 0.89 ± 0.634
2.671IlePro: 2.671 ± 0.163
2.671IleGln: 2.671 ± 0.163
6.233IleArg: 6.233 ± 0.725
2.226IleSer: 2.226 ± 0.997
2.671IleThr: 2.671 ± 1.196
4.007IleVal: 4.007 ± 0.761
0.445IleTrp: 0.445 ± 0.199
1.336IleTyr: 1.336 ± 0.435
0.0IleXaa: 0.0 ± 0.0
Lys
4.898LysAla: 4.898 ± 2.971
0.0LysCys: 0.0 ± 0.0
1.781LysAsp: 1.781 ± 0.797
1.336LysGlu: 1.336 ± 0.435
0.89LysPhe: 0.89 ± 0.399
0.89LysGly: 0.89 ± 0.399
1.336LysHis: 1.336 ± 0.598
1.336LysIle: 1.336 ± 0.598
0.445LysLys: 0.445 ± 0.199
5.788LysLeu: 5.788 ± 0.526
0.445LysMet: 0.445 ± 0.833
1.781LysAsn: 1.781 ± 1.268
1.781LysPro: 1.781 ± 0.797
0.445LysGln: 0.445 ± 0.199
1.336LysArg: 1.336 ± 0.598
2.671LysSer: 2.671 ± 0.163
2.226LysThr: 2.226 ± 0.997
3.117LysVal: 3.117 ± 0.67
0.89LysTrp: 0.89 ± 0.634
1.781LysTyr: 1.781 ± 0.797
0.0LysXaa: 0.0 ± 0.0
Leu
7.569LeuAla: 7.569 ± 1.323
2.226LeuCys: 2.226 ± 0.036
7.124LeuAsp: 7.124 ± 2.157
4.452LeuGlu: 4.452 ± 0.072
3.562LeuPhe: 3.562 ± 0.562
4.898LeuGly: 4.898 ± 2.971
1.781LeuHis: 1.781 ± 0.235
3.562LeuIle: 3.562 ± 0.471
1.781LeuLys: 1.781 ± 0.797
7.569LeuLeu: 7.569 ± 1.323
3.562LeuMet: 3.562 ± 2.536
2.671LeuAsn: 2.671 ± 0.163
5.343LeuPro: 5.343 ± 3.805
2.671LeuGln: 2.671 ± 0.163
6.679LeuArg: 6.679 ± 0.924
5.788LeuSer: 5.788 ± 0.526
4.007LeuThr: 4.007 ± 0.272
6.233LeuVal: 6.233 ± 1.758
0.445LeuTrp: 0.445 ± 0.199
2.226LeuTyr: 2.226 ± 1.069
0.0LeuXaa: 0.0 ± 0.0
Met
5.343MetAla: 5.343 ± 1.359
0.445MetCys: 0.445 ± 0.833
0.89MetAsp: 0.89 ± 0.399
0.89MetGlu: 0.89 ± 0.399
0.445MetPhe: 0.445 ± 0.199
2.226MetGly: 2.226 ± 0.036
1.781MetHis: 1.781 ± 0.235
0.445MetIle: 0.445 ± 0.199
1.336MetLys: 1.336 ± 0.598
2.671MetLeu: 2.671 ± 0.163
1.336MetMet: 1.336 ± 2.5
1.781MetAsn: 1.781 ± 0.235
1.336MetPro: 1.336 ± 0.598
0.445MetGln: 0.445 ± 0.833
2.226MetArg: 2.226 ± 0.036
2.226MetSer: 2.226 ± 0.036
2.671MetThr: 2.671 ± 0.87
2.226MetVal: 2.226 ± 0.997
0.89MetTrp: 0.89 ± 0.634
0.89MetTyr: 0.89 ± 0.399
0.0MetXaa: 0.0 ± 0.0
Asn
4.007AsnAla: 4.007 ± 0.272
1.781AsnCys: 1.781 ± 0.797
1.781AsnAsp: 1.781 ± 0.797
1.781AsnGlu: 1.781 ± 0.797
0.89AsnPhe: 0.89 ± 0.399
1.781AsnGly: 1.781 ± 1.268
1.336AsnHis: 1.336 ± 1.468
2.226AsnIle: 2.226 ± 1.069
1.781AsnLys: 1.781 ± 0.235
4.007AsnLeu: 4.007 ± 0.761
1.336AsnMet: 1.336 ± 0.435
0.445AsnAsn: 0.445 ± 0.199
0.89AsnPro: 0.89 ± 0.399
0.445AsnGln: 0.445 ± 0.833
2.671AsnArg: 2.671 ± 0.163
1.336AsnSer: 1.336 ± 0.435
1.336AsnThr: 1.336 ± 0.435
1.781AsnVal: 1.781 ± 1.268
0.0AsnTrp: 0.0 ± 0.0
0.89AsnTyr: 0.89 ± 0.634
0.0AsnXaa: 0.0 ± 0.0
Pro
4.007ProAla: 4.007 ± 0.272
0.0ProCys: 0.0 ± 0.0
3.562ProAsp: 3.562 ± 1.504
2.671ProGlu: 2.671 ± 0.163
1.336ProPhe: 1.336 ± 0.598
4.452ProGly: 4.452 ± 0.961
1.336ProHis: 1.336 ± 1.468
1.336ProIle: 1.336 ± 1.468
2.226ProLys: 2.226 ± 0.036
3.562ProLeu: 3.562 ± 0.562
0.89ProMet: 0.89 ± 0.399
2.671ProAsn: 2.671 ± 1.196
3.562ProPro: 3.562 ± 0.471
0.89ProGln: 0.89 ± 0.634
3.562ProArg: 3.562 ± 0.562
1.336ProSer: 1.336 ± 0.598
2.671ProThr: 2.671 ± 0.163
4.898ProVal: 4.898 ± 0.127
0.0ProTrp: 0.0 ± 0.0
1.336ProTyr: 1.336 ± 0.598
0.0ProXaa: 0.0 ± 0.0
Gln
5.343GlnAla: 5.343 ± 3.805
0.89GlnCys: 0.89 ± 0.634
0.89GlnAsp: 0.89 ± 0.399
1.781GlnGlu: 1.781 ± 0.235
0.89GlnPhe: 0.89 ± 0.634
2.671GlnGly: 2.671 ± 0.87
2.226GlnHis: 2.226 ± 1.069
2.671GlnIle: 2.671 ± 0.163
0.89GlnLys: 0.89 ± 0.399
2.671GlnLeu: 2.671 ± 0.87
0.445GlnMet: 0.445 ± 0.833
2.671GlnAsn: 2.671 ± 0.87
0.445GlnPro: 0.445 ± 0.199
0.89GlnGln: 0.89 ± 0.399
0.445GlnArg: 0.445 ± 0.199
2.671GlnSer: 2.671 ± 1.902
1.336GlnThr: 1.336 ± 0.598
1.781GlnVal: 1.781 ± 0.797
0.445GlnTrp: 0.445 ± 0.199
1.336GlnTyr: 1.336 ± 0.435
0.0GlnXaa: 0.0 ± 0.0
Arg
5.788ArgAla: 5.788 ± 1.559
1.336ArgCys: 1.336 ± 0.435
4.007ArgAsp: 4.007 ± 1.794
4.007ArgGlu: 4.007 ± 1.794
2.226ArgPhe: 2.226 ± 0.036
5.788ArgGly: 5.788 ± 0.526
3.562ArgHis: 3.562 ± 1.595
4.898ArgIle: 4.898 ± 1.16
4.898ArgLys: 4.898 ± 1.938
4.452ArgLeu: 4.452 ± 0.961
2.671ArgMet: 2.671 ± 1.196
3.117ArgAsn: 3.117 ± 0.363
2.226ArgPro: 2.226 ± 0.036
1.336ArgGln: 1.336 ± 0.435
4.452ArgArg: 4.452 ± 0.072
5.343ArgSer: 5.343 ± 0.706
4.452ArgThr: 4.452 ± 0.961
4.452ArgVal: 4.452 ± 0.961
1.336ArgTrp: 1.336 ± 0.435
2.671ArgTyr: 2.671 ± 0.163
0.0ArgXaa: 0.0 ± 0.0
Ser
10.24SerAla: 10.24 ± 0.454
0.89SerCys: 0.89 ± 0.399
5.343SerAsp: 5.343 ± 1.359
4.007SerGlu: 4.007 ± 1.794
1.781SerPhe: 1.781 ± 0.235
2.671SerGly: 2.671 ± 1.196
0.0SerHis: 0.0 ± 0.0
2.226SerIle: 2.226 ± 0.036
1.336SerLys: 1.336 ± 1.468
4.898SerLeu: 4.898 ± 0.906
0.445SerMet: 0.445 ± 0.199
0.0SerAsn: 0.0 ± 0.0
2.226SerPro: 2.226 ± 1.069
2.226SerGln: 2.226 ± 2.102
4.452SerArg: 4.452 ± 0.961
3.562SerSer: 3.562 ± 0.562
4.452SerThr: 4.452 ± 1.105
2.226SerVal: 2.226 ± 0.997
0.0SerTrp: 0.0 ± 0.0
2.226SerTyr: 2.226 ± 0.997
0.0SerXaa: 0.0 ± 0.0
Thr
6.679ThrAla: 6.679 ± 2.174
0.89ThrCys: 0.89 ± 0.399
4.898ThrAsp: 4.898 ± 0.906
4.452ThrGlu: 4.452 ± 1.993
0.0ThrPhe: 0.0 ± 0.0
2.671ThrGly: 2.671 ± 1.902
2.671ThrHis: 2.671 ± 0.163
2.671ThrIle: 2.671 ± 0.87
2.671ThrLys: 2.671 ± 0.163
8.459ThrLeu: 8.459 ± 1.376
0.0ThrMet: 0.0 ± 0.0
0.89ThrAsn: 0.89 ± 0.399
2.226ThrPro: 2.226 ± 0.997
0.89ThrGln: 0.89 ± 0.634
2.226ThrArg: 2.226 ± 0.997
2.671ThrSer: 2.671 ± 0.163
3.117ThrThr: 3.117 ± 0.363
4.452ThrVal: 4.452 ± 1.993
0.89ThrTrp: 0.89 ± 0.399
3.117ThrTyr: 3.117 ± 0.363
0.0ThrXaa: 0.0 ± 0.0
Val
9.35ValAla: 9.35 ± 0.978
1.781ValCys: 1.781 ± 0.797
3.562ValAsp: 3.562 ± 0.562
3.117ValGlu: 3.117 ± 0.363
2.671ValPhe: 2.671 ± 1.196
4.007ValGly: 4.007 ± 1.304
2.226ValHis: 2.226 ± 0.997
3.562ValIle: 3.562 ± 1.595
3.117ValLys: 3.117 ± 1.395
4.452ValLeu: 4.452 ± 1.993
1.781ValMet: 1.781 ± 0.797
3.117ValAsn: 3.117 ± 2.736
3.117ValPro: 3.117 ± 1.395
4.007ValGln: 4.007 ± 0.761
3.117ValArg: 3.117 ± 1.703
3.562ValSer: 3.562 ± 0.562
5.343ValThr: 5.343 ± 1.359
3.117ValVal: 3.117 ± 0.67
0.445ValTrp: 0.445 ± 0.199
2.226ValTyr: 2.226 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
1.336TrpAla: 1.336 ± 0.435
0.0TrpCys: 0.0 ± 0.0
0.89TrpAsp: 0.89 ± 0.399
0.445TrpGlu: 0.445 ± 0.833
0.89TrpPhe: 0.89 ± 0.399
1.336TrpGly: 1.336 ± 0.598
0.0TrpHis: 0.0 ± 0.0
0.89TrpIle: 0.89 ± 0.399
0.0TrpLys: 0.0 ± 0.0
1.336TrpLeu: 1.336 ± 1.468
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.445TrpGln: 0.445 ± 0.199
0.89TrpArg: 0.89 ± 0.399
1.781TrpSer: 1.781 ± 0.235
0.89TrpThr: 0.89 ± 0.634
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.89TrpTyr: 0.89 ± 0.399
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.343TyrAla: 5.343 ± 0.706
1.336TyrCys: 1.336 ± 0.598
3.562TyrAsp: 3.562 ± 0.471
1.336TyrGlu: 1.336 ± 0.435
1.781TyrPhe: 1.781 ± 0.797
1.781TyrGly: 1.781 ± 0.797
2.671TyrHis: 2.671 ± 0.163
1.781TyrIle: 1.781 ± 0.235
2.226TyrLys: 2.226 ± 1.069
3.562TyrLeu: 3.562 ± 0.562
1.336TyrMet: 1.336 ± 0.598
0.445TyrAsn: 0.445 ± 0.199
1.781TyrPro: 1.781 ± 2.301
0.89TyrGln: 0.89 ± 0.634
2.226TyrArg: 2.226 ± 0.036
2.226TyrSer: 2.226 ± 0.997
1.336TyrThr: 1.336 ± 0.598
1.336TyrVal: 1.336 ± 0.598
0.0TyrTrp: 0.0 ± 0.0
0.89TyrTyr: 0.89 ± 0.399
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2247 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
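One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is an assumption about the general technique, not a description of the database's actual code; the names `frequency_permille` and `bootstrap_error`, the fixed seed, and the use of the population standard deviation are all choices made for this example.

```python
import random
import statistics

def frequency_permille(sequences, dipeptide):
    """Frequency of one dipeptide (per mille), pooled over the sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(frequency_permille(sample, dipeptide))
    return statistics.pstdev(estimates)

# Toy input (hypothetical sequences, one-letter amino acid codes).
proteins = ["MAAAK", "AALKK"]
print(frequency_permille(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

With only two proteins, as in this proteome, each resample can contain just one distinct protein or both, so an error estimated this way is coarse.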

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski