Amino acid dipepetide frequency for Wutai mosquito phasivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.524AlaAla: 3.524 ± 1.104
0.813AlaCys: 0.813 ± 0.802
1.898AlaAsp: 1.898 ± 0.474
2.982AlaGlu: 2.982 ± 0.963
1.084AlaPhe: 1.084 ± 0.319
2.982AlaGly: 2.982 ± 0.603
1.355AlaHis: 1.355 ± 0.571
4.066AlaIle: 4.066 ± 3.135
3.795AlaLys: 3.795 ± 0.937
5.964AlaLeu: 5.964 ± 0.584
1.355AlaMet: 1.355 ± 0.455
2.169AlaAsn: 2.169 ± 1.344
1.084AlaPro: 1.084 ± 0.305
1.084AlaGln: 1.084 ± 0.305
3.253AlaArg: 3.253 ± 0.441
7.59AlaSer: 7.59 ± 2.146
2.982AlaThr: 2.982 ± 1.848
2.169AlaVal: 2.169 ± 1.207
0.0AlaTrp: 0.0 ± 0.0
0.813AlaTyr: 0.813 ± 0.181
0.0AlaXaa: 0.0 ± 0.0
Cys
0.813CysAla: 0.813 ± 0.409
0.271CysCys: 0.271 ± 0.267
0.542CysAsp: 0.542 ± 0.535
1.898CysGlu: 1.898 ± 1.471
1.626CysPhe: 1.626 ± 1.204
1.084CysGly: 1.084 ± 0.672
0.813CysHis: 0.813 ± 0.181
0.542CysIle: 0.542 ± 0.535
1.898CysLys: 1.898 ± 1.08
1.898CysLeu: 1.898 ± 1.08
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.813CysPro: 0.813 ± 0.181
0.271CysGln: 0.271 ± 0.267
0.813CysArg: 0.813 ± 0.644
4.066CysSer: 4.066 ± 2.813
0.813CysThr: 0.813 ± 0.802
1.084CysVal: 1.084 ± 0.672
0.542CysTrp: 0.542 ± 0.326
1.084CysTyr: 1.084 ± 0.672
0.0CysXaa: 0.0 ± 0.0
Asp
2.711AspAla: 2.711 ± 0.237
1.898AspCys: 1.898 ± 1.08
2.169AspAsp: 2.169 ± 0.611
4.337AspGlu: 4.337 ± 0.608
4.066AspPhe: 4.066 ± 3.264
2.44AspGly: 2.44 ± 0.542
0.271AspHis: 0.271 ± 0.163
4.066AspIle: 4.066 ± 0.769
4.337AspLys: 4.337 ± 1.398
4.337AspLeu: 4.337 ± 1.275
1.898AspMet: 1.898 ± 0.396
2.44AspAsn: 2.44 ± 0.929
1.626AspPro: 1.626 ± 0.612
1.626AspGln: 1.626 ± 0.361
2.711AspArg: 2.711 ± 1.899
2.44AspSer: 2.44 ± 1.226
2.44AspThr: 2.44 ± 0.757
1.355AspVal: 1.355 ± 0.571
1.084AspTrp: 1.084 ± 0.319
2.169AspTyr: 2.169 ± 0.71
0.0AspXaa: 0.0 ± 0.0
Glu
4.608GluAla: 4.608 ± 1.258
0.542GluCys: 0.542 ± 0.535
4.879GluAsp: 4.879 ± 0.932
7.319GluGlu: 7.319 ± 3.192
1.898GluPhe: 1.898 ± 0.771
2.44GluGly: 2.44 ± 0.757
1.355GluHis: 1.355 ± 0.56
5.422GluIle: 5.422 ± 1.179
5.693GluLys: 5.693 ± 1.332
5.422GluLeu: 5.422 ± 0.871
1.626GluMet: 1.626 ± 0.979
3.524GluAsn: 3.524 ± 0.143
3.253GluPro: 3.253 ± 0.723
2.982GluGln: 2.982 ± 1.417
2.711GluArg: 2.711 ± 1.252
6.777GluSer: 6.777 ± 1.686
3.524GluThr: 3.524 ± 0.28
4.879GluVal: 4.879 ± 0.26
1.626GluTrp: 1.626 ± 0.612
2.711GluTyr: 2.711 ± 1.35
0.0GluXaa: 0.0 ± 0.0
Phe
1.898PheAla: 1.898 ± 1.319
0.271PheCys: 0.271 ± 0.163
4.066PheAsp: 4.066 ± 0.858
3.795PheGlu: 3.795 ± 1.542
1.355PhePhe: 1.355 ± 0.455
2.711PheGly: 2.711 ± 0.388
0.542PheHis: 0.542 ± 0.326
2.169PheIle: 2.169 ± 0.637
3.253PheLys: 3.253 ± 0.817
2.711PheLeu: 2.711 ± 0.237
1.626PheMet: 1.626 ± 0.687
1.898PheAsn: 1.898 ± 1.143
1.355PhePro: 1.355 ± 2.122
0.271PheGln: 0.271 ± 0.163
2.982PheArg: 2.982 ± 0.751
4.608PheSer: 4.608 ± 0.927
2.982PheThr: 2.982 ± 0.232
2.982PheVal: 2.982 ± 0.292
1.084PheTrp: 1.084 ± 0.319
2.44PheTyr: 2.44 ± 0.871
0.0PheXaa: 0.0 ± 0.0
Gly
2.711GlyAla: 2.711 ± 1.378
1.355GlyCys: 1.355 ± 0.938
1.898GlyAsp: 1.898 ± 0.536
1.898GlyGlu: 1.898 ± 0.606
3.253GlyPhe: 3.253 ± 0.916
1.626GlyGly: 1.626 ± 0.463
1.084GlyHis: 1.084 ± 0.319
3.795GlyIle: 3.795 ± 1.542
5.15GlyLys: 5.15 ± 1.988
4.337GlyLeu: 4.337 ± 1.598
1.084GlyMet: 1.084 ± 0.305
2.44GlyAsn: 2.44 ± 1.108
1.084GlyPro: 1.084 ± 0.653
1.626GlyGln: 1.626 ± 0.361
3.253GlyArg: 3.253 ± 0.441
3.795GlySer: 3.795 ± 1.364
3.795GlyThr: 3.795 ± 0.702
4.066GlyVal: 4.066 ± 0.929
0.542GlyTrp: 0.542 ± 0.326
1.898GlyTyr: 1.898 ± 0.606
0.0GlyXaa: 0.0 ± 0.0
His
0.271HisAla: 0.271 ± 0.163
0.542HisCys: 0.542 ± 0.159
1.084HisAsp: 1.084 ± 0.7
0.542HisGlu: 0.542 ± 0.694
1.626HisPhe: 1.626 ± 0.478
3.253HisGly: 3.253 ± 1.58
0.0HisHis: 0.0 ± 0.0
1.626HisIle: 1.626 ± 0.852
1.626HisLys: 1.626 ± 0.612
2.169HisLeu: 2.169 ± 0.611
0.271HisMet: 0.271 ± 0.153
0.813HisAsn: 0.813 ± 0.181
1.084HisPro: 1.084 ± 0.305
0.542HisGln: 0.542 ± 0.159
1.355HisArg: 1.355 ± 0.765
1.355HisSer: 1.355 ± 0.299
0.271HisThr: 0.271 ± 0.267
1.084HisVal: 1.084 ± 0.672
0.0HisTrp: 0.0 ± 0.0
1.084HisTyr: 1.084 ± 0.305
0.0HisXaa: 0.0 ± 0.0
Ile
5.15IleAla: 5.15 ± 1.306
1.084IleCys: 1.084 ± 1.069
3.524IleAsp: 3.524 ± 1.062
7.048IleGlu: 7.048 ± 1.849
1.355IlePhe: 1.355 ± 0.533
3.253IleGly: 3.253 ± 0.41
0.542IleHis: 0.542 ± 0.694
5.964IleIle: 5.964 ± 0.328
4.608IleLys: 4.608 ± 0.825
8.403IleLeu: 8.403 ± 1.756
2.169IleMet: 2.169 ± 0.468
5.15IleAsn: 5.15 ± 1.183
3.253IlePro: 3.253 ± 0.441
1.355IleGln: 1.355 ± 1.197
1.898IleArg: 1.898 ± 1.225
7.59IleSer: 7.59 ± 0.663
4.066IleThr: 4.066 ± 0.897
3.524IleVal: 3.524 ± 1.655
0.271IleTrp: 0.271 ± 0.745
1.355IleTyr: 1.355 ± 0.533
0.0IleXaa: 0.0 ± 0.0
Lys
5.422LysAla: 5.422 ± 1.484
2.44LysCys: 2.44 ± 1.609
2.44LysAsp: 2.44 ± 1.222
7.048LysGlu: 7.048 ± 1.133
4.337LysPhe: 4.337 ± 0.66
4.337LysGly: 4.337 ± 1.486
1.898LysHis: 1.898 ± 0.771
4.879LysIle: 4.879 ± 1.062
4.608LysLys: 4.608 ± 1.038
8.674LysLeu: 8.674 ± 1.507
2.44LysMet: 2.44 ± 1.09
1.898LysAsn: 1.898 ± 0.396
2.982LysPro: 2.982 ± 1.066
1.355LysGln: 1.355 ± 0.816
4.879LysArg: 4.879 ± 1.192
5.964LysSer: 5.964 ± 0.815
4.066LysThr: 4.066 ± 1.344
5.15LysVal: 5.15 ± 1.695
0.542LysTrp: 0.542 ± 0.326
2.44LysTyr: 2.44 ± 0.757
0.0LysXaa: 0.0 ± 0.0
Leu
4.608LeuAla: 4.608 ± 0.207
2.44LeuCys: 2.44 ± 0.548
4.608LeuAsp: 4.608 ± 0.627
3.795LeuGlu: 3.795 ± 0.821
5.964LeuPhe: 5.964 ± 0.553
4.066LeuGly: 4.066 ± 0.079
2.711LeuHis: 2.711 ± 0.237
8.403LeuIle: 8.403 ± 2.334
6.235LeuLys: 6.235 ± 2.289
11.927LeuLeu: 11.927 ± 1.641
3.524LeuMet: 3.524 ± 1.377
4.337LeuAsn: 4.337 ± 1.006
3.524LeuPro: 3.524 ± 0.143
2.982LeuGln: 2.982 ± 0.751
4.337LeuArg: 4.337 ± 1.275
10.301LeuSer: 10.301 ± 1.099
7.319LeuThr: 7.319 ± 2.447
4.879LeuVal: 4.879 ± 1.084
0.542LeuTrp: 0.542 ± 0.326
3.524LeuTyr: 3.524 ± 0.964
0.0LeuXaa: 0.0 ± 0.0
Met
0.813MetAla: 0.813 ± 0.409
0.271MetCys: 0.271 ± 0.163
2.169MetAsp: 2.169 ± 0.699
1.626MetGlu: 1.626 ± 0.478
0.813MetPhe: 0.813 ± 0.181
2.169MetGly: 2.169 ± 0.611
0.542MetHis: 0.542 ± 0.159
2.44MetIle: 2.44 ± 1.09
1.626MetLys: 1.626 ± 0.463
3.524MetLeu: 3.524 ± 0.551
0.542MetMet: 0.542 ± 0.326
1.898MetAsn: 1.898 ± 0.444
0.542MetPro: 0.542 ± 0.159
0.813MetGln: 0.813 ± 0.644
1.626MetArg: 1.626 ± 0.547
2.169MetSer: 2.169 ± 0.39
2.169MetThr: 2.169 ± 0.39
1.355MetVal: 1.355 ± 0.299
0.271MetTrp: 0.271 ± 0.163
0.542MetTyr: 0.542 ± 0.326
0.0MetXaa: 0.0 ± 0.0
Asn
0.813AsnAla: 0.813 ± 0.181
1.626AsnCys: 1.626 ± 0.817
1.355AsnAsp: 1.355 ± 0.56
3.524AsnGlu: 3.524 ± 1.062
2.44AsnPhe: 2.44 ± 0.596
2.169AsnGly: 2.169 ± 0.967
1.355AsnHis: 1.355 ± 0.816
2.982AsnIle: 2.982 ± 0.232
3.253AsnLys: 3.253 ± 0.41
4.337AsnLeu: 4.337 ± 0.935
1.355AsnMet: 1.355 ± 0.635
2.982AsnAsn: 2.982 ± 0.232
2.169AsnPro: 2.169 ± 0.637
1.626AsnGln: 1.626 ± 0.361
1.084AsnArg: 1.084 ± 0.305
5.964AsnSer: 5.964 ± 1.503
2.44AsnThr: 2.44 ± 0.283
3.524AsnVal: 3.524 ± 1.222
1.355AsnTrp: 1.355 ± 0.56
2.169AsnTyr: 2.169 ± 0.611
0.0AsnXaa: 0.0 ± 0.0
Pro
0.542ProAla: 0.542 ± 0.748
0.813ProCys: 0.813 ± 0.409
3.253ProAsp: 3.253 ± 1.223
2.711ProGlu: 2.711 ± 0.65
1.898ProPhe: 1.898 ± 0.959
1.626ProGly: 1.626 ± 0.547
1.084ProHis: 1.084 ± 0.319
0.813ProIle: 0.813 ± 0.49
2.44ProLys: 2.44 ± 1.108
3.253ProLeu: 3.253 ± 0.74
1.626ProMet: 1.626 ± 0.478
0.813ProAsn: 0.813 ± 0.181
0.813ProPro: 0.813 ± 0.181
0.813ProGln: 0.813 ± 0.409
1.626ProArg: 1.626 ± 0.852
3.253ProSer: 3.253 ± 1.223
1.626ProThr: 1.626 ± 0.478
2.44ProVal: 2.44 ± 0.871
0.542ProTrp: 0.542 ± 0.159
1.084ProTyr: 1.084 ± 0.653
0.0ProXaa: 0.0 ± 0.0
Gln
0.813GlnAla: 0.813 ± 0.409
0.271GlnCys: 0.271 ± 0.267
1.355GlnAsp: 1.355 ± 0.56
1.355GlnGlu: 1.355 ± 0.455
0.813GlnPhe: 0.813 ± 0.181
1.626GlnGly: 1.626 ± 1.259
0.813GlnHis: 0.813 ± 0.49
2.44GlnIle: 2.44 ± 0.757
1.626GlnLys: 1.626 ± 0.478
2.982GlnLeu: 2.982 ± 0.777
1.355GlnMet: 1.355 ± 0.56
1.898GlnAsn: 1.898 ± 0.444
1.084GlnPro: 1.084 ± 0.319
0.271GlnGln: 0.271 ± 0.163
0.271GlnArg: 0.271 ± 0.163
2.44GlnSer: 2.44 ± 0.757
1.084GlnThr: 1.084 ± 0.705
2.44GlnVal: 2.44 ± 0.871
0.0GlnTrp: 0.0 ± 0.0
0.813GlnTyr: 0.813 ± 0.181
0.0GlnXaa: 0.0 ± 0.0
Arg
3.253ArgAla: 3.253 ± 0.102
0.813ArgCys: 0.813 ± 0.181
1.626ArgAsp: 1.626 ± 0.463
4.066ArgGlu: 4.066 ± 1.365
1.626ArgPhe: 1.626 ± 0.817
2.711ArgGly: 2.711 ± 2.668
0.0ArgHis: 0.0 ± 0.0
3.253ArgIle: 3.253 ± 0.927
2.982ArgLys: 2.982 ± 0.643
5.693ArgLeu: 5.693 ± 1.223
0.542ArgMet: 0.542 ± 0.326
2.982ArgAsn: 2.982 ± 0.777
0.813ArgPro: 0.813 ± 0.181
1.898ArgGln: 1.898 ± 0.474
1.898ArgArg: 1.898 ± 0.606
3.795ArgSer: 3.795 ± 1.55
3.253ArgThr: 3.253 ± 0.926
4.066ArgVal: 4.066 ± 1.081
0.542ArgTrp: 0.542 ± 0.326
0.813ArgTyr: 0.813 ± 0.181
0.0ArgXaa: 0.0 ± 0.0
Ser
5.422SerAla: 5.422 ± 0.58
2.982SerCys: 2.982 ± 2.539
4.608SerAsp: 4.608 ± 2.229
6.506SerGlu: 6.506 ± 1.403
2.982SerPhe: 2.982 ± 1.068
4.066SerGly: 4.066 ± 1.344
2.982SerHis: 2.982 ± 0.643
8.132SerIle: 8.132 ± 0.785
8.674SerLys: 8.674 ± 0.918
12.741SerLeu: 12.741 ± 1.28
1.084SerMet: 1.084 ± 0.319
4.066SerAsn: 4.066 ± 1.344
3.253SerPro: 3.253 ± 0.723
3.795SerGln: 3.795 ± 1.429
4.066SerArg: 4.066 ± 0.858
8.946SerSer: 8.946 ± 0.606
2.982SerThr: 2.982 ± 1.303
4.608SerVal: 4.608 ± 1.488
1.084SerTrp: 1.084 ± 0.653
2.711SerTyr: 2.711 ± 0.797
0.0SerXaa: 0.0 ± 0.0
Thr
2.982ThrAla: 2.982 ± 1.785
1.626ThrCys: 1.626 ± 0.817
2.982ThrAsp: 2.982 ± 1.234
4.337ThrGlu: 4.337 ± 0.224
2.44ThrPhe: 2.44 ± 0.448
3.795ThrGly: 3.795 ± 1.116
1.355ThrHis: 1.355 ± 0.765
2.711ThrIle: 2.711 ± 0.388
3.253ThrLys: 3.253 ± 0.926
5.422ThrLeu: 5.422 ± 1.757
1.355ThrMet: 1.355 ± 0.299
3.795ThrAsn: 3.795 ± 0.821
1.898ThrPro: 1.898 ± 0.606
0.542ThrGln: 0.542 ± 0.326
3.795ThrArg: 3.795 ± 1.542
4.337ThrSer: 4.337 ± 1.479
4.337ThrThr: 4.337 ± 0.66
3.524ThrVal: 3.524 ± 1.222
0.271ThrTrp: 0.271 ± 0.267
1.626ThrTyr: 1.626 ± 0.547
0.0ThrXaa: 0.0 ± 0.0
Val
2.982ValAla: 2.982 ± 1.769
0.271ValCys: 0.271 ± 0.267
2.982ValAsp: 2.982 ± 1.528
4.337ValGlu: 4.337 ± 0.268
2.711ValPhe: 2.711 ± 0.237
2.982ValGly: 2.982 ± 0.292
0.813ValHis: 0.813 ± 0.181
4.879ValIle: 4.879 ± 1.358
6.506ValLys: 6.506 ± 1.454
3.524ValLeu: 3.524 ± 0.28
1.898ValMet: 1.898 ± 0.372
3.795ValAsn: 3.795 ± 0.951
1.355ValPro: 1.355 ± 0.56
1.084ValGln: 1.084 ± 0.305
2.44ValArg: 2.44 ± 0.757
7.319ValSer: 7.319 ± 0.994
3.795ValThr: 3.795 ± 1.778
4.879ValVal: 4.879 ± 1.192
0.542ValTrp: 0.542 ± 0.326
1.898ValTyr: 1.898 ± 0.444
0.0ValXaa: 0.0 ± 0.0
Trp
0.271TrpAla: 0.271 ± 0.163
0.0TrpCys: 0.0 ± 0.0
0.542TrpAsp: 0.542 ± 0.159
1.084TrpGlu: 1.084 ± 0.305
0.813TrpPhe: 0.813 ± 0.181
0.271TrpGly: 0.271 ± 0.267
0.271TrpHis: 0.271 ± 0.163
0.542TrpIle: 0.542 ± 0.326
0.813TrpLys: 0.813 ± 0.681
1.355TrpLeu: 1.355 ± 0.816
0.271TrpMet: 0.271 ± 0.267
0.542TrpAsn: 0.542 ± 0.326
0.542TrpPro: 0.542 ± 0.159
0.271TrpGln: 0.271 ± 0.163
0.813TrpArg: 0.813 ± 0.49
0.813TrpSer: 0.813 ± 0.181
0.271TrpThr: 0.271 ± 0.163
1.355TrpVal: 1.355 ± 0.299
0.0TrpTrp: 0.0 ± 0.0
0.271TrpTyr: 0.271 ± 0.267
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.355TyrAla: 1.355 ± 0.56
0.542TyrCys: 0.542 ± 0.535
2.44TyrAsp: 2.44 ± 0.596
2.982TyrGlu: 2.982 ± 1.303
1.898TyrPhe: 1.898 ± 0.444
1.084TyrGly: 1.084 ± 0.305
0.813TyrHis: 0.813 ± 0.49
2.44TyrIle: 2.44 ± 0.757
5.422TyrLys: 5.422 ± 0.368
1.355TyrLeu: 1.355 ± 0.299
1.626TyrMet: 1.626 ± 1.288
1.084TyrAsn: 1.084 ± 0.653
0.542TyrPro: 0.542 ± 0.326
0.542TyrGln: 0.542 ± 0.694
0.813TyrArg: 0.813 ± 0.49
2.169TyrSer: 2.169 ± 0.611
2.169TyrThr: 2.169 ± 0.637
1.898TyrVal: 1.898 ± 0.771
0.271TyrTrp: 0.271 ± 0.163
1.355TyrTyr: 1.355 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3690 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski