Amino acid dipepetide frequency for Wenzhou picorna-like virus 37

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.718AlaAla: 2.718 ± 0.133
1.019AlaCys: 1.019 ± 0.232
2.379AlaAsp: 2.379 ± 0.429
1.019AlaGlu: 1.019 ± 0.496
2.379AlaPhe: 2.379 ± 0.298
3.398AlaGly: 3.398 ± 2.713
0.34AlaHis: 0.34 ± 0.165
3.058AlaIle: 3.058 ± 1.488
2.718AlaLys: 2.718 ± 0.133
5.437AlaLeu: 5.437 ± 1.189
1.019AlaMet: 1.019 ± 0.496
3.058AlaAsn: 3.058 ± 0.032
3.398AlaPro: 3.398 ± 1.653
2.379AlaGln: 2.379 ± 1.754
1.699AlaArg: 1.699 ± 0.099
3.398AlaSer: 3.398 ± 0.53
3.738AlaThr: 3.738 ± 1.82
1.699AlaVal: 1.699 ± 0.099
0.34AlaTrp: 0.34 ± 0.562
3.738AlaTyr: 3.738 ± 0.363
0.0AlaXaa: 0.0 ± 0.0
Cys
0.34CysAla: 0.34 ± 0.562
0.0CysCys: 0.0 ± 0.0
0.68CysAsp: 0.68 ± 0.331
2.039CysGlu: 2.039 ± 0.264
1.359CysPhe: 1.359 ± 0.661
2.039CysGly: 2.039 ± 0.992
0.0CysHis: 0.0 ± 0.0
1.019CysIle: 1.019 ± 0.232
0.0CysLys: 0.0 ± 0.0
2.039CysLeu: 2.039 ± 0.264
0.0CysMet: 0.0 ± 0.0
0.68CysAsn: 0.68 ± 0.331
1.359CysPro: 1.359 ± 0.661
0.68CysGln: 0.68 ± 0.331
0.68CysArg: 0.68 ± 0.331
0.68CysSer: 0.68 ± 0.331
1.699CysThr: 1.699 ± 0.826
3.738CysVal: 3.738 ± 0.363
0.34CysTrp: 0.34 ± 0.165
0.68CysTyr: 0.68 ± 0.331
0.0CysXaa: 0.0 ± 0.0
Asp
2.039AspAla: 2.039 ± 0.464
0.34AspCys: 0.34 ± 0.165
6.116AspAsp: 6.116 ± 0.663
2.718AspGlu: 2.718 ± 1.322
3.398AspPhe: 3.398 ± 1.258
3.398AspGly: 3.398 ± 0.198
0.34AspHis: 0.34 ± 0.165
4.417AspIle: 4.417 ± 0.762
3.398AspLys: 3.398 ± 0.925
6.116AspLeu: 6.116 ± 2.247
2.039AspMet: 2.039 ± 0.992
2.718AspAsn: 2.718 ± 0.133
3.058AspPro: 3.058 ± 0.032
0.68AspGln: 0.68 ± 0.397
2.379AspArg: 2.379 ± 0.429
3.398AspSer: 3.398 ± 0.925
3.738AspThr: 3.738 ± 1.82
5.776AspVal: 5.776 ± 0.627
0.34AspTrp: 0.34 ± 0.165
2.718AspTyr: 2.718 ± 1.322
0.0AspXaa: 0.0 ± 0.0
Glu
2.039GluAla: 2.039 ± 0.992
1.699GluCys: 1.699 ± 0.629
3.738GluAsp: 3.738 ± 1.09
4.077GluGlu: 4.077 ± 1.256
3.738GluPhe: 3.738 ± 0.365
1.359GluGly: 1.359 ± 0.067
0.68GluHis: 0.68 ± 0.331
2.039GluIle: 2.039 ± 0.264
2.718GluLys: 2.718 ± 0.861
5.097GluLeu: 5.097 ± 0.431
1.019GluMet: 1.019 ± 0.232
1.359GluAsn: 1.359 ± 0.067
2.039GluPro: 2.039 ± 0.264
1.359GluGln: 1.359 ± 0.661
3.058GluArg: 3.058 ± 1.488
2.379GluSer: 2.379 ± 0.429
2.718GluThr: 2.718 ± 0.861
5.097GluVal: 5.097 ± 1.752
0.34GluTrp: 0.34 ± 0.165
4.077GluTyr: 4.077 ± 0.2
0.0GluXaa: 0.0 ± 0.0
Phe
4.077PheAla: 4.077 ± 0.927
0.68PheCys: 0.68 ± 0.331
2.718PheAsp: 2.718 ± 0.595
1.699PheGlu: 1.699 ± 0.826
0.0PhePhe: 0.0 ± 0.0
1.359PheGly: 1.359 ± 0.661
1.019PheHis: 1.019 ± 0.959
2.039PheIle: 2.039 ± 0.464
3.398PheLys: 3.398 ± 0.53
4.077PheLeu: 4.077 ± 1.256
1.699PheMet: 1.699 ± 0.099
1.019PheAsn: 1.019 ± 0.496
1.359PhePro: 1.359 ± 0.794
1.699PheGln: 1.699 ± 0.099
2.379PheArg: 2.379 ± 0.298
4.757PheSer: 4.757 ± 0.597
3.738PheThr: 3.738 ± 1.82
4.417PheVal: 4.417 ± 0.762
0.68PheTrp: 0.68 ± 0.397
0.68PheTyr: 0.68 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
2.379GlyAla: 2.379 ± 0.429
1.019GlyCys: 1.019 ± 0.232
3.738GlyAsp: 3.738 ± 1.092
3.058GlyGlu: 3.058 ± 0.76
3.398GlyPhe: 3.398 ± 1.258
1.359GlyGly: 1.359 ± 0.794
1.019GlyHis: 1.019 ± 0.496
3.058GlyIle: 3.058 ± 0.695
3.738GlyLys: 3.738 ± 0.363
4.077GlyLeu: 4.077 ± 1.655
1.359GlyMet: 1.359 ± 0.794
3.058GlyAsn: 3.058 ± 0.76
2.379GlyPro: 2.379 ± 0.429
2.379GlyGln: 2.379 ± 0.298
3.058GlyArg: 3.058 ± 0.695
4.417GlySer: 4.417 ± 0.034
1.699GlyThr: 1.699 ± 0.629
5.097GlyVal: 5.097 ± 1.159
0.0GlyTrp: 0.0 ± 0.0
2.379GlyTyr: 2.379 ± 1.026
0.0GlyXaa: 0.0 ± 0.0
His
1.699HisAla: 1.699 ± 0.099
0.0HisCys: 0.0 ± 0.0
1.359HisAsp: 1.359 ± 0.067
0.68HisGlu: 0.68 ± 0.331
1.359HisPhe: 1.359 ± 0.661
1.019HisGly: 1.019 ± 0.496
0.68HisHis: 0.68 ± 0.331
1.699HisIle: 1.699 ± 0.826
1.699HisLys: 1.699 ± 0.099
1.699HisLeu: 1.699 ± 0.099
1.019HisMet: 1.019 ± 0.232
2.379HisAsn: 2.379 ± 0.298
1.359HisPro: 1.359 ± 0.067
1.019HisGln: 1.019 ± 0.232
0.68HisArg: 0.68 ± 0.331
2.039HisSer: 2.039 ± 0.264
1.019HisThr: 1.019 ± 0.232
1.699HisVal: 1.699 ± 0.826
0.34HisTrp: 0.34 ± 0.165
1.019HisTyr: 1.019 ± 0.496
0.0HisXaa: 0.0 ± 0.0
Ile
3.398IleAla: 3.398 ± 1.653
0.68IleCys: 0.68 ± 0.331
2.718IleAsp: 2.718 ± 0.595
3.398IleGlu: 3.398 ± 1.653
1.019IlePhe: 1.019 ± 0.232
2.718IleGly: 2.718 ± 0.133
2.718IleHis: 2.718 ± 1.322
4.417IleIle: 4.417 ± 0.693
2.379IleLys: 2.379 ± 0.429
3.738IleLeu: 3.738 ± 1.818
0.68IleMet: 0.68 ± 0.331
2.718IleAsn: 2.718 ± 0.595
4.757IlePro: 4.757 ± 0.131
0.68IleGln: 0.68 ± 0.331
3.738IleArg: 3.738 ± 0.363
5.437IleSer: 5.437 ± 0.266
4.417IleThr: 4.417 ± 2.945
2.718IleVal: 2.718 ± 0.133
1.019IleTrp: 1.019 ± 0.232
3.398IleTyr: 3.398 ± 1.258
0.0IleXaa: 0.0 ± 0.0
Lys
3.398LysAla: 3.398 ± 0.925
0.34LysCys: 0.34 ± 0.165
4.417LysAsp: 4.417 ± 0.693
3.058LysGlu: 3.058 ± 0.695
3.058LysPhe: 3.058 ± 0.032
3.738LysGly: 3.738 ± 1.09
2.039LysHis: 2.039 ± 0.992
1.699LysIle: 1.699 ± 0.629
3.058LysLys: 3.058 ± 1.488
6.796LysLeu: 6.796 ± 0.395
1.019LysMet: 1.019 ± 0.232
1.019LysAsn: 1.019 ± 0.232
1.699LysPro: 1.699 ± 0.099
1.359LysGln: 1.359 ± 0.661
5.097LysArg: 5.097 ± 2.479
5.097LysSer: 5.097 ± 1.024
2.718LysThr: 2.718 ± 1.322
4.757LysVal: 4.757 ± 0.859
0.0LysTrp: 0.0 ± 0.0
5.097LysTyr: 5.097 ± 2.614
0.0LysXaa: 0.0 ± 0.0
Leu
5.097LeuAla: 5.097 ± 1.024
3.058LeuCys: 3.058 ± 1.488
5.437LeuAsp: 5.437 ± 0.462
5.097LeuGlu: 5.097 ± 0.296
3.398LeuPhe: 3.398 ± 0.198
6.116LeuGly: 6.116 ± 0.065
2.379LeuHis: 2.379 ± 1.157
4.757LeuIle: 4.757 ± 1.586
5.437LeuLys: 5.437 ± 0.266
7.475LeuLeu: 7.475 ± 1.453
2.718LeuMet: 2.718 ± 1.588
5.097LeuAsn: 5.097 ± 1.159
4.757LeuPro: 4.757 ± 1.324
3.738LeuGln: 3.738 ± 1.092
5.097LeuArg: 5.097 ± 0.431
9.174LeuSer: 9.174 ± 2.28
5.437LeuThr: 5.437 ± 1.189
7.136LeuVal: 7.136 ± 0.895
0.68LeuTrp: 0.68 ± 0.331
3.058LeuTyr: 3.058 ± 0.695
0.0LeuXaa: 0.0 ± 0.0
Met
1.019MetAla: 1.019 ± 1.687
1.359MetCys: 1.359 ± 0.661
1.359MetAsp: 1.359 ± 0.794
1.699MetGlu: 1.699 ± 0.629
1.359MetPhe: 1.359 ± 0.067
1.019MetGly: 1.019 ± 0.496
0.68MetHis: 0.68 ± 0.397
0.0MetIle: 0.0 ± 0.0
1.699MetLys: 1.699 ± 0.826
3.398MetLeu: 3.398 ± 1.258
0.0MetMet: 0.0 ± 0.0
1.359MetAsn: 1.359 ± 1.522
0.68MetPro: 0.68 ± 0.331
1.019MetGln: 1.019 ± 0.496
0.34MetArg: 0.34 ± 0.165
3.058MetSer: 3.058 ± 1.488
1.699MetThr: 1.699 ± 0.099
2.039MetVal: 2.039 ± 0.464
0.0MetTrp: 0.0 ± 0.0
1.019MetTyr: 1.019 ± 0.232
0.0MetXaa: 0.0 ± 0.0
Asn
3.738AsnAla: 3.738 ± 1.092
0.68AsnCys: 0.68 ± 0.331
2.039AsnAsp: 2.039 ± 0.264
1.359AsnGlu: 1.359 ± 0.794
2.718AsnPhe: 2.718 ± 0.861
2.379AsnGly: 2.379 ± 1.754
1.019AsnHis: 1.019 ± 0.959
3.738AsnIle: 3.738 ± 1.09
2.039AsnLys: 2.039 ± 0.992
3.058AsnLeu: 3.058 ± 0.76
2.039AsnMet: 2.039 ± 0.916
1.699AsnAsn: 1.699 ± 0.099
4.417AsnPro: 4.417 ± 0.034
1.359AsnGln: 1.359 ± 0.067
2.379AsnArg: 2.379 ± 0.429
2.379AsnSer: 2.379 ± 0.298
2.718AsnThr: 2.718 ± 1.588
4.417AsnVal: 4.417 ± 0.762
0.34AsnTrp: 0.34 ± 0.165
2.039AsnTyr: 2.039 ± 0.264
0.0AsnXaa: 0.0 ± 0.0
Pro
2.039ProAla: 2.039 ± 0.992
0.34ProCys: 0.34 ± 0.165
1.699ProAsp: 1.699 ± 0.099
2.718ProGlu: 2.718 ± 0.595
2.039ProPhe: 2.039 ± 0.264
2.379ProGly: 2.379 ± 1.754
2.039ProHis: 2.039 ± 0.264
4.077ProIle: 4.077 ± 0.2
1.019ProLys: 1.019 ± 0.496
6.116ProLeu: 6.116 ± 0.663
0.68ProMet: 0.68 ± 0.331
2.718ProAsn: 2.718 ± 0.595
4.077ProPro: 4.077 ± 0.927
0.68ProGln: 0.68 ± 0.397
2.039ProArg: 2.039 ± 0.264
3.398ProSer: 3.398 ± 0.53
4.757ProThr: 4.757 ± 1.324
3.738ProVal: 3.738 ± 0.365
1.699ProTrp: 1.699 ± 0.826
2.718ProTyr: 2.718 ± 0.861
0.0ProXaa: 0.0 ± 0.0
Gln
1.019GlnAla: 1.019 ± 0.232
0.34GlnCys: 0.34 ± 0.165
1.019GlnAsp: 1.019 ± 0.496
2.379GlnGlu: 2.379 ± 1.157
0.34GlnPhe: 0.34 ± 0.165
1.359GlnGly: 1.359 ± 0.067
1.019GlnHis: 1.019 ± 0.496
2.379GlnIle: 2.379 ± 0.429
1.359GlnLys: 1.359 ± 0.794
3.058GlnLeu: 3.058 ± 0.032
0.68GlnMet: 0.68 ± 0.688
1.019GlnAsn: 1.019 ± 0.232
1.699GlnPro: 1.699 ± 0.099
0.34GlnGln: 0.34 ± 0.165
2.718GlnArg: 2.718 ± 0.595
3.738GlnSer: 3.738 ± 1.092
2.379GlnThr: 2.379 ± 1.754
2.718GlnVal: 2.718 ± 0.133
0.68GlnTrp: 0.68 ± 0.397
1.699GlnTyr: 1.699 ± 0.099
0.0GlnXaa: 0.0 ± 0.0
Arg
2.039ArgAla: 2.039 ± 0.992
1.699ArgCys: 1.699 ± 0.099
3.738ArgAsp: 3.738 ± 0.363
1.699ArgGlu: 1.699 ± 0.629
3.738ArgPhe: 3.738 ± 1.09
2.039ArgGly: 2.039 ± 1.191
2.039ArgHis: 2.039 ± 0.992
1.699ArgIle: 1.699 ± 0.099
4.757ArgLys: 4.757 ± 1.586
5.776ArgLeu: 5.776 ± 0.101
1.019ArgMet: 1.019 ± 0.496
1.699ArgAsn: 1.699 ± 0.099
1.359ArgPro: 1.359 ± 0.661
1.359ArgGln: 1.359 ± 0.661
3.058ArgArg: 3.058 ± 1.488
4.077ArgSer: 4.077 ± 1.256
3.738ArgThr: 3.738 ± 0.365
3.058ArgVal: 3.058 ± 0.76
0.68ArgTrp: 0.68 ± 0.331
1.359ArgTyr: 1.359 ± 0.661
0.0ArgXaa: 0.0 ± 0.0
Ser
4.757SerAla: 4.757 ± 1.324
1.359SerCys: 1.359 ± 0.067
4.417SerAsp: 4.417 ± 0.693
4.417SerGlu: 4.417 ± 0.034
3.398SerPhe: 3.398 ± 0.925
4.417SerGly: 4.417 ± 0.762
2.718SerHis: 2.718 ± 0.133
4.077SerIle: 4.077 ± 1.256
5.776SerLys: 5.776 ± 0.627
8.155SerLeu: 8.155 ± 1.056
2.718SerMet: 2.718 ± 0.595
5.097SerAsn: 5.097 ± 2.614
3.058SerPro: 3.058 ± 0.032
2.379SerGln: 2.379 ± 0.298
4.757SerArg: 4.757 ± 1.586
7.136SerSer: 7.136 ± 0.56
5.097SerThr: 5.097 ± 1.159
7.475SerVal: 7.475 ± 0.73
0.68SerTrp: 0.68 ± 0.397
4.417SerTyr: 4.417 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
2.379ThrAla: 2.379 ± 1.754
1.359ThrCys: 1.359 ± 0.661
2.379ThrAsp: 2.379 ± 0.298
1.699ThrGlu: 1.699 ± 0.629
1.359ThrPhe: 1.359 ± 0.794
4.417ThrGly: 4.417 ± 0.034
1.359ThrHis: 1.359 ± 0.661
3.398ThrIle: 3.398 ± 0.198
5.776ThrLys: 5.776 ± 0.627
6.456ThrLeu: 6.456 ± 0.498
1.359ThrMet: 1.359 ± 0.794
4.417ThrAsn: 4.417 ± 2.217
3.398ThrPro: 3.398 ± 0.198
3.738ThrGln: 3.738 ± 0.365
1.699ThrArg: 1.699 ± 0.826
6.796ThrSer: 6.796 ± 3.243
3.058ThrThr: 3.058 ± 2.151
4.077ThrVal: 4.077 ± 2.382
1.699ThrTrp: 1.699 ± 0.099
1.359ThrTyr: 1.359 ± 1.522
0.0ThrXaa: 0.0 ± 0.0
Val
2.379ValAla: 2.379 ± 0.298
1.699ValCys: 1.699 ± 0.826
3.398ValAsp: 3.398 ± 0.925
6.456ValGlu: 6.456 ± 0.498
2.379ValPhe: 2.379 ± 1.026
5.776ValGly: 5.776 ± 1.556
0.34ValHis: 0.34 ± 0.165
5.437ValIle: 5.437 ± 0.462
4.417ValLys: 4.417 ± 1.421
7.475ValLeu: 7.475 ± 0.73
2.039ValMet: 2.039 ± 1.191
2.718ValAsn: 2.718 ± 1.322
2.718ValPro: 2.718 ± 0.861
2.718ValGln: 2.718 ± 0.861
2.718ValArg: 2.718 ± 0.133
10.194ValSer: 10.194 ± 1.59
4.757ValThr: 4.757 ± 0.131
3.398ValVal: 3.398 ± 0.925
0.34ValTrp: 0.34 ± 0.165
3.738ValTyr: 3.738 ± 0.365
0.0ValXaa: 0.0 ± 0.0
Trp
0.34TrpAla: 0.34 ± 0.165
0.34TrpCys: 0.34 ± 0.165
1.359TrpAsp: 1.359 ± 0.661
0.34TrpGlu: 0.34 ± 0.165
1.019TrpPhe: 1.019 ± 0.232
0.34TrpGly: 0.34 ± 0.165
0.0TrpHis: 0.0 ± 0.0
0.68TrpIle: 0.68 ± 0.397
1.019TrpLys: 1.019 ± 0.232
1.019TrpLeu: 1.019 ± 0.232
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.68TrpGln: 0.68 ± 0.331
0.34TrpArg: 0.34 ± 0.165
1.359TrpSer: 1.359 ± 0.067
0.68TrpThr: 0.68 ± 0.331
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.359TrpTyr: 1.359 ± 0.067
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.039TyrAla: 2.039 ± 0.464
2.039TyrCys: 2.039 ± 0.992
4.417TyrAsp: 4.417 ± 0.762
1.019TyrGlu: 1.019 ± 0.232
2.039TyrPhe: 2.039 ± 1.191
2.039TyrGly: 2.039 ± 0.464
2.039TyrHis: 2.039 ± 1.191
3.058TyrIle: 3.058 ± 0.032
3.058TyrLys: 3.058 ± 0.76
4.077TyrLeu: 4.077 ± 0.927
1.359TyrMet: 1.359 ± 0.794
2.718TyrAsn: 2.718 ± 1.322
3.398TyrPro: 3.398 ± 2.713
2.039TyrGln: 2.039 ± 0.264
2.718TyrArg: 2.718 ± 0.133
3.398TyrSer: 3.398 ± 0.53
2.379TyrThr: 2.379 ± 1.157
2.039TyrVal: 2.039 ± 1.191
0.68TyrTrp: 0.68 ± 0.331
3.398TyrTyr: 3.398 ± 1.985
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2944 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski