Amino acid dipeptide frequency for Jodiemicrovirus-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (and all 20 amino acids were equally common), each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
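The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own procedure: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are given in one-letter codes, dipeptides are counted as overlapping pairs within each protein, and "over" or "under" is decided by a plain comparison with 2.5‰.

```python
from collections import Counter

# Uniform expectation over the 20 standard amino acids:
# 1 of 400 ordered pairs = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (20 * 20)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins
        # (an assumption of this sketch).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input, not real proteome data.
toy = ["AAA", "AC"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With real data, the input would be the protein sequences of the proteome; dipeptides that never occur are simply absent from the returned dictionary and correspond to the 0.0 entries in the table.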

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
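As a small worked illustration of this unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 5.216  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```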

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.216 ± 2.435
AlaCys: 0.745 ± 0.512
AlaAsp: 9.687 ± 4.099
AlaGlu: 1.49 ± 0.646
AlaPhe: 0.0 ± 0.0
AlaGly: 5.961 ± 2.412
AlaHis: 1.49 ± 0.575
AlaIle: 1.49 ± 0.878
AlaLys: 2.235 ± 0.827
AlaLeu: 5.216 ± 1.644
AlaMet: 1.49 ± 1.701
AlaAsn: 2.235 ± 1.411
AlaPro: 6.706 ± 2.796
AlaGln: 1.49 ± 1.4
AlaArg: 5.216 ± 1.462
AlaSer: 6.706 ± 2.628
AlaThr: 3.726 ± 1.14
AlaVal: 5.216 ± 1.123
AlaTrp: 1.49 ± 0.646
AlaTyr: 0.745 ± 0.512
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.745 ± 0.73
CysAsp: 1.49 ± 0.575
CysGlu: 0.0 ± 0.0
CysPhe: 0.745 ± 0.73
CysGly: 1.49 ± 1.46
CysHis: 0.745 ± 0.73
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.745 ± 0.73
CysMet: 0.0 ± 0.0
CysAsn: 1.49 ± 0.575
CysPro: 0.0 ± 0.0
CysGln: 1.49 ± 0.575
CysArg: 0.0 ± 0.0
CysSer: 0.745 ± 0.512
CysThr: 0.745 ± 0.73
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.49 ± 0.575
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.706 ± 2.078
AspCys: 0.0 ± 0.0
AspAsp: 5.961 ± 2.179
AspGlu: 2.981 ± 2.437
AspPhe: 3.726 ± 0.8
AspGly: 2.235 ± 1.537
AspHis: 1.49 ± 0.575
AspIle: 2.235 ± 1.612
AspLys: 2.981 ± 1.612
AspLeu: 5.961 ± 0.791
AspMet: 0.745 ± 0.695
AspAsn: 2.235 ± 0.809
AspPro: 1.49 ± 1.073
AspGln: 0.745 ± 0.512
AspArg: 4.471 ± 1.515
AspSer: 2.981 ± 0.854
AspThr: 3.726 ± 0.8
AspVal: 5.961 ± 1.74
AspTrp: 0.0 ± 0.0
AspTyr: 3.726 ± 1.52
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.216 ± 2.145
GluCys: 0.0 ± 0.0
GluAsp: 3.726 ± 0.484
GluGlu: 0.745 ± 0.73
GluPhe: 3.726 ± 2.174
GluGly: 0.745 ± 0.512
GluHis: 0.745 ± 0.512
GluIle: 0.745 ± 1.072
GluLys: 3.726 ± 0.484
GluLeu: 3.726 ± 2.427
GluMet: 0.745 ± 0.851
GluAsn: 2.981 ± 1.577
GluPro: 1.49 ± 0.841
GluGln: 0.745 ± 0.512
GluArg: 2.235 ± 1.858
GluSer: 2.981 ± 1.151
GluThr: 0.745 ± 0.7
GluVal: 2.981 ± 2.049
GluTrp: 1.49 ± 1.025
GluTyr: 4.471 ± 1.781
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.471 ± 0.829
PheCys: 0.745 ± 0.73
PheAsp: 4.471 ± 1.457
PheGlu: 0.745 ± 0.851
PhePhe: 2.235 ± 1.211
PheGly: 4.471 ± 0.972
PheHis: 0.745 ± 1.072
PheIle: 3.726 ± 0.856
PheLys: 4.471 ± 1.547
PheLeu: 2.235 ± 0.483
PheMet: 1.49 ± 0.575
PheAsn: 2.981 ± 1.057
PhePro: 2.235 ± 1.537
PheGln: 0.745 ± 0.512
PheArg: 4.471 ± 1.864
PheSer: 6.706 ± 1.22
PheThr: 2.981 ± 0.47
PheVal: 5.216 ± 1.851
PheTrp: 1.49 ± 0.575
PheTyr: 0.745 ± 0.7
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.981 ± 1.915
GlyCys: 0.0 ± 0.0
GlyAsp: 2.235 ± 1.246
GlyGlu: 2.981 ± 2.049
GlyPhe: 5.961 ± 1.189
GlyGly: 5.961 ± 2.238
GlyHis: 1.49 ± 0.575
GlyIle: 6.706 ± 2.151
GlyLys: 1.49 ± 0.646
GlyLeu: 2.981 ± 1.057
GlyMet: 0.745 ± 0.512
GlyAsn: 1.49 ± 0.646
GlyPro: 1.49 ± 0.646
GlyGln: 1.49 ± 0.575
GlyArg: 2.235 ± 0.932
GlySer: 5.961 ± 1.391
GlyThr: 3.726 ± 1.52
GlyVal: 1.49 ± 0.646
GlyTrp: 1.49 ± 1.121
GlyTyr: 3.726 ± 1.589
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.235 ± 1.411
HisCys: 0.745 ± 0.512
HisAsp: 1.49 ± 0.841
HisGlu: 2.235 ± 0.809
HisPhe: 0.0 ± 0.0
HisGly: 2.235 ± 0.809
HisHis: 0.745 ± 0.851
HisIle: 0.0 ± 0.0
HisLys: 1.49 ± 1.079
HisLeu: 3.726 ± 1.749
HisMet: 0.0 ± 0.0
HisAsn: 1.49 ± 1.079
HisPro: 0.745 ± 0.7
HisGln: 0.745 ± 0.512
HisArg: 2.981 ± 1.256
HisSer: 2.981 ± 1.151
HisThr: 0.0 ± 0.0
HisVal: 0.745 ± 0.73
HisTrp: 0.0 ± 0.0
HisTyr: 2.235 ± 1.211
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.49 ± 0.646
IleCys: 0.0 ± 0.0
IleAsp: 3.726 ± 1.308
IleGlu: 1.49 ± 1.121
IlePhe: 2.981 ± 1.057
IleGly: 2.981 ± 0.47
IleHis: 3.726 ± 2.845
IleIle: 0.745 ± 0.73
IleLys: 2.235 ± 1.211
IleLeu: 3.726 ± 2.174
IleMet: 1.49 ± 0.956
IleAsn: 5.961 ± 1.189
IlePro: 1.49 ± 0.646
IleGln: 1.49 ± 0.878
IleArg: 5.216 ± 1.218
IleSer: 4.471 ± 1.035
IleThr: 1.49 ± 1.025
IleVal: 0.745 ± 0.851
IleTrp: 0.745 ± 0.512
IleTyr: 2.981 ± 1.745
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.235 ± 1.537
LysCys: 2.235 ± 2.19
LysAsp: 4.471 ± 2.336
LysGlu: 2.981 ± 1.612
LysPhe: 5.961 ± 2.302
LysGly: 2.981 ± 0.854
LysHis: 1.49 ± 1.46
LysIle: 4.471 ± 1.548
LysLys: 4.471 ± 3.687
LysLeu: 6.706 ± 3.573
LysMet: 0.0 ± 0.0
LysAsn: 0.745 ± 0.512
LysPro: 2.235 ± 1.321
LysGln: 0.0 ± 0.0
LysArg: 5.961 ± 1.948
LysSer: 8.942 ± 3.443
LysThr: 1.49 ± 0.575
LysVal: 3.726 ± 1.232
LysTrp: 0.0 ± 0.0
LysTyr: 2.235 ± 1.635
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.216 ± 1.09
LeuCys: 0.745 ± 0.73
LeuAsp: 3.726 ± 1.726
LeuGlu: 5.216 ± 1.61
LeuPhe: 3.726 ± 1.52
LeuGly: 6.706 ± 2.479
LeuHis: 0.745 ± 0.512
LeuIle: 5.961 ± 2.365
LeuLys: 6.706 ± 2.998
LeuLeu: 6.706 ± 2.912
LeuMet: 4.471 ± 1.331
LeuAsn: 7.452 ± 0.968
LeuPro: 5.961 ± 1.721
LeuGln: 2.981 ± 1.151
LeuArg: 2.981 ± 1.025
LeuSer: 9.687 ± 3.829
LeuThr: 2.981 ± 1.057
LeuVal: 2.235 ± 1.537
LeuTrp: 0.745 ± 0.512
LeuTyr: 2.981 ± 1.577
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.49 ± 0.646
MetCys: 0.0 ± 0.0
MetAsp: 2.235 ± 1.801
MetGlu: 0.0 ± 0.0
MetPhe: 0.745 ± 0.851
MetGly: 0.0 ± 0.0
MetHis: 1.49 ± 0.646
MetIle: 2.235 ± 1.537
MetLys: 2.235 ± 0.809
MetLeu: 0.745 ± 0.7
MetMet: 0.0 ± 0.0
MetAsn: 0.745 ± 0.73
MetPro: 1.49 ± 1.025
MetGln: 0.0 ± 0.0
MetArg: 3.726 ± 1.352
MetSer: 0.745 ± 0.512
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.49 ± 0.646
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.471 ± 2.413
AsnCys: 2.235 ± 1.211
AsnAsp: 0.745 ± 0.512
AsnGlu: 2.981 ± 1.393
AsnPhe: 3.726 ± 1.749
AsnGly: 0.745 ± 0.512
AsnHis: 1.49 ± 0.878
AsnIle: 0.0 ± 0.0
AsnLys: 2.981 ± 1.057
AsnLeu: 5.961 ± 0.478
AsnMet: 1.49 ± 1.112
AsnAsn: 1.49 ± 1.4
AsnPro: 2.981 ± 1.581
AsnGln: 3.726 ± 1.589
AsnArg: 4.471 ± 1.515
AsnSer: 3.726 ± 1.307
AsnThr: 0.0 ± 0.0
AsnVal: 0.745 ± 0.7
AsnTrp: 0.745 ± 0.512
AsnTyr: 2.981 ± 1.314
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.216 ± 2.475
ProCys: 0.745 ± 0.512
ProAsp: 0.745 ± 0.851
ProGlu: 1.49 ± 0.575
ProPhe: 0.745 ± 0.512
ProGly: 2.235 ± 1.246
ProHis: 0.745 ± 0.73
ProIle: 3.726 ± 1.293
ProLys: 0.0 ± 0.0
ProLeu: 5.961 ± 1.316
ProMet: 2.235 ± 1.31
ProAsn: 0.745 ± 0.512
ProPro: 2.235 ± 1.608
ProGln: 0.745 ± 0.7
ProArg: 2.981 ± 0.47
ProSer: 6.706 ± 2.811
ProThr: 5.216 ± 1.972
ProVal: 5.961 ± 3.185
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.726 ± 1.16
GlnCys: 0.745 ± 0.73
GlnAsp: 1.49 ± 0.646
GlnGlu: 3.726 ± 1.696
GlnPhe: 1.49 ± 1.025
GlnGly: 1.49 ± 0.575
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 2.981 ± 1.393
GlnLeu: 0.0 ± 0.0
GlnMet: 2.235 ± 0.483
GlnAsn: 1.49 ± 1.4
GlnPro: 1.49 ± 1.025
GlnGln: 2.981 ± 0.47
GlnArg: 0.0 ± 0.0
GlnSer: 2.981 ± 1.612
GlnThr: 1.49 ± 1.121
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.49 ± 1.073
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.981 ± 0.986
ArgCys: 0.745 ± 0.73
ArgAsp: 0.0 ± 0.0
ArgGlu: 3.726 ± 1.293
ArgPhe: 4.471 ± 3.292
ArgGly: 2.235 ± 0.809
ArgHis: 1.49 ± 0.575
ArgIle: 3.726 ± 1.505
ArgLys: 6.706 ± 2.635
ArgLeu: 8.197 ± 2.074
ArgMet: 0.0 ± 0.0
ArgAsn: 2.235 ± 0.483
ArgPro: 4.471 ± 0.972
ArgGln: 0.745 ± 1.072
ArgArg: 2.235 ± 1.211
ArgSer: 10.432 ± 2.179
ArgThr: 2.981 ± 1.256
ArgVal: 2.981 ± 1.292
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.471 ± 0.749
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.961 ± 1.02
SerCys: 1.49 ± 0.575
SerAsp: 4.471 ± 2.039
SerGlu: 4.471 ± 3.229
SerPhe: 5.961 ± 1.661
SerGly: 6.706 ± 3.121
SerHis: 1.49 ± 1.025
SerIle: 4.471 ± 1.774
SerLys: 7.452 ± 1.942
SerLeu: 11.923 ± 4.635
SerMet: 0.0 ± 0.0
SerAsn: 2.981 ± 1.057
SerPro: 5.961 ± 2.238
SerGln: 2.235 ± 0.894
SerArg: 5.216 ± 2.485
SerSer: 10.432 ± 3.569
SerThr: 7.452 ± 1.564
SerVal: 7.452 ± 0.632
SerTrp: 1.49 ± 0.878
SerTyr: 5.216 ± 2.491
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.981 ± 2.049
ThrCys: 0.0 ± 0.0
ThrAsp: 3.726 ± 1.293
ThrGlu: 2.981 ± 1.359
ThrPhe: 1.49 ± 0.575
ThrGly: 2.981 ± 1.292
ThrHis: 2.981 ± 1.057
ThrIle: 3.726 ± 1.14
ThrLys: 3.726 ± 2.517
ThrLeu: 2.981 ± 1.226
ThrMet: 0.0 ± 0.0
ThrAsn: 0.745 ± 0.7
ThrPro: 1.49 ± 0.646
ThrGln: 1.49 ± 0.575
ThrArg: 4.471 ± 0.829
ThrSer: 5.961 ± 1.878
ThrThr: 2.981 ± 1.359
ThrVal: 2.235 ± 1.103
ThrTrp: 0.745 ± 0.7
ThrTyr: 2.235 ± 1.211
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.49 ± 1.025
ValCys: 0.0 ± 0.0
ValAsp: 3.726 ± 1.893
ValGlu: 2.235 ± 1.103
ValPhe: 4.471 ± 2.318
ValGly: 1.49 ± 0.841
ValHis: 1.49 ± 1.025
ValIle: 2.981 ± 1.151
ValLys: 1.49 ± 0.878
ValLeu: 3.726 ± 1.83
ValMet: 1.49 ± 0.588
ValAsn: 5.961 ± 1.997
ValPro: 3.726 ± 1.293
ValGln: 0.745 ± 0.73
ValArg: 1.49 ± 0.575
ValSer: 6.706 ± 1.111
ValThr: 4.471 ± 1.189
ValVal: 4.471 ± 0.684
ValTrp: 0.745 ± 0.73
ValTyr: 1.49 ± 0.646
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.745 ± 0.73
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.745 ± 0.512
TrpPhe: 1.49 ± 0.646
TrpGly: 0.0 ± 0.0
TrpHis: 0.745 ± 0.512
TrpIle: 0.745 ± 1.072
TrpLys: 1.49 ± 0.878
TrpLeu: 0.745 ± 0.7
TrpMet: 0.0 ± 0.0
TrpAsn: 0.745 ± 0.512
TrpPro: 0.0 ± 0.0
TrpGln: 0.745 ± 0.73
TrpArg: 1.49 ± 1.025
TrpSer: 0.0 ± 0.0
TrpThr: 0.745 ± 0.512
TrpVal: 0.745 ± 0.512
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.726 ± 1.14
TyrCys: 0.0 ± 0.0
TyrAsp: 2.235 ± 0.827
TyrGlu: 0.745 ± 0.851
TyrPhe: 3.726 ± 0.856
TyrGly: 2.981 ± 1.057
TyrHis: 1.49 ± 0.575
TyrIle: 1.49 ± 1.079
TyrLys: 4.471 ± 2.648
TyrLeu: 5.961 ± 2.482
TyrMet: 0.0 ± 0.0
TyrAsn: 2.235 ± 0.932
TyrPro: 0.745 ± 0.512
TyrGln: 4.471 ± 1.483
TyrArg: 2.981 ± 1.057
TyrSer: 2.981 ± 2.047
TyrThr: 2.981 ± 1.226
TyrVal: 1.49 ± 1.701
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.745 ± 0.7
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1343 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
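A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual code: the function names are hypothetical, sequences use one-letter codes, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation of the resampled estimates (the note does not say which spread measure is used).

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy_proteins = ["MAAK", "MKLS", "AASL", "MSLL", "KAAA"]
print(pair_per_mille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

With only 5 proteins in the proteome, each resample can differ a lot from the original set, which fits the relatively large errors seen next to many values in the table.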

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski