Amino acid dipeptide frequency for Sanxia picorna-like virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
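As a rough illustration of how such frequencies can be obtained, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and counting overlapping pairs is an assumption about the method rather than a documented detail of Proteome-pI (it is, however, consistent with the table: 2 proteins with 2522 residues give 2520 pairs, and 1/2520 is about 0.397‰, the smallest non-zero value listed).

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs (0,1), (1,2), ...; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code, not the viral proteins.
toy = ["MKLLSA", "MKLS"]
freqs = dipeptide_per_mille(toy)

# Uniform baseline for the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400

for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), "over" if freqs[pair] > uniform else "under")
```

With only eight pairs in the toy input every observed dipeptide is far above the uniform baseline; on a real proteome the comparison becomes meaningful.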

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
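The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the 2.5‰ threshold simply restates the 0.25% expectation from the text; whether the original page colors cells using exactly this threshold is an assumption.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, baseline=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the uniform baseline."""
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "expected"


# Two values taken from the table: LeuSer (7.933) and CysCys (0.0).
for name, value in [("LeuSer", 7.933), ("CysCys", 0.0)]:
    print(name, per_mille_to_fraction(value), classify(value))
```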

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.157 ± 0.443 | 0.793 ± 0.44 | 4.76 ± 0.663 | 2.38 ± 0.001 | 4.363 ± 0.222 | 5.553 ± 0.883 | 0.793 ± 0.44 | 4.76 ± 1.318 | 3.57 ± 0.002 | 3.173 ± 0.438 | 1.19 ± 0.661 | 4.363 ± 2.863 | 3.173 ± 0.438 | 2.38 ± 1.319 | 1.587 ± 0.879 | 5.553 ± 2.864 | 3.967 ± 2.423 | 1.983 ± 0.881 | 1.19 ± 0.661 | 2.38 ± 0.001 | 0.0 ± 0.0
Cys | 1.19 ± 0.001 | 0.0 ± 0.0 | 1.983 ± 1.099 | 1.587 ± 0.219 | 1.19 ± 0.659 | 1.19 ± 0.659 | 0.397 ± 0.22 | 1.587 ± 0.879 | 1.983 ± 1.099 | 0.793 ± 0.44 | 0.397 ± 0.22 | 0.793 ± 0.881 | 3.173 ± 1.098 | 0.397 ± 0.22 | 0.0 ± 0.0 | 0.397 ± 0.22 | 0.397 ± 0.22 | 1.983 ± 0.439 | 0.0 ± 0.0 | 1.983 ± 0.221 | 0.0 ± 0.0
Asp | 2.38 ± 0.661 | 1.19 ± 0.659 | 1.983 ± 1.099 | 2.777 ± 0.442 | 1.587 ± 0.219 | 3.57 ± 0.662 | 0.397 ± 0.22 | 7.537 ± 0.876 | 5.157 ± 2.858 | 4.363 ± 1.758 | 2.38 ± 1.319 | 3.967 ± 1.762 | 3.967 ± 0.218 | 1.19 ± 0.659 | 0.397 ± 0.22 | 3.57 ± 1.322 | 4.363 ± 0.438 | 3.967 ± 0.218 | 0.397 ± 0.22 | 1.19 ± 0.659 | 0.0 ± 0.0
Glu | 2.38 ± 0.659 | 0.397 ± 0.22 | 3.967 ± 1.538 | 2.777 ± 0.879 | 2.38 ± 0.001 | 1.983 ± 0.439 | 0.793 ± 0.44 | 2.38 ± 0.661 | 3.173 ± 1.098 | 2.38 ± 1.322 | 3.173 ± 0.357 | 2.38 ± 0.661 | 1.587 ± 0.219 | 1.19 ± 0.001 | 1.983 ± 1.099 | 4.363 ± 0.222 | 1.587 ± 0.219 | 1.19 ± 0.659 | 0.793 ± 0.22 | 3.57 ± 0.002 | 0.0 ± 0.0
Phe | 4.363 ± 1.098 | 1.19 ± 0.659 | 2.38 ± 0.661 | 1.587 ± 0.441 | 2.38 ± 0.659 | 2.38 ± 0.001 | 1.587 ± 0.219 | 2.777 ± 0.219 | 2.777 ± 0.219 | 5.95 ± 0.657 | 0.793 ± 0.44 | 2.777 ± 0.219 | 2.777 ± 0.879 | 0.793 ± 0.22 | 3.173 ± 0.222 | 4.363 ± 1.098 | 4.363 ± 0.883 | 3.173 ± 0.222 | 0.0 ± 0.0 | 2.777 ± 0.442 | 0.0 ± 0.0
Gly | 3.173 ± 2.862 | 1.983 ± 0.221 | 3.967 ± 0.878 | 3.967 ± 1.102 | 1.19 ± 0.661 | 1.587 ± 0.879 | 0.793 ± 0.44 | 2.38 ± 0.001 | 4.76 ± 1.318 | 3.967 ± 0.442 | 0.793 ± 0.44 | 2.777 ± 0.879 | 1.983 ± 0.439 | 1.983 ± 0.221 | 1.587 ± 0.219 | 4.363 ± 1.098 | 6.347 ± 2.424 | 5.157 ± 0.217 | 0.793 ± 0.22 | 3.967 ± 1.762 | 0.0 ± 0.0
His | 1.587 ± 0.219 | 0.397 ± 0.22 | 0.397 ± 0.44 | 0.397 ± 0.22 | 1.19 ± 0.001 | 1.587 ± 0.219 | 0.0 ± 0.0 | 2.38 ± 0.659 | 1.983 ± 0.439 | 1.983 ± 0.221 | 0.0 ± 0.0 | 1.587 ± 0.219 | 1.19 ± 0.659 | 0.397 ± 0.22 | 0.793 ± 0.22 | 1.19 ± 0.659 | 1.983 ± 0.221 | 2.777 ± 0.442 | 0.0 ± 0.0 | 1.19 ± 0.001 | 0.0 ± 0.0
Ile | 5.157 ± 0.877 | 1.587 ± 0.879 | 3.57 ± 0.662 | 7.14 ± 1.976 | 2.38 ± 0.001 | 3.57 ± 1.322 | 1.983 ± 0.221 | 4.363 ± 1.098 | 3.967 ± 0.218 | 3.967 ± 1.538 | 1.19 ± 0.659 | 2.38 ± 0.001 | 4.363 ± 0.883 | 2.777 ± 0.219 | 2.777 ± 0.879 | 3.967 ± 0.878 | 3.57 ± 0.658 | 5.95 ± 0.657 | 1.19 ± 0.001 | 2.777 ± 0.219 | 0.0 ± 0.0
Lys | 1.587 ± 0.219 | 1.587 ± 0.879 | 3.967 ± 1.538 | 2.777 ± 1.539 | 5.157 ± 2.198 | 3.57 ± 0.658 | 1.983 ± 0.439 | 3.967 ± 0.218 | 3.173 ± 1.098 | 7.14 ± 0.664 | 1.587 ± 0.441 | 2.777 ± 0.442 | 2.38 ± 0.659 | 2.38 ± 0.659 | 1.983 ± 0.439 | 3.173 ± 1.098 | 2.777 ± 0.219 | 2.38 ± 1.319 | 0.793 ± 0.44 | 5.553 ± 2.417 | 0.0 ± 0.0
Leu | 2.777 ± 0.219 | 1.983 ± 0.221 | 5.553 ± 1.757 | 2.777 ± 0.442 | 3.173 ± 0.438 | 2.38 ± 0.661 | 1.983 ± 0.881 | 5.157 ± 0.877 | 7.537 ± 2.196 | 7.14 ± 1.324 | 1.983 ± 1.099 | 4.363 ± 0.438 | 1.983 ± 1.541 | 2.777 ± 0.219 | 3.57 ± 0.002 | 7.933 ± 0.224 | 5.95 ± 0.663 | 4.76 ± 0.663 | 1.587 ± 0.219 | 4.363 ± 0.883 | 0.0 ± 0.0
Met | 2.777 ± 0.219 | 0.793 ± 0.44 | 1.19 ± 0.659 | 0.793 ± 0.44 | 1.19 ± 0.001 | 0.793 ± 0.881 | 0.0 ± 0.0 | 0.793 ± 0.44 | 0.0 ± 0.0 | 3.173 ± 1.098 | 0.793 ± 0.44 | 0.397 ± 0.22 | 1.983 ± 0.439 | 1.19 ± 0.661 | 1.19 ± 0.661 | 1.587 ± 0.441 | 2.38 ± 1.322 | 0.397 ± 0.22 | 0.793 ± 0.44 | 1.983 ± 0.439 | 0.0 ± 0.0
Asn | 1.19 ± 0.661 | 1.19 ± 0.659 | 2.38 ± 0.659 | 1.19 ± 0.001 | 2.777 ± 0.442 | 3.173 ± 0.222 | 2.777 ± 0.219 | 6.743 ± 0.224 | 3.173 ± 0.222 | 3.967 ± 0.878 | 1.587 ± 1.761 | 3.967 ± 0.218 | 3.967 ± 0.442 | 2.777 ± 1.102 | 1.19 ± 0.659 | 2.777 ± 0.219 | 3.57 ± 0.662 | 3.173 ± 1.542 | 0.793 ± 0.881 | 1.19 ± 0.661 | 0.0 ± 0.0
Pro | 1.19 ± 1.321 | 0.397 ± 0.44 | 1.587 ± 0.879 | 1.983 ± 1.099 | 3.173 ± 0.882 | 3.173 ± 1.098 | 0.397 ± 0.44 | 2.38 ± 0.001 | 1.19 ± 0.661 | 3.967 ± 0.218 | 1.19 ± 0.001 | 3.173 ± 1.098 | 0.793 ± 0.22 | 2.777 ± 0.219 | 1.587 ± 0.441 | 5.157 ± 3.083 | 5.157 ± 0.443 | 4.76 ± 1.323 | 0.793 ± 0.44 | 3.57 ± 0.662 | 0.0 ± 0.0
Gln | 2.38 ± 1.322 | 0.397 ± 0.22 | 1.19 ± 0.659 | 1.587 ± 0.219 | 1.587 ± 0.219 | 1.983 ± 0.221 | 0.397 ± 0.22 | 1.983 ± 0.439 | 0.0 ± 0.0 | 3.173 ± 0.882 | 1.19 ± 0.661 | 1.19 ± 0.659 | 1.19 ± 0.001 | 1.19 ± 0.659 | 2.38 ± 0.659 | 6.347 ± 1.764 | 1.19 ± 0.659 | 2.38 ± 0.659 | 0.793 ± 0.44 | 2.38 ± 0.659 | 0.0 ± 0.0
Arg | 3.173 ± 1.759 | 1.19 ± 0.659 | 0.793 ± 0.44 | 1.19 ± 0.001 | 1.19 ± 0.001 | 2.777 ± 1.102 | 0.793 ± 0.44 | 2.777 ± 0.219 | 1.983 ± 1.099 | 4.363 ± 1.758 | 1.19 ± 0.659 | 1.983 ± 0.439 | 2.38 ± 0.661 | 1.19 ± 0.001 | 1.19 ± 0.659 | 0.397 ± 0.22 | 2.38 ± 0.001 | 3.967 ± 1.102 | 0.0 ± 0.0 | 1.983 ± 0.221 | 0.0 ± 0.0
Ser | 5.157 ± 0.217 | 1.587 ± 0.219 | 4.363 ± 1.543 | 3.173 ± 0.222 | 3.967 ± 0.878 | 5.553 ± 0.437 | 1.983 ± 0.221 | 4.363 ± 0.438 | 7.14 ± 1.316 | 6.347 ± 2.424 | 0.397 ± 0.44 | 3.57 ± 2.642 | 1.983 ± 2.201 | 2.38 ± 0.659 | 3.57 ± 1.318 | 5.95 ± 1.323 | 5.553 ± 2.864 | 6.347 ± 2.424 | 2.38 ± 1.322 | 2.777 ± 1.102 | 0.0 ± 0.0
Thr | 7.933 ± 2.205 | 1.19 ± 0.659 | 3.57 ± 0.658 | 0.793 ± 0.44 | 5.157 ± 0.217 | 4.363 ± 0.883 | 1.19 ± 0.661 | 3.967 ± 1.762 | 2.38 ± 0.001 | 3.967 ± 1.762 | 1.19 ± 0.891 | 3.967 ± 0.218 | 3.57 ± 1.322 | 3.57 ± 1.322 | 3.57 ± 0.658 | 5.553 ± 2.864 | 3.967 ± 0.442 | 2.38 ± 0.659 | 0.397 ± 0.22 | 3.173 ± 1.542 | 0.0 ± 0.0
Val | 5.553 ± 1.543 | 1.587 ± 0.879 | 5.553 ± 0.883 | 2.38 ± 0.661 | 3.57 ± 0.002 | 3.57 ± 1.322 | 1.587 ± 0.219 | 4.76 ± 1.318 | 3.173 ± 1.098 | 3.967 ± 0.218 | 1.19 ± 0.659 | 5.95 ± 0.663 | 3.173 ± 0.882 | 1.19 ± 0.659 | 3.173 ± 0.222 | 5.157 ± 2.423 | 3.173 ± 0.882 | 4.363 ± 1.098 | 0.0 ± 0.0 | 2.777 ± 0.442 | 0.0 ± 0.0
Trp | 0.793 ± 0.881 | 0.793 ± 0.44 | 0.397 ± 0.22 | 0.0 ± 0.0 | 1.19 ± 0.001 | 0.793 ± 0.44 | 0.397 ± 0.22 | 0.0 ± 0.0 | 0.793 ± 0.44 | 3.173 ± 0.222 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.397 ± 0.22 | 0.397 ± 0.44 | 0.397 ± 0.44 | 1.19 ± 0.001 | 0.397 ± 0.44 | 1.587 ± 0.441 | 0.0 ± 0.0 | 0.397 ± 0.22 | 0.0 ± 0.0
Tyr | 3.173 ± 0.882 | 1.19 ± 0.661 | 2.777 ± 0.442 | 3.173 ± 1.098 | 3.57 ± 0.002 | 3.967 ± 0.218 | 2.777 ± 0.879 | 3.57 ± 1.318 | 2.777 ± 0.879 | 2.777 ± 0.219 | 1.587 ± 0.219 | 1.19 ± 0.001 | 2.38 ± 1.982 | 1.983 ± 0.221 | 0.793 ± 0.22 | 5.553 ± 2.203 | 3.173 ± 0.882 | 3.173 ± 0.882 | 0.397 ± 0.22 | 1.587 ± 0.441 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2 proteins (2522 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
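A protein-level bootstrap can be sketched roughly as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only an illustration under stated assumptions; the function names are hypothetical, and using the standard deviation of the replicates as the error is an assumption, since the page does not say which statistic is reported.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-residue string in seq."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread (population std dev, in per mille) of a dipeptide frequency
    over `rounds` resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(len(s) - 1 for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the viral proteins.
print(bootstrap_error(["MKLLSA", "MKLS"], "MK"))
```

With only two proteins, as here, a replicate can take just a few distinct values, which may explain why many error values in the table repeat.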

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski