Amino acid dipeptide frequency for Sanxia picorna-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
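As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs in a list of sequences. The function name, the toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for illustration; the database's exact procedure is not described here. One-letter codes are used for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_fractions(sequences):
    """Return {dipeptide: fraction} over all overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs within one protein; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not the viral proteins from this page.
toy = ["MKAAL", "AAG"]
fractions = dipeptide_fractions(toy)
print(fractions["AA"])  # AA occurs in 2 of the 6 pairs
```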

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
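The uniform baseline can be checked with a short calculation. The sketch below assumes that "expected" means the uniform value of 1/400 (0.25%, i.e. 2.5‰ in the units of the table); the exact threshold used for the colouring is not stated on this page, and the function names are hypothetical.

```python
def expected_uniform_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected = expected_uniform_permille()  # 20 standard amino acids
# Three values taken from the table, in per mille.
examples = {"LeuSer": 9.266, "AlaAla": 4.633, "CysLys": 0.0}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
print(expected, labels)
```

With Xaa included, `expected_uniform_permille(21)` gives the baseline for 441 dipeptides instead.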

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
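For example, the unit conversion for the AlaAla value from the table could be written as follows (the helper name is hypothetical):

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


ala_ala_fraction = permille_to_fraction(4.633)  # AlaAla from the table
ala_ala_percent = ala_ala_fraction * 100        # same value as a percentage
print(ala_ala_fraction, ala_ala_percent)
```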

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.633AlaAla: 4.633 ± 1.477
1.544AlaCys: 1.544 ± 0.136
2.703AlaAsp: 2.703 ± 0.081
0.386AlaGlu: 0.386 ± 0.191
3.089AlaPhe: 3.089 ± 0.9
4.247AlaGly: 4.247 ± 1.04
1.158AlaHis: 1.158 ± 0.573
3.861AlaIle: 3.861 ± 0.025
3.089AlaLys: 3.089 ± 0.9
3.861AlaLeu: 3.861 ± 1.282
0.386AlaMet: 0.386 ± 0.437
3.089AlaAsn: 3.089 ± 0.356
3.861AlaPro: 3.861 ± 1.859
0.772AlaGln: 0.772 ± 0.382
3.089AlaArg: 3.089 ± 0.984
4.247AlaSer: 4.247 ± 0.412
2.703AlaThr: 2.703 ± 0.547
3.475AlaVal: 3.475 ± 0.165
0.772AlaTrp: 0.772 ± 0.246
1.931AlaTyr: 1.931 ± 0.301
0.0AlaXaa: 0.0 ± 0.0
Cys
1.544CysAla: 1.544 ± 0.764
0.772CysCys: 0.772 ± 0.382
0.386CysAsp: 0.386 ± 0.191
1.931CysGlu: 1.931 ± 0.327
1.544CysPhe: 1.544 ± 0.492
1.931CysGly: 1.931 ± 0.955
0.386CysHis: 0.386 ± 0.191
1.158CysIle: 1.158 ± 0.683
0.0CysLys: 0.0 ± 0.0
1.158CysLeu: 1.158 ± 0.573
0.386CysMet: 0.386 ± 0.191
1.544CysAsn: 1.544 ± 0.764
0.772CysPro: 0.772 ± 0.382
0.0CysGln: 0.0 ± 0.0
0.386CysArg: 0.386 ± 0.437
1.158CysSer: 1.158 ± 0.573
0.772CysThr: 0.772 ± 0.382
1.158CysVal: 1.158 ± 0.573
0.0CysTrp: 0.0 ± 0.0
2.317CysTyr: 2.317 ± 0.518
0.0CysXaa: 0.0 ± 0.0
Asp
2.317AspAla: 2.317 ± 0.738
0.0AspCys: 0.0 ± 0.0
3.475AspAsp: 3.475 ± 0.165
4.633AspGlu: 4.633 ± 0.407
2.317AspPhe: 2.317 ± 0.738
1.544AspGly: 1.544 ± 0.136
0.386AspHis: 0.386 ± 0.191
4.247AspIle: 4.247 ± 1.472
3.089AspLys: 3.089 ± 1.528
6.564AspLeu: 6.564 ± 0.106
1.158AspMet: 1.158 ± 0.055
0.772AspAsn: 0.772 ± 0.246
2.703AspPro: 2.703 ± 1.175
1.931AspGln: 1.931 ± 0.929
3.861AspArg: 3.861 ± 0.653
5.792AspSer: 5.792 ± 0.276
1.931AspThr: 1.931 ± 0.955
2.317AspVal: 2.317 ± 0.518
0.386AspTrp: 0.386 ± 0.437
3.089AspTyr: 3.089 ± 1.528
0.0AspXaa: 0.0 ± 0.0
Glu
2.317GluAla: 2.317 ± 0.11
1.544GluCys: 1.544 ± 0.764
1.158GluAsp: 1.158 ± 0.055
4.247GluGlu: 4.247 ± 1.472
1.931GluPhe: 1.931 ± 0.301
1.544GluGly: 1.544 ± 0.492
0.772GluHis: 0.772 ± 0.382
3.475GluIle: 3.475 ± 0.463
0.386GluLys: 0.386 ± 0.437
4.633GluLeu: 4.633 ± 0.407
1.544GluMet: 1.544 ± 0.423
3.475GluAsn: 3.475 ± 0.165
1.544GluPro: 1.544 ± 0.492
5.405GluGln: 5.405 ± 0.789
3.861GluArg: 3.861 ± 1.282
1.931GluSer: 1.931 ± 0.301
2.317GluThr: 2.317 ± 0.738
3.475GluVal: 3.475 ± 2.678
1.544GluTrp: 1.544 ± 0.136
3.475GluTyr: 3.475 ± 1.091
0.0GluXaa: 0.0 ± 0.0
Phe
2.703PheAla: 2.703 ± 1.337
0.386PheCys: 0.386 ± 0.437
3.089PheAsp: 3.089 ± 0.984
4.247PheGlu: 4.247 ± 0.216
1.544PhePhe: 1.544 ± 1.12
3.475PheGly: 3.475 ± 0.165
2.317PheHis: 2.317 ± 0.738
1.158PheIle: 1.158 ± 0.573
2.317PheLys: 2.317 ± 0.11
4.247PheLeu: 4.247 ± 1.04
1.544PheMet: 1.544 ± 1.12
2.703PheAsn: 2.703 ± 0.709
1.931PhePro: 1.931 ± 0.301
0.772PheGln: 0.772 ± 0.246
2.703PheArg: 2.703 ± 0.547
3.089PheSer: 3.089 ± 0.984
2.703PheThr: 2.703 ± 0.081
2.317PheVal: 2.317 ± 0.518
0.386PheTrp: 0.386 ± 0.437
1.158PheTyr: 1.158 ± 0.055
0.0PheXaa: 0.0 ± 0.0
Gly
3.089GlyAla: 3.089 ± 0.356
0.772GlyCys: 0.772 ± 0.246
3.089GlyAsp: 3.089 ± 0.984
2.703GlyGlu: 2.703 ± 0.547
4.247GlyPhe: 4.247 ± 0.412
1.544GlyGly: 1.544 ± 0.764
1.931GlyHis: 1.931 ± 0.955
3.861GlyIle: 3.861 ± 0.603
4.247GlyLys: 4.247 ± 0.412
5.405GlyLeu: 5.405 ± 0.161
1.544GlyMet: 1.544 ± 0.136
3.861GlyAsn: 3.861 ± 1.282
2.703GlyPro: 2.703 ± 0.081
1.544GlyGln: 1.544 ± 0.136
1.544GlyArg: 1.544 ± 0.136
3.475GlySer: 3.475 ± 1.091
5.019GlyThr: 5.019 ± 1.286
4.247GlyVal: 4.247 ± 1.04
1.158GlyTrp: 1.158 ± 1.311
1.931GlyTyr: 1.931 ± 0.955
0.0GlyXaa: 0.0 ± 0.0
His
0.386HisAla: 0.386 ± 0.437
0.772HisCys: 0.772 ± 0.382
1.158HisAsp: 1.158 ± 0.055
0.772HisGlu: 0.772 ± 0.382
0.772HisPhe: 0.772 ± 0.874
1.544HisGly: 1.544 ± 0.136
0.386HisHis: 0.386 ± 0.191
3.475HisIle: 3.475 ± 1.091
1.158HisLys: 1.158 ± 0.055
1.931HisLeu: 1.931 ± 0.327
0.772HisMet: 0.772 ± 0.382
1.158HisAsn: 1.158 ± 0.055
0.386HisPro: 0.386 ± 0.437
0.386HisGln: 0.386 ± 0.191
0.772HisArg: 0.772 ± 0.246
2.317HisSer: 2.317 ± 0.518
3.861HisThr: 3.861 ± 0.653
0.772HisVal: 0.772 ± 0.382
0.386HisTrp: 0.386 ± 0.191
1.158HisTyr: 1.158 ± 0.055
0.0HisXaa: 0.0 ± 0.0
Ile
5.019IleAla: 5.019 ± 1.914
1.544IleCys: 1.544 ± 0.136
4.247IleAsp: 4.247 ± 0.412
3.089IleGlu: 3.089 ± 0.356
1.544IlePhe: 1.544 ± 0.136
3.089IleGly: 3.089 ± 0.9
3.089IleHis: 3.089 ± 0.272
2.317IleIle: 2.317 ± 0.518
3.089IleLys: 3.089 ± 0.9
6.178IleLeu: 6.178 ± 1.799
1.158IleMet: 1.158 ± 0.573
2.317IleAsn: 2.317 ± 0.518
1.931IlePro: 1.931 ± 0.955
1.544IleGln: 1.544 ± 0.492
3.475IleArg: 3.475 ± 0.165
6.178IleSer: 6.178 ± 2.597
3.861IleThr: 3.861 ± 0.025
3.861IleVal: 3.861 ± 0.025
1.544IleTrp: 1.544 ± 0.764
2.703IleTyr: 2.703 ± 0.081
0.0IleXaa: 0.0 ± 0.0
Lys
2.317LysAla: 2.317 ± 0.518
1.544LysCys: 1.544 ± 0.764
1.931LysAsp: 1.931 ± 0.955
0.772LysGlu: 0.772 ± 0.382
3.475LysPhe: 3.475 ± 0.165
1.931LysGly: 1.931 ± 0.301
0.386LysHis: 0.386 ± 0.191
1.931LysIle: 1.931 ± 0.301
1.544LysLys: 1.544 ± 0.764
3.475LysLeu: 3.475 ± 0.794
0.386LysMet: 0.386 ± 0.191
1.931LysAsn: 1.931 ± 0.955
2.703LysPro: 2.703 ± 0.709
0.386LysGln: 0.386 ± 0.437
2.317LysArg: 2.317 ± 0.11
3.089LysSer: 3.089 ± 1.528
3.475LysThr: 3.475 ± 0.165
3.089LysVal: 3.089 ± 0.272
0.772LysTrp: 0.772 ± 0.382
3.861LysTyr: 3.861 ± 0.025
0.0LysXaa: 0.0 ± 0.0
Leu
5.019LeuAla: 5.019 ± 0.658
1.931LeuCys: 1.931 ± 0.327
3.861LeuAsp: 3.861 ± 0.025
3.475LeuGlu: 3.475 ± 0.794
4.633LeuPhe: 4.633 ± 0.221
6.95LeuGly: 6.95 ± 1.553
1.544LeuHis: 1.544 ± 0.492
5.792LeuIle: 5.792 ± 0.276
3.475LeuLys: 3.475 ± 1.091
7.722LeuLeu: 7.722 ± 1.935
0.772LeuMet: 0.772 ± 0.382
3.089LeuAsn: 3.089 ± 0.984
5.019LeuPro: 5.019 ± 0.658
2.703LeuGln: 2.703 ± 0.709
6.178LeuArg: 6.178 ± 0.713
9.266LeuSer: 9.266 ± 0.441
6.178LeuThr: 6.178 ± 1.171
6.178LeuVal: 6.178 ± 0.543
1.158LeuTrp: 1.158 ± 0.055
4.247LeuTyr: 4.247 ± 1.472
0.0LeuXaa: 0.0 ± 0.0
Met
1.544MetAla: 1.544 ± 0.136
0.386MetCys: 0.386 ± 0.191
1.158MetAsp: 1.158 ± 0.573
0.772MetGlu: 0.772 ± 0.382
1.158MetPhe: 1.158 ± 0.055
1.158MetGly: 1.158 ± 0.683
0.772MetHis: 0.772 ± 0.246
0.386MetIle: 0.386 ± 0.191
0.772MetLys: 0.772 ± 0.382
2.703MetLeu: 2.703 ± 0.709
0.0MetMet: 0.0 ± 0.0
0.772MetAsn: 0.772 ± 0.382
2.317MetPro: 2.317 ± 0.738
0.386MetGln: 0.386 ± 0.437
1.544MetArg: 1.544 ± 0.136
3.089MetSer: 3.089 ± 0.9
0.772MetThr: 0.772 ± 0.874
1.931MetVal: 1.931 ± 0.327
0.0MetTrp: 0.0 ± 0.0
1.544MetTyr: 1.544 ± 0.136
0.0MetXaa: 0.0 ± 0.0
Asn
2.703AsnAla: 2.703 ± 0.547
0.386AsnCys: 0.386 ± 0.191
1.544AsnAsp: 1.544 ± 0.136
0.772AsnGlu: 0.772 ± 0.246
1.158AsnPhe: 1.158 ± 0.055
3.861AsnGly: 3.861 ± 0.025
0.386AsnHis: 0.386 ± 0.191
5.019AsnIle: 5.019 ± 0.03
2.317AsnLys: 2.317 ± 1.146
3.089AsnLeu: 3.089 ± 1.528
2.317AsnMet: 2.317 ± 1.146
2.703AsnAsn: 2.703 ± 0.081
2.703AsnPro: 2.703 ± 0.709
3.089AsnGln: 3.089 ± 0.984
4.633AsnArg: 4.633 ± 0.221
7.336AsnSer: 7.336 ± 2.024
3.089AsnThr: 3.089 ± 0.356
3.861AsnVal: 3.861 ± 1.231
0.772AsnTrp: 0.772 ± 0.246
2.317AsnTyr: 2.317 ± 0.11
0.0AsnXaa: 0.0 ± 0.0
Pro
1.931ProAla: 1.931 ± 0.327
0.386ProCys: 0.386 ± 0.437
3.475ProAsp: 3.475 ± 0.463
2.703ProGlu: 2.703 ± 0.081
3.475ProPhe: 3.475 ± 1.422
1.544ProGly: 1.544 ± 0.492
1.544ProHis: 1.544 ± 0.136
3.861ProIle: 3.861 ± 0.603
0.772ProLys: 0.772 ± 0.382
6.178ProLeu: 6.178 ± 1.341
1.544ProMet: 1.544 ± 0.492
2.703ProAsn: 2.703 ± 0.547
0.386ProPro: 0.386 ± 0.191
0.772ProGln: 0.772 ± 0.382
1.544ProArg: 1.544 ± 0.492
4.247ProSer: 4.247 ± 1.668
2.703ProThr: 2.703 ± 1.175
3.089ProVal: 3.089 ± 0.9
0.386ProTrp: 0.386 ± 0.437
3.861ProTyr: 3.861 ± 0.653
0.0ProXaa: 0.0 ± 0.0
Gln
0.772GlnAla: 0.772 ± 0.382
0.772GlnCys: 0.772 ± 0.382
1.158GlnAsp: 1.158 ± 0.573
1.931GlnGlu: 1.931 ± 1.557
0.772GlnPhe: 0.772 ± 0.382
0.772GlnGly: 0.772 ± 0.382
1.158GlnHis: 1.158 ± 0.573
1.158GlnIle: 1.158 ± 1.311
1.931GlnLys: 1.931 ± 0.929
5.019GlnLeu: 5.019 ± 0.03
0.386GlnMet: 0.386 ± 0.191
1.544GlnAsn: 1.544 ± 0.136
1.544GlnPro: 1.544 ± 0.136
0.0GlnGln: 0.0 ± 0.0
1.544GlnArg: 1.544 ± 0.764
5.019GlnSer: 5.019 ± 0.03
0.386GlnThr: 0.386 ± 0.191
3.475GlnVal: 3.475 ± 0.165
0.0GlnTrp: 0.0 ± 0.0
1.544GlnTyr: 1.544 ± 0.492
0.0GlnXaa: 0.0 ± 0.0
Arg
2.703ArgAla: 2.703 ± 0.709
0.0ArgCys: 0.0 ± 0.0
4.247ArgAsp: 4.247 ± 2.101
2.317ArgGlu: 2.317 ± 0.518
1.544ArgPhe: 1.544 ± 0.136
3.861ArgGly: 3.861 ± 0.603
2.317ArgHis: 2.317 ± 0.518
5.019ArgIle: 5.019 ± 0.03
3.475ArgLys: 3.475 ± 0.463
3.861ArgLeu: 3.861 ± 0.025
3.089ArgMet: 3.089 ± 0.9
2.703ArgAsn: 2.703 ± 0.547
2.703ArgPro: 2.703 ± 0.547
2.703ArgGln: 2.703 ± 1.803
2.703ArgArg: 2.703 ± 0.081
2.703ArgSer: 2.703 ± 0.081
2.317ArgThr: 2.317 ± 1.146
5.019ArgVal: 5.019 ± 0.03
0.772ArgTrp: 0.772 ± 0.246
2.703ArgTyr: 2.703 ± 1.175
0.0ArgXaa: 0.0 ± 0.0
Ser
1.544SerAla: 1.544 ± 1.12
0.772SerCys: 0.772 ± 0.382
3.475SerAsp: 3.475 ± 0.165
7.336SerGlu: 7.336 ± 0.14
3.089SerPhe: 3.089 ± 0.356
6.95SerGly: 6.95 ± 1.587
1.931SerHis: 1.931 ± 0.327
6.95SerIle: 6.95 ± 1.553
3.861SerLys: 3.861 ± 0.025
6.95SerLeu: 6.95 ± 0.959
0.386SerMet: 0.386 ± 0.191
6.178SerAsn: 6.178 ± 1.341
2.317SerPro: 2.317 ± 0.11
1.931SerGln: 1.931 ± 0.955
3.089SerArg: 3.089 ± 0.272
7.722SerSer: 7.722 ± 2.461
9.266SerThr: 9.266 ± 1.697
7.336SerVal: 7.336 ± 1.396
1.158SerTrp: 1.158 ± 0.573
3.475SerTyr: 3.475 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
4.633ThrAla: 4.633 ± 1.035
1.931ThrCys: 1.931 ± 0.955
4.633ThrAsp: 4.633 ± 0.221
2.703ThrGlu: 2.703 ± 0.081
3.861ThrPhe: 3.861 ± 0.603
3.089ThrGly: 3.089 ± 0.984
0.772ThrHis: 0.772 ± 0.246
3.475ThrIle: 3.475 ± 0.794
1.931ThrLys: 1.931 ± 0.327
4.247ThrLeu: 4.247 ± 2.296
1.931ThrMet: 1.931 ± 0.327
5.405ThrAsn: 5.405 ± 0.467
5.019ThrPro: 5.019 ± 1.914
2.317ThrGln: 2.317 ± 1.146
4.247ThrArg: 4.247 ± 0.412
4.633ThrSer: 4.633 ± 0.221
5.405ThrThr: 5.405 ± 0.161
6.564ThrVal: 6.564 ± 0.522
0.772ThrTrp: 0.772 ± 0.382
2.703ThrTyr: 2.703 ± 0.709
0.0ThrXaa: 0.0 ± 0.0
Val
4.633ValAla: 4.633 ± 0.849
2.317ValCys: 2.317 ± 1.146
4.247ValAsp: 4.247 ± 0.216
2.317ValGlu: 2.317 ± 1.146
3.089ValPhe: 3.089 ± 0.9
3.861ValGly: 3.861 ± 0.025
1.158ValHis: 1.158 ± 0.055
1.931ValIle: 1.931 ± 0.301
1.158ValLys: 1.158 ± 1.311
7.722ValLeu: 7.722 ± 0.051
1.158ValMet: 1.158 ± 0.22
3.861ValAsn: 3.861 ± 1.231
4.633ValPro: 4.633 ± 0.407
1.544ValGln: 1.544 ± 1.12
4.633ValArg: 4.633 ± 1.035
5.792ValSer: 5.792 ± 1.532
6.95ValThr: 6.95 ± 0.959
6.178ValVal: 6.178 ± 1.341
0.772ValTrp: 0.772 ± 0.246
3.861ValTyr: 3.861 ± 0.025
0.0ValXaa: 0.0 ± 0.0
Trp
0.386TrpAla: 0.386 ± 0.191
0.0TrpCys: 0.0 ± 0.0
1.158TrpAsp: 1.158 ± 0.573
0.772TrpGlu: 0.772 ± 0.246
1.158TrpPhe: 1.158 ± 0.683
0.386TrpGly: 0.386 ± 0.437
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.544TrpLys: 1.544 ± 0.136
1.158TrpLeu: 1.158 ± 0.055
0.772TrpMet: 0.772 ± 0.246
1.544TrpAsn: 1.544 ± 0.136
0.386TrpPro: 0.386 ± 0.437
0.0TrpGln: 0.0 ± 0.0
0.386TrpArg: 0.386 ± 0.437
1.158TrpSer: 1.158 ± 0.573
1.544TrpThr: 1.544 ± 1.12
0.386TrpVal: 0.386 ± 0.191
0.0TrpTrp: 0.0 ± 0.0
1.158TrpTyr: 1.158 ± 0.573
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.703TyrAla: 2.703 ± 0.547
1.544TyrCys: 1.544 ± 0.136
2.317TyrAsp: 2.317 ± 0.11
3.089TyrGlu: 3.089 ± 0.984
0.386TyrPhe: 0.386 ± 0.191
4.633TyrGly: 4.633 ± 0.407
1.931TyrHis: 1.931 ± 0.301
2.703TyrIle: 2.703 ± 0.709
1.158TyrLys: 1.158 ± 0.573
3.089TyrLeu: 3.089 ± 1.528
1.544TyrMet: 1.544 ± 0.492
2.703TyrAsn: 2.703 ± 0.081
1.544TyrPro: 1.544 ± 0.492
2.703TyrGln: 2.703 ± 1.337
4.247TyrArg: 4.247 ± 2.101
3.475TyrSer: 3.475 ± 1.091
5.019TyrThr: 5.019 ± 0.598
3.089TyrVal: 3.089 ± 0.272
1.158TyrTrp: 1.158 ± 0.055
2.703TyrTyr: 2.703 ± 0.709
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2591 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
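A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported as the error. This is only an illustration under stated assumptions (standard deviation as the error measure, 100 resamples, hypothetical function names and toy sequences); the procedure actually used by the database is not detailed here.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, pair):
    """Frequency of one dipeptide (per mille) over a set of proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the two viral proteins from this page.
toy = ["MKAAL", "AAGAA", "MLSTV"]
err = bootstrap_error(toy, "AA")
print(err)
```

With only a few proteins, as here, the number of distinct resamples is small, so the estimated error should be read with caution.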

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski