Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_391

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
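The counting described above can be sketched in Python. This is a minimal illustration, not the database's own implementation: the function names, the toy sequences, and the choice to count overlapping pairs within each protein only (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: pairs are counted within a protein, never across two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(value_permille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if value_permille > EXPECTED_PERMILLE:
        return "over"
    if value_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


toy = ["MKAAG", "AAGK"]  # hypothetical toy sequences, not from this proteome
freqs = dipeptide_permille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

With this rule, the AlaAla value from the table (9.915‰) would be classed as overrepresented and a value such as 0.708‰ as underrepresented.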

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
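As a small worked example of the unit conversion, the following sketch converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0


ala_ala = 9.915  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(permille_to_percent(ala_ala))   # the same value in percent
```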

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 9.915 ± 3.618 | 1.416 ± 1.213 | 9.207 ± 2.886 | 9.207 ± 2.57 | 4.958 ± 1.203 | 6.374 ± 2.624 | 1.416 ± 0.992 | 4.249 ± 1.452 | 2.833 ± 1.223 | 4.249 ± 2.217 | 5.666 ± 1.932 | 3.541 ± 2.46 | 1.416 ± 0.992 | 3.541 ± 1.573 | 2.833 ± 0.914 | 2.125 ± 2.308 | 3.541 ± 1.782 | 4.958 ± 1.9 | 2.125 ± 1.045 | 7.79 ± 1.948 | 0.0 ± 0.0 |
| Cys | 0.708 ± 0.606 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.416 ± 1.213 | 0.0 ± 0.0 | 0.708 ± 0.606 | 0.0 ± 0.0 | 2.125 ± 1.045 | 0.0 ± 0.0 | 0.708 ± 0.606 | 1.416 ± 0.707 | 0.0 ± 0.0 | 0.708 ± 0.606 | 2.125 ± 1.819 | 0.708 ± 0.606 | 0.708 ± 0.496 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 3.541 ± 1.117 | 0.0 ± 0.0 | 1.416 ± 0.549 | 2.833 ± 1.618 | 2.125 ± 1.684 | 2.833 ± 1.282 | 3.541 ± 1.345 | 2.125 ± 1.645 | 0.708 ± 0.769 | 4.958 ± 0.934 | 1.416 ± 1.071 | 2.125 ± 1.496 | 2.833 ± 1.078 | 3.541 ± 1.202 | 4.249 ± 1.26 | 7.082 ± 2.604 | 2.833 ± 1.078 | 0.708 ± 0.878 | 1.416 ± 0.843 | 5.666 ± 0.943 | 0.0 ± 0.0 |
| Glu | 10.623 ± 3.679 | 0.708 ± 0.606 | 4.958 ± 2.571 | 3.541 ± 1.009 | 2.125 ± 1.342 | 4.958 ± 2.636 | 0.708 ± 0.606 | 2.833 ± 1.097 | 2.833 ± 2.098 | 1.416 ± 0.549 | 1.416 ± 0.828 | 2.125 ± 0.851 | 2.125 ± 2.595 | 3.541 ± 1.844 | 3.541 ± 0.79 | 2.125 ± 1.342 | 5.666 ± 1.714 | 2.125 ± 0.863 | 1.416 ± 0.992 | 4.249 ± 1.456 | 0.0 ± 0.0 |
| Phe | 1.416 ± 0.992 | 0.0 ± 0.0 | 2.125 ± 0.949 | 1.416 ± 1.642 | 2.125 ± 1.128 | 2.833 ± 1.282 | 0.0 ± 0.0 | 0.708 ± 0.878 | 3.541 ± 1.021 | 0.708 ± 0.606 | 2.125 ± 1.108 | 2.833 ± 1.591 | 0.708 ± 0.496 | 0.708 ± 1.375 | 1.416 ± 0.84 | 1.416 ± 0.992 | 2.833 ± 1.983 | 0.708 ± 0.496 | 0.708 ± 0.496 | 2.125 ± 0.852 | 0.0 ± 0.0 |
| Gly | 7.79 ± 1.893 | 2.125 ± 1.045 | 4.958 ± 2.785 | 4.958 ± 1.88 | 0.708 ± 0.769 | 6.374 ± 1.559 | 2.833 ± 1.776 | 7.79 ± 1.595 | 6.374 ± 1.556 | 4.958 ± 3.488 | 0.708 ± 0.496 | 0.708 ± 0.496 | 0.708 ± 0.769 | 2.125 ± 1.045 | 1.416 ± 0.549 | 9.207 ± 3.361 | 4.958 ± 2.087 | 4.249 ± 1.639 | 0.708 ± 0.496 | 2.125 ± 0.852 | 0.0 ± 0.0 |
| His | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.125 ± 0.554 | 0.0 ± 0.0 | 1.416 ± 0.549 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.125 ± 1.819 | 1.416 ± 0.549 | 2.833 ± 1.097 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.708 ± 0.606 | 0.708 ± 0.606 | 1.416 ± 1.049 | 0.0 ± 0.0 | 2.125 ± 0.852 | 1.416 ± 1.213 | 0.0 ± 0.0 | 2.125 ± 0.852 | 0.0 ± 0.0 |
| Ile | 3.541 ± 1.844 | 0.0 ± 0.0 | 2.125 ± 1.045 | 2.833 ± 1.572 | 2.125 ± 1.064 | 1.416 ± 0.549 | 0.0 ± 0.0 | 3.541 ± 1.432 | 5.666 ± 1.76 | 4.958 ± 1.394 | 3.541 ± 0.749 | 1.416 ± 0.992 | 4.249 ± 2.225 | 2.125 ± 1.645 | 1.416 ± 1.37 | 3.541 ± 1.117 | 2.125 ± 1.472 | 2.125 ± 2.164 | 2.125 ± 1.045 | 7.79 ± 1.5 | 0.0 ± 0.0 |
| Lys | 6.374 ± 0.926 | 0.0 ± 0.0 | 2.833 ± 2.042 | 2.833 ± 0.628 | 2.125 ± 0.852 | 3.541 ± 2.209 | 1.416 ± 0.549 | 5.666 ± 3.018 | 4.249 ± 2.501 | 3.541 ± 1.686 | 0.708 ± 0.69 | 3.541 ± 1.701 | 2.125 ± 1.645 | 1.416 ± 0.549 | 2.833 ± 1.296 | 1.416 ± 0.549 | 2.125 ± 1.045 | 2.833 ± 0.628 | 2.125 ± 1.488 | 1.416 ± 0.549 | 0.0 ± 0.0 |
| Leu | 5.666 ± 2.791 | 0.0 ± 0.0 | 2.833 ± 1.097 | 4.958 ± 1.8 | 2.125 ± 0.949 | 6.374 ± 1.085 | 0.708 ± 0.606 | 4.249 ± 2.089 | 3.541 ± 2.209 | 2.833 ± 1.704 | 1.416 ± 1.213 | 3.541 ± 1.117 | 3.541 ± 1.432 | 2.833 ± 0.628 | 4.249 ± 1.108 | 2.833 ± 1.403 | 5.666 ± 1.249 | 1.416 ± 1.642 | 2.125 ± 0.852 | 2.833 ± 1.078 | 0.0 ± 0.0 |
| Met | 4.249 ± 1.674 | 0.0 ± 0.0 | 4.249 ± 1.578 | 1.416 ± 1.755 | 0.708 ± 0.769 | 2.833 ± 1.415 | 0.708 ± 0.606 | 2.833 ± 1.078 | 0.708 ± 0.606 | 1.416 ± 0.843 | 0.708 ± 0.769 | 0.708 ± 0.496 | 3.541 ± 1.86 | 0.708 ± 0.496 | 4.249 ± 1.646 | 4.249 ± 1.812 | 0.708 ± 0.769 | 0.708 ± 0.769 | 0.708 ± 1.375 | 0.708 ± 0.769 | 0.0 ± 0.0 |
| Asn | 2.833 ± 2.723 | 1.416 ± 1.213 | 2.125 ± 0.554 | 2.125 ± 1.128 | 1.416 ± 0.84 | 1.416 ± 1.539 | 0.0 ± 0.0 | 4.249 ± 1.578 | 0.708 ± 0.878 | 4.958 ± 2.12 | 2.125 ± 0.949 | 0.708 ± 0.496 | 0.708 ± 0.769 | 2.125 ± 0.949 | 1.416 ± 0.549 | 1.416 ± 0.707 | 3.541 ± 2.07 | 2.833 ± 1.221 | 2.125 ± 1.819 | 2.833 ± 1.415 | 0.0 ± 0.0 |
| Pro | 4.249 ± 0.749 | 0.708 ± 0.606 | 0.708 ± 0.496 | 3.541 ± 1.432 | 0.708 ± 0.496 | 5.666 ± 2.38 | 0.708 ± 0.606 | 4.249 ± 1.431 | 0.708 ± 0.606 | 2.833 ± 0.717 | 1.416 ± 0.707 | 0.708 ± 0.496 | 1.416 ± 0.992 | 2.125 ± 1.488 | 1.416 ± 1.37 | 2.833 ± 1.579 | 2.833 ± 0.915 | 5.666 ± 1.454 | 0.0 ± 0.0 | 1.416 ± 0.549 | 0.0 ± 0.0 |
| Gln | 2.833 ± 0.915 | 1.416 ± 0.843 | 1.416 ± 0.549 | 3.541 ± 0.987 | 1.416 ± 1.071 | 4.249 ± 1.639 | 0.708 ± 0.496 | 2.833 ± 1.215 | 2.833 ± 1.983 | 4.958 ± 2.222 | 3.541 ± 2.56 | 1.416 ± 0.707 | 2.125 ± 0.851 | 4.958 ± 2.283 | 4.958 ± 2.057 | 1.416 ± 0.707 | 3.541 ± 1.202 | 1.416 ± 0.992 | 0.0 ± 0.0 | 1.416 ± 0.992 | 0.0 ± 0.0 |
| Arg | 5.666 ± 2.035 | 1.416 ± 0.549 | 2.833 ± 0.921 | 3.541 ± 1.432 | 1.416 ± 0.549 | 0.708 ± 0.496 | 0.708 ± 0.606 | 1.416 ± 0.549 | 2.833 ± 1.992 | 0.708 ± 0.606 | 4.249 ± 1.26 | 2.833 ± 1.296 | 4.249 ± 2.089 | 2.833 ± 1.43 | 2.833 ± 1.776 | 3.541 ± 1.782 | 4.958 ± 2.12 | 2.125 ± 1.684 | 1.416 ± 1.213 | 4.249 ± 0.944 | 0.0 ± 0.0 |
| Ser | 2.833 ± 0.914 | 0.708 ± 0.496 | 4.249 ± 1.31 | 4.249 ± 1.578 | 2.125 ± 1.488 | 7.082 ± 5.787 | 1.416 ± 0.549 | 2.125 ± 1.559 | 2.125 ± 1.472 | 5.666 ± 1.93 | 1.416 ± 1.463 | 4.958 ± 1.9 | 2.833 ± 1.339 | 2.833 ± 1.415 | 5.666 ± 1.712 | 7.082 ± 5.785 | 3.541 ± 2.479 | 2.833 ± 1.983 | 2.833 ± 0.914 | 1.416 ± 1.539 | 0.0 ± 0.0 |
| Thr | 7.082 ± 1.349 | 0.708 ± 0.606 | 2.125 ± 0.949 | 4.249 ± 1.705 | 0.0 ± 0.0 | 8.499 ± 1.698 | 0.708 ± 0.606 | 2.833 ± 1.097 | 4.958 ± 2.014 | 3.541 ± 0.79 | 0.708 ± 0.769 | 3.541 ± 1.599 | 2.125 ± 0.949 | 2.833 ± 0.921 | 2.833 ± 1.618 | 7.79 ± 3.047 | 2.833 ± 1.983 | 2.125 ± 1.342 | 0.708 ± 1.375 | 3.541 ± 1.345 | 0.0 ± 0.0 |
| Val | 4.249 ± 3.657 | 0.0 ± 0.0 | 0.708 ± 0.496 | 3.541 ± 1.519 | 0.708 ± 0.496 | 3.541 ± 1.312 | 0.708 ± 0.878 | 0.0 ± 0.0 | 3.541 ± 1.801 | 2.125 ± 0.852 | 0.708 ± 0.496 | 1.416 ± 0.707 | 4.958 ± 2.12 | 3.541 ± 1.918 | 1.416 ± 1.854 | 2.833 ± 1.39 | 4.958 ± 1.491 | 3.541 ± 1.345 | 0.708 ± 0.606 | 0.708 ± 0.606 | 0.0 ± 0.0 |
| Trp | 4.249 ± 1.254 | 0.0 ± 0.0 | 2.125 ± 1.342 | 2.125 ± 1.819 | 0.708 ± 0.606 | 1.416 ± 1.213 | 0.708 ± 0.496 | 1.416 ± 0.992 | 0.708 ± 0.496 | 0.0 ± 0.0 | 0.708 ± 0.496 | 1.416 ± 0.549 | 0.708 ± 0.496 | 2.125 ± 1.128 | 0.708 ± 0.606 | 2.125 ± 1.496 | 0.708 ± 0.496 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.708 ± 0.496 | 0.0 ± 0.0 |
| Tyr | 3.541 ± 1.86 | 0.708 ± 0.606 | 2.833 ± 1.215 | 2.833 ± 1.992 | 1.416 ± 0.992 | 4.249 ± 1.701 | 1.416 ± 1.213 | 0.708 ± 0.769 | 2.833 ± 1.221 | 4.958 ± 1.373 | 2.833 ± 1.205 | 2.833 ± 2.136 | 1.416 ± 1.213 | 6.374 ± 2.979 | 4.958 ± 2.71 | 2.833 ± 1.097 | 3.541 ± 1.458 | 1.416 ± 0.992 | 1.416 ± 0.992 | 2.125 ± 0.852 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 5 proteins (1413 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
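A bootstrap at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The function names and toy sequences are hypothetical, and using the sample standard deviation as the error measure is an assumption, since the source states only that 100 resamples at the protein level were used.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency across resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


toy = ["MKAAG", "AAGK", "GGGG"]  # hypothetical toy sequences
print(bootstrap_error(toy, "AA"))
```

With only 5 proteins in this proteome, each resample contains few distinct proteins, which may explain why some errors in the table are as large as the values themselves.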

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski