Amino acid dipepetide frequency for Capybara microvirus Cap1_SP_159

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.996AlaAla: 1.996 ± 1.318
1.331AlaCys: 1.331 ± 1.079
2.661AlaAsp: 2.661 ± 1.057
5.988AlaGlu: 5.988 ± 2.145
3.992AlaPhe: 3.992 ± 1.746
3.992AlaGly: 3.992 ± 1.393
2.661AlaHis: 2.661 ± 1.276
3.327AlaIle: 3.327 ± 1.978
1.996AlaLys: 1.996 ± 0.882
7.319AlaLeu: 7.319 ± 2.081
2.661AlaMet: 2.661 ± 0.826
6.653AlaAsn: 6.653 ± 2.74
1.331AlaPro: 1.331 ± 0.668
3.992AlaGln: 3.992 ± 1.125
3.327AlaArg: 3.327 ± 0.93
7.984AlaSer: 7.984 ± 1.634
5.988AlaThr: 5.988 ± 1.993
5.323AlaVal: 5.323 ± 1.454
1.996AlaTrp: 1.996 ± 0.882
2.661AlaTyr: 2.661 ± 2.208
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.665CysCys: 0.665 ± 0.439
2.661CysAsp: 2.661 ± 0.99
0.665CysGlu: 0.665 ± 0.439
1.331CysPhe: 1.331 ± 1.079
1.996CysGly: 1.996 ± 1.619
0.0CysHis: 0.0 ± 0.0
1.996CysIle: 1.996 ± 1.726
1.996CysLys: 1.996 ± 1.221
0.665CysLeu: 0.665 ± 0.54
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.665CysPro: 0.665 ± 0.54
0.0CysGln: 0.0 ± 0.0
1.996CysArg: 1.996 ± 0.924
1.331CysSer: 1.331 ± 0.879
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.665CysTrp: 0.665 ± 0.439
1.996CysTyr: 1.996 ± 0.748
0.0CysXaa: 0.0 ± 0.0
Asp
3.327AspAla: 3.327 ± 1.075
2.661AspCys: 2.661 ± 0.848
1.331AspAsp: 1.331 ± 0.482
8.649AspGlu: 8.649 ± 1.729
7.319AspPhe: 7.319 ± 1.518
1.331AspGly: 1.331 ± 1.088
1.331AspHis: 1.331 ± 0.879
3.992AspIle: 3.992 ± 1.125
3.327AspLys: 3.327 ± 1.218
6.653AspLeu: 6.653 ± 2.092
1.331AspMet: 1.331 ± 0.694
3.992AspAsn: 3.992 ± 1.48
1.996AspPro: 1.996 ± 0.748
0.665AspGln: 0.665 ± 0.946
1.996AspArg: 1.996 ± 0.975
3.327AspSer: 3.327 ± 2.596
1.996AspThr: 1.996 ± 1.433
3.327AspVal: 3.327 ± 1.179
0.0AspTrp: 0.0 ± 0.0
3.992AspTyr: 3.992 ± 0.969
0.0AspXaa: 0.0 ± 0.0
Glu
7.319GluAla: 7.319 ± 3.656
0.0GluCys: 0.0 ± 0.0
2.661GluAsp: 2.661 ± 0.848
4.657GluGlu: 4.657 ± 1.797
1.996GluPhe: 1.996 ± 0.748
1.331GluGly: 1.331 ± 0.482
1.331GluHis: 1.331 ± 0.879
4.657GluIle: 4.657 ± 1.089
3.992GluLys: 3.992 ± 1.855
3.327GluLeu: 3.327 ± 1.773
0.0GluMet: 0.0 ± 0.0
3.327GluAsn: 3.327 ± 1.569
1.996GluPro: 1.996 ± 0.748
2.661GluGln: 2.661 ± 0.871
1.331GluArg: 1.331 ± 0.879
1.996GluSer: 1.996 ± 0.527
1.996GluThr: 1.996 ± 1.211
5.323GluVal: 5.323 ± 3.496
1.331GluTrp: 1.331 ± 0.668
4.657GluTyr: 4.657 ± 1.257
0.0GluXaa: 0.0 ± 0.0
Phe
2.661PheAla: 2.661 ± 1.128
0.665PheCys: 0.665 ± 0.54
4.657PheAsp: 4.657 ± 1.319
1.331PheGlu: 1.331 ± 0.888
3.327PhePhe: 3.327 ± 1.11
2.661PheGly: 2.661 ± 1.128
0.665PheHis: 0.665 ± 0.906
5.323PheIle: 5.323 ± 1.036
1.996PheLys: 1.996 ± 1.396
2.661PheLeu: 2.661 ± 1.647
1.331PheMet: 1.331 ± 0.842
0.665PheAsn: 0.665 ± 0.54
2.661PhePro: 2.661 ± 0.739
0.0PheGln: 0.0 ± 0.0
5.323PheArg: 5.323 ± 1.27
2.661PheSer: 2.661 ± 1.128
3.327PheThr: 3.327 ± 1.179
3.992PheVal: 3.992 ± 1.663
0.0PheTrp: 0.0 ± 0.0
1.996PheTyr: 1.996 ± 1.223
0.0PheXaa: 0.0 ± 0.0
Gly
5.988GlyAla: 5.988 ± 0.826
1.331GlyCys: 1.331 ± 1.079
2.661GlyAsp: 2.661 ± 1.128
4.657GlyGlu: 4.657 ± 1.855
1.996GlyPhe: 1.996 ± 0.782
3.992GlyGly: 3.992 ± 0.986
1.331GlyHis: 1.331 ± 0.668
3.992GlyIle: 3.992 ± 0.969
5.323GlyLys: 5.323 ± 1.627
7.984GlyLeu: 7.984 ± 1.502
1.331GlyMet: 1.331 ± 0.891
4.657GlyAsn: 4.657 ± 1.563
0.665GlyPro: 0.665 ± 0.439
0.0GlyGln: 0.0 ± 0.0
1.331GlyArg: 1.331 ± 0.482
6.653GlySer: 6.653 ± 0.897
2.661GlyThr: 2.661 ± 0.871
1.996GlyVal: 1.996 ± 0.527
0.665GlyTrp: 0.665 ± 0.439
3.327GlyTyr: 3.327 ± 1.179
0.0GlyXaa: 0.0 ± 0.0
His
1.996HisAla: 1.996 ± 0.924
0.665HisCys: 0.665 ± 0.439
1.331HisAsp: 1.331 ± 1.785
3.992HisGlu: 3.992 ± 2.383
1.331HisPhe: 1.331 ± 0.879
2.661HisGly: 2.661 ± 1.435
0.665HisHis: 0.665 ± 0.906
0.0HisIle: 0.0 ± 0.0
1.996HisLys: 1.996 ± 0.748
3.992HisLeu: 3.992 ± 1.052
0.0HisMet: 0.0 ± 0.0
0.665HisAsn: 0.665 ± 0.631
0.665HisPro: 0.665 ± 0.54
0.665HisGln: 0.665 ± 0.631
2.661HisArg: 2.661 ± 1.006
2.661HisSer: 2.661 ± 1.126
1.331HisThr: 1.331 ± 0.879
1.996HisVal: 1.996 ± 1.557
0.0HisTrp: 0.0 ± 0.0
0.665HisTyr: 0.665 ± 0.54
0.0HisXaa: 0.0 ± 0.0
Ile
2.661IleAla: 2.661 ± 1.15
0.665IleCys: 0.665 ± 0.54
7.319IleAsp: 7.319 ± 1.292
2.661IleGlu: 2.661 ± 0.739
0.0IlePhe: 0.0 ± 0.0
3.327IleGly: 3.327 ± 0.924
0.665IleHis: 0.665 ± 0.54
1.996IleIle: 1.996 ± 0.924
4.657IleLys: 4.657 ± 1.599
3.992IleLeu: 3.992 ± 1.707
1.331IleMet: 1.331 ± 0.923
2.661IleAsn: 2.661 ± 1.305
5.988IlePro: 5.988 ± 1.56
0.665IleGln: 0.665 ± 0.439
1.996IleArg: 1.996 ± 1.081
4.657IleSer: 4.657 ± 1.281
3.327IleThr: 3.327 ± 1.179
3.327IleVal: 3.327 ± 1.413
1.996IleTrp: 1.996 ± 0.924
1.331IleTyr: 1.331 ± 0.482
0.0IleXaa: 0.0 ± 0.0
Lys
2.661LysAla: 2.661 ± 3.625
0.665LysCys: 0.665 ± 0.54
1.996LysAsp: 1.996 ± 0.748
3.327LysGlu: 3.327 ± 1.179
1.996LysPhe: 1.996 ± 0.77
2.661LysGly: 2.661 ± 1.128
1.996LysHis: 1.996 ± 1.221
3.992LysIle: 3.992 ± 1.758
5.323LysLys: 5.323 ± 2.996
3.327LysLeu: 3.327 ± 1.372
1.996LysMet: 1.996 ± 1.55
2.661LysAsn: 2.661 ± 1.276
3.327LysPro: 3.327 ± 1.11
3.327LysGln: 3.327 ± 1.299
5.323LysArg: 5.323 ± 1.251
6.653LysSer: 6.653 ± 3.154
1.996LysThr: 1.996 ± 1.844
1.996LysVal: 1.996 ± 1.042
0.665LysTrp: 0.665 ± 0.54
0.665LysTyr: 0.665 ± 0.631
0.0LysXaa: 0.0 ± 0.0
Leu
9.315LeuAla: 9.315 ± 1.963
1.996LeuCys: 1.996 ± 0.748
6.653LeuAsp: 6.653 ± 2.121
2.661LeuGlu: 2.661 ± 0.678
1.996LeuPhe: 1.996 ± 0.748
7.319LeuGly: 7.319 ± 2.356
3.992LeuHis: 3.992 ± 1.169
4.657LeuIle: 4.657 ± 1.196
2.661LeuLys: 2.661 ± 1.382
4.657LeuLeu: 4.657 ± 0.755
0.665LeuMet: 0.665 ± 0.54
4.657LeuAsn: 4.657 ± 1.818
6.653LeuPro: 6.653 ± 1.465
6.653LeuGln: 6.653 ± 2.521
5.323LeuArg: 5.323 ± 1.251
5.988LeuSer: 5.988 ± 2.745
4.657LeuThr: 4.657 ± 1.976
5.323LeuVal: 5.323 ± 1.29
0.0LeuTrp: 0.0 ± 0.0
1.331LeuTyr: 1.331 ± 0.888
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.665MetCys: 0.665 ± 0.54
1.331MetAsp: 1.331 ± 0.668
0.0MetGlu: 0.0 ± 0.0
1.996MetPhe: 1.996 ± 1.081
1.996MetGly: 1.996 ± 1.511
0.665MetHis: 0.665 ± 0.54
0.665MetIle: 0.665 ± 0.439
0.665MetLys: 0.665 ± 0.893
1.331MetLeu: 1.331 ± 1.088
1.331MetMet: 1.331 ± 0.89
1.996MetAsn: 1.996 ± 1.223
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.665MetArg: 0.665 ± 0.946
3.327MetSer: 3.327 ± 1.886
1.331MetThr: 1.331 ± 0.668
1.331MetVal: 1.331 ± 0.861
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
7.319AsnAla: 7.319 ± 1.629
0.0AsnCys: 0.0 ± 0.0
1.996AsnAsp: 1.996 ± 0.975
2.661AsnGlu: 2.661 ± 1.328
1.331AsnPhe: 1.331 ± 0.888
1.996AsnGly: 1.996 ± 0.939
1.331AsnHis: 1.331 ± 1.079
1.996AsnIle: 1.996 ± 0.527
3.327AsnLys: 3.327 ± 1.228
3.992AsnLeu: 3.992 ± 1.509
0.665AsnMet: 0.665 ± 0.839
5.323AsnAsn: 5.323 ± 1.079
4.657AsnPro: 4.657 ± 1.545
1.331AsnGln: 1.331 ± 1.261
1.331AsnArg: 1.331 ± 1.261
6.653AsnSer: 6.653 ± 1.906
3.992AsnThr: 3.992 ± 2.052
1.996AsnVal: 1.996 ± 0.924
0.0AsnTrp: 0.0 ± 0.0
1.331AsnTyr: 1.331 ± 0.879
0.0AsnXaa: 0.0 ± 0.0
Pro
1.331ProAla: 1.331 ± 0.879
1.331ProCys: 1.331 ± 1.079
5.323ProAsp: 5.323 ± 2.312
3.327ProGlu: 3.327 ± 1.11
4.657ProPhe: 4.657 ± 1.392
1.331ProGly: 1.331 ± 0.879
0.665ProHis: 0.665 ± 0.54
3.327ProIle: 3.327 ± 1.014
1.996ProLys: 1.996 ± 0.939
5.988ProLeu: 5.988 ± 2.051
1.331ProMet: 1.331 ± 1.405
2.661ProAsn: 2.661 ± 1.722
4.657ProPro: 4.657 ± 2.014
2.661ProGln: 2.661 ± 1.39
2.661ProArg: 2.661 ± 1.647
2.661ProSer: 2.661 ± 0.831
3.327ProThr: 3.327 ± 1.412
1.996ProVal: 1.996 ± 0.527
0.0ProTrp: 0.0 ± 0.0
1.331ProTyr: 1.331 ± 0.668
0.0ProXaa: 0.0 ± 0.0
Gln
1.996GlnAla: 1.996 ± 1.223
1.331GlnCys: 1.331 ± 0.888
1.996GlnAsp: 1.996 ± 0.527
1.331GlnGlu: 1.331 ± 0.668
0.665GlnPhe: 0.665 ± 0.439
1.996GlnGly: 1.996 ± 0.924
1.331GlnHis: 1.331 ± 0.482
1.331GlnIle: 1.331 ± 0.694
3.327GlnLys: 3.327 ± 1.299
1.996GlnLeu: 1.996 ± 0.527
0.665GlnMet: 0.665 ± 0.439
0.0GlnAsn: 0.0 ± 0.0
1.331GlnPro: 1.331 ± 1.372
1.331GlnGln: 1.331 ± 0.668
4.657GlnArg: 4.657 ± 1.164
4.657GlnSer: 4.657 ± 3.168
0.665GlnThr: 0.665 ± 0.439
1.331GlnVal: 1.331 ± 0.482
0.665GlnTrp: 0.665 ± 0.54
1.996GlnTyr: 1.996 ± 0.77
0.0GlnXaa: 0.0 ± 0.0
Arg
4.657ArgAla: 4.657 ± 1.306
1.331ArgCys: 1.331 ± 0.879
4.657ArgAsp: 4.657 ± 2.305
2.661ArgGlu: 2.661 ± 0.971
1.331ArgPhe: 1.331 ± 0.694
2.661ArgGly: 2.661 ± 0.871
1.331ArgHis: 1.331 ± 0.944
2.661ArgIle: 2.661 ± 2.159
3.327ArgLys: 3.327 ± 1.222
7.984ArgLeu: 7.984 ± 1.558
1.331ArgMet: 1.331 ± 0.641
1.996ArgAsn: 1.996 ± 1.071
1.996ArgPro: 1.996 ± 0.748
1.996ArgGln: 1.996 ± 1.344
1.996ArgArg: 1.996 ± 0.782
3.327ArgSer: 3.327 ± 1.54
1.331ArgThr: 1.331 ± 1.05
3.327ArgVal: 3.327 ± 1.66
0.0ArgTrp: 0.0 ± 0.0
4.657ArgTyr: 4.657 ± 1.059
0.0ArgXaa: 0.0 ± 0.0
Ser
10.645SerAla: 10.645 ± 2.805
0.0SerCys: 0.0 ± 0.0
3.992SerAsp: 3.992 ± 1.44
2.661SerGlu: 2.661 ± 1.393
2.661SerPhe: 2.661 ± 1.262
8.649SerGly: 8.649 ± 2.847
2.661SerHis: 2.661 ± 1.262
3.327SerIle: 3.327 ± 1.74
5.988SerLys: 5.988 ± 1.506
5.988SerLeu: 5.988 ± 1.648
0.665SerMet: 0.665 ± 0.906
2.661SerAsn: 2.661 ± 1.126
6.653SerPro: 6.653 ± 1.848
3.992SerGln: 3.992 ± 0.823
5.323SerArg: 5.323 ± 1.986
3.992SerSer: 3.992 ± 1.851
5.988SerThr: 5.988 ± 2.05
1.996SerVal: 1.996 ± 0.987
2.661SerTrp: 2.661 ± 0.847
0.665SerTyr: 0.665 ± 0.54
0.0SerXaa: 0.0 ± 0.0
Thr
3.327ThrAla: 3.327 ± 1.475
0.0ThrCys: 0.0 ± 0.0
2.661ThrAsp: 2.661 ± 0.848
1.331ThrGlu: 1.331 ± 0.668
2.661ThrPhe: 2.661 ± 0.826
4.657ThrGly: 4.657 ± 2.109
3.327ThrHis: 3.327 ± 1.111
2.661ThrIle: 2.661 ± 0.871
0.0ThrLys: 0.0 ± 0.0
5.323ThrLeu: 5.323 ± 1.807
1.996ThrMet: 1.996 ± 1.222
1.331ThrAsn: 1.331 ± 0.668
3.327ThrPro: 3.327 ± 0.924
0.0ThrGln: 0.0 ± 0.0
2.661ThrArg: 2.661 ± 0.871
6.653ThrSer: 6.653 ± 1.471
3.327ThrThr: 3.327 ± 1.179
2.661ThrVal: 2.661 ± 1.057
0.0ThrTrp: 0.0 ± 0.0
3.327ThrTyr: 3.327 ± 0.727
0.0ThrXaa: 0.0 ± 0.0
Val
5.323ValAla: 5.323 ± 2.133
1.331ValCys: 1.331 ± 1.785
3.992ValAsp: 3.992 ± 2.321
2.661ValGlu: 2.661 ± 0.871
2.661ValPhe: 2.661 ± 0.831
4.657ValGly: 4.657 ± 1.042
1.996ValHis: 1.996 ± 1.892
3.327ValIle: 3.327 ± 1.706
3.327ValLys: 3.327 ± 1.875
3.992ValLeu: 3.992 ± 1.108
0.0ValMet: 0.0 ± 0.0
4.657ValAsn: 4.657 ± 1.132
1.996ValPro: 1.996 ± 0.748
1.996ValGln: 1.996 ± 1.318
3.327ValArg: 3.327 ± 1.913
1.331ValSer: 1.331 ± 0.861
1.996ValThr: 1.996 ± 1.223
0.0ValVal: 0.0 ± 0.0
0.665ValTrp: 0.665 ± 0.439
0.665ValTyr: 0.665 ± 0.439
0.0ValXaa: 0.0 ± 0.0
Trp
0.665TrpAla: 0.665 ± 0.54
1.331TrpCys: 1.331 ± 0.482
0.0TrpAsp: 0.0 ± 0.0
0.665TrpGlu: 0.665 ± 0.439
2.661TrpPhe: 2.661 ± 1.128
0.665TrpGly: 0.665 ± 0.54
0.665TrpHis: 0.665 ± 0.439
1.331TrpIle: 1.331 ± 1.079
0.665TrpLys: 0.665 ± 0.439
1.331TrpLeu: 1.331 ± 0.891
0.0TrpMet: 0.0 ± 0.0
0.665TrpAsn: 0.665 ± 0.439
0.0TrpPro: 0.0 ± 0.0
0.665TrpGln: 0.665 ± 0.631
0.665TrpArg: 0.665 ± 0.946
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.992TyrAla: 3.992 ± 1.813
0.665TyrCys: 0.665 ± 0.54
2.661TyrAsp: 2.661 ± 0.871
0.0TyrGlu: 0.0 ± 0.0
1.996TyrPhe: 1.996 ± 0.748
2.661TyrGly: 2.661 ± 1.15
1.331TyrHis: 1.331 ± 1.079
0.665TyrIle: 0.665 ± 0.54
1.331TyrLys: 1.331 ± 0.888
4.657TyrLeu: 4.657 ± 1.591
0.0TyrMet: 0.0 ± 0.0
1.996TyrAsn: 1.996 ± 1.318
1.996TyrPro: 1.996 ± 0.975
1.996TyrGln: 1.996 ± 0.939
0.665TyrArg: 0.665 ± 0.439
4.657TyrSer: 4.657 ± 1.836
1.996TyrThr: 1.996 ± 0.748
2.661TyrVal: 2.661 ± 0.678
0.665TyrTrp: 0.665 ± 0.439
1.331TyrTyr: 1.331 ± 0.482
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1504 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski