Amino acid dipepetide frequency for Capybara microvirus Cap1_SP_22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.673AlaAla: 4.673 ± 1.773
0.0AlaCys: 0.0 ± 0.0
3.505AlaAsp: 3.505 ± 3.003
2.336AlaGlu: 2.336 ± 0.549
1.168AlaPhe: 1.168 ± 0.889
2.921AlaGly: 2.921 ± 1.331
1.168AlaHis: 1.168 ± 0.756
2.336AlaIle: 2.336 ± 0.512
2.336AlaLys: 2.336 ± 1.809
6.425AlaLeu: 6.425 ± 1.775
0.584AlaMet: 0.584 ± 0.444
4.089AlaAsn: 4.089 ± 1.429
1.168AlaPro: 1.168 ± 0.455
3.505AlaGln: 3.505 ± 1.94
2.336AlaArg: 2.336 ± 0.512
6.425AlaSer: 6.425 ± 1.227
4.089AlaThr: 4.089 ± 1.999
2.336AlaVal: 2.336 ± 0.915
1.168AlaTrp: 1.168 ± 0.713
2.921AlaTyr: 2.921 ± 0.9
0.0AlaXaa: 0.0 ± 0.0
Cys
1.752CysAla: 1.752 ± 0.621
0.584CysCys: 0.584 ± 0.478
0.0CysAsp: 0.0 ± 0.0
1.168CysGlu: 1.168 ± 0.904
1.752CysPhe: 1.752 ± 0.842
0.584CysGly: 0.584 ± 0.69
0.584CysHis: 0.584 ± 0.478
1.168CysIle: 1.168 ± 0.957
1.168CysLys: 1.168 ± 0.713
2.921CysLeu: 2.921 ± 1.776
0.584CysMet: 0.584 ± 0.444
2.336CysAsn: 2.336 ± 1.231
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.584CysArg: 0.584 ± 0.444
0.584CysSer: 0.584 ± 0.444
0.584CysThr: 0.584 ± 0.478
0.0CysVal: 0.0 ± 0.0
0.584CysTrp: 0.584 ± 0.444
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.921AspAla: 2.921 ± 1.625
2.336AspCys: 2.336 ± 0.849
5.841AspAsp: 5.841 ± 2.493
4.089AspGlu: 4.089 ± 1.913
6.425AspPhe: 6.425 ± 1.846
3.505AspGly: 3.505 ± 1.545
1.752AspHis: 1.752 ± 0.762
2.921AspIle: 2.921 ± 1.043
5.257AspLys: 5.257 ± 0.659
7.009AspLeu: 7.009 ± 1.455
1.168AspMet: 1.168 ± 0.553
5.257AspAsn: 5.257 ± 2.306
1.752AspPro: 1.752 ± 1.435
0.0AspGln: 0.0 ± 0.0
0.584AspArg: 0.584 ± 0.829
2.921AspSer: 2.921 ± 0.773
4.673AspThr: 4.673 ± 1.168
4.089AspVal: 4.089 ± 1.503
1.168AspTrp: 1.168 ± 0.553
0.584AspTyr: 0.584 ± 0.574
0.0AspXaa: 0.0 ± 0.0
Glu
1.168GluAla: 1.168 ± 0.889
1.168GluCys: 1.168 ± 1.379
2.921GluAsp: 2.921 ± 0.936
1.752GluGlu: 1.752 ± 0.912
5.257GluPhe: 5.257 ± 1.404
1.752GluGly: 1.752 ± 0.821
1.752GluHis: 1.752 ± 1.064
1.752GluIle: 1.752 ± 0.794
1.168GluLys: 1.168 ± 0.593
5.257GluLeu: 5.257 ± 1.617
0.584GluMet: 0.584 ± 0.574
1.752GluAsn: 1.752 ± 1.721
0.584GluPro: 0.584 ± 0.829
2.336GluGln: 2.336 ± 1.107
2.921GluArg: 2.921 ± 1.298
3.505GluSer: 3.505 ± 0.446
2.921GluThr: 2.921 ± 1.043
2.336GluVal: 2.336 ± 0.819
0.0GluTrp: 0.0 ± 0.0
2.336GluTyr: 2.336 ± 0.549
0.0GluXaa: 0.0 ± 0.0
Phe
4.089PheAla: 4.089 ± 0.898
1.168PheCys: 1.168 ± 0.889
5.257PheAsp: 5.257 ± 1.369
0.584PheGlu: 0.584 ± 0.478
2.921PhePhe: 2.921 ± 1.094
5.257PheGly: 5.257 ± 1.521
1.168PheHis: 1.168 ± 0.756
2.336PheIle: 2.336 ± 1.161
5.257PheLys: 5.257 ± 1.395
4.089PheLeu: 4.089 ± 1.435
1.168PheMet: 1.168 ± 0.455
7.009PheAsn: 7.009 ± 1.407
1.168PhePro: 1.168 ± 0.455
0.0PheGln: 0.0 ± 0.0
0.0PheArg: 0.0 ± 0.0
5.257PheSer: 5.257 ± 1.658
2.921PheThr: 2.921 ± 1.585
3.505PheVal: 3.505 ± 1.612
0.0PheTrp: 0.0 ± 0.0
5.257PheTyr: 5.257 ± 1.369
0.0PheXaa: 0.0 ± 0.0
Gly
3.505GlyAla: 3.505 ± 2.224
0.584GlyCys: 0.584 ± 0.69
2.336GlyAsp: 2.336 ± 1.778
2.921GlyGlu: 2.921 ± 0.551
4.673GlyPhe: 4.673 ± 1.664
2.921GlyGly: 2.921 ± 2.223
0.0GlyHis: 0.0 ± 0.0
4.089GlyIle: 4.089 ± 0.739
5.841GlyLys: 5.841 ± 1.483
4.089GlyLeu: 4.089 ± 1.109
0.584GlyMet: 0.584 ± 0.574
3.505GlyAsn: 3.505 ± 1.112
0.584GlyPro: 0.584 ± 0.829
1.752GlyGln: 1.752 ± 0.824
0.0GlyArg: 0.0 ± 0.0
4.673GlySer: 4.673 ± 2.196
4.089GlyThr: 4.089 ± 0.855
2.336GlyVal: 2.336 ± 2.444
1.168GlyTrp: 1.168 ± 0.904
1.752GlyTyr: 1.752 ± 0.621
0.0GlyXaa: 0.0 ± 0.0
His
2.921HisAla: 2.921 ± 1.238
0.584HisCys: 0.584 ± 0.69
0.584HisAsp: 0.584 ± 0.444
1.168HisGlu: 1.168 ± 0.904
2.336HisPhe: 2.336 ± 1.231
0.584HisGly: 0.584 ± 0.574
0.584HisHis: 0.584 ± 0.478
1.168HisIle: 1.168 ± 0.455
1.752HisLys: 1.752 ± 1.435
2.336HisLeu: 2.336 ± 0.819
0.584HisMet: 0.584 ± 0.69
0.584HisAsn: 0.584 ± 0.478
0.584HisPro: 0.584 ± 0.478
0.584HisGln: 0.584 ± 0.444
0.584HisArg: 0.584 ± 0.69
1.752HisSer: 1.752 ± 0.821
1.168HisThr: 1.168 ± 0.455
1.168HisVal: 1.168 ± 0.756
0.0HisTrp: 0.0 ± 0.0
2.921HisTyr: 2.921 ± 1.356
0.0HisXaa: 0.0 ± 0.0
Ile
2.336IleAla: 2.336 ± 1.38
0.0IleCys: 0.0 ± 0.0
2.921IleAsp: 2.921 ± 1.298
0.584IleGlu: 0.584 ± 0.574
1.752IlePhe: 1.752 ± 0.331
3.505IleGly: 3.505 ± 1.562
1.752IleHis: 1.752 ± 0.621
1.752IleIle: 1.752 ± 0.842
3.505IleLys: 3.505 ± 1.523
6.425IleLeu: 6.425 ± 1.835
0.584IleMet: 0.584 ± 0.478
2.921IleAsn: 2.921 ± 0.776
2.921IlePro: 2.921 ± 1.238
2.336IleGln: 2.336 ± 1.42
6.425IleArg: 6.425 ± 1.219
4.089IleSer: 4.089 ± 0.897
4.673IleThr: 4.673 ± 1.264
1.168IleVal: 1.168 ± 0.957
1.168IleTrp: 1.168 ± 0.593
6.425IleTyr: 6.425 ± 2.083
0.0IleXaa: 0.0 ± 0.0
Lys
3.505LysAla: 3.505 ± 1.324
2.921LysCys: 2.921 ± 0.999
3.505LysAsp: 3.505 ± 1.132
5.257LysGlu: 5.257 ± 2.135
2.336LysPhe: 2.336 ± 1.009
4.089LysGly: 4.089 ± 2.077
0.0LysHis: 0.0 ± 0.0
4.089LysIle: 4.089 ± 1.376
2.336LysLys: 2.336 ± 1.33
2.921LysLeu: 2.921 ± 0.773
3.505LysMet: 3.505 ± 1.773
3.505LysAsn: 3.505 ± 1.642
0.584LysPro: 0.584 ± 0.444
4.089LysGln: 4.089 ± 1.406
3.505LysArg: 3.505 ± 1.008
4.673LysSer: 4.673 ± 1.098
5.841LysThr: 5.841 ± 1.9
5.257LysVal: 5.257 ± 2.207
1.168LysTrp: 1.168 ± 0.593
5.257LysTyr: 5.257 ± 1.878
0.0LysXaa: 0.0 ± 0.0
Leu
3.505LeuAla: 3.505 ± 2.037
1.168LeuCys: 1.168 ± 0.455
8.762LeuAsp: 8.762 ± 2.417
2.921LeuGlu: 2.921 ± 1.494
5.257LeuPhe: 5.257 ± 0.95
7.009LeuGly: 7.009 ± 3.044
3.505LeuHis: 3.505 ± 1.542
2.921LeuIle: 2.921 ± 1.018
4.673LeuLys: 4.673 ± 1.068
5.841LeuLeu: 5.841 ± 1.278
0.584LeuMet: 0.584 ± 0.574
8.762LeuAsn: 8.762 ± 2.67
4.089LeuPro: 4.089 ± 1.435
2.921LeuGln: 2.921 ± 0.839
3.505LeuArg: 3.505 ± 0.893
11.098LeuSer: 11.098 ± 1.757
6.425LeuThr: 6.425 ± 1.029
4.673LeuVal: 4.673 ± 0.714
0.0LeuTrp: 0.0 ± 0.0
2.921LeuTyr: 2.921 ± 0.858
0.0LeuXaa: 0.0 ± 0.0
Met
0.584MetAla: 0.584 ± 0.574
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.584MetGlu: 0.584 ± 0.574
0.584MetPhe: 0.584 ± 0.574
1.168MetGly: 1.168 ± 1.147
1.752MetHis: 1.752 ± 0.978
0.584MetIle: 0.584 ± 0.69
0.584MetLys: 0.584 ± 0.444
2.921MetLeu: 2.921 ± 1.356
0.0MetMet: 0.0 ± 0.0
1.168MetAsn: 1.168 ± 0.889
2.336MetPro: 2.336 ± 0.91
1.168MetGln: 1.168 ± 1.147
0.584MetArg: 0.584 ± 0.444
2.336MetSer: 2.336 ± 0.549
0.584MetThr: 0.584 ± 0.829
1.168MetVal: 1.168 ± 0.593
0.584MetTrp: 0.584 ± 0.574
0.584MetTyr: 0.584 ± 0.444
0.0MetXaa: 0.0 ± 0.0
Asn
5.257AsnAla: 5.257 ± 1.6
0.584AsnCys: 0.584 ± 0.574
7.593AsnAsp: 7.593 ± 3.397
2.336AsnGlu: 2.336 ± 1.58
1.752AsnPhe: 1.752 ± 0.762
3.505AsnGly: 3.505 ± 1.573
0.584AsnHis: 0.584 ± 0.478
5.257AsnIle: 5.257 ± 2.266
4.673AsnLys: 4.673 ± 1.931
9.93AsnLeu: 9.93 ± 4.83
0.584AsnMet: 0.584 ± 0.679
7.009AsnAsn: 7.009 ± 0.907
4.089AsnPro: 4.089 ± 1.728
2.336AsnGln: 2.336 ± 1.58
2.336AsnArg: 2.336 ± 0.91
7.009AsnSer: 7.009 ± 0.975
2.336AsnThr: 2.336 ± 1.057
4.673AsnVal: 4.673 ± 1.179
1.168AsnTrp: 1.168 ± 0.593
5.257AsnTyr: 5.257 ± 0.659
0.0AsnXaa: 0.0 ± 0.0
Pro
1.752ProAla: 1.752 ± 1.099
0.584ProCys: 0.584 ± 0.478
0.0ProAsp: 0.0 ± 0.0
1.168ProGlu: 1.168 ± 0.553
4.673ProPhe: 4.673 ± 1.398
2.921ProGly: 2.921 ± 0.851
1.168ProHis: 1.168 ± 0.756
2.336ProIle: 2.336 ± 0.91
2.336ProLys: 2.336 ± 0.771
6.425ProLeu: 6.425 ± 1.92
1.168ProMet: 1.168 ± 0.455
2.336ProAsn: 2.336 ± 1.574
1.168ProPro: 1.168 ± 1.658
0.584ProGln: 0.584 ± 0.574
1.168ProArg: 1.168 ± 0.593
0.584ProSer: 0.584 ± 0.478
4.089ProThr: 4.089 ± 0.853
1.168ProVal: 1.168 ± 0.901
0.584ProTrp: 0.584 ± 0.478
2.336ProTyr: 2.336 ± 0.91
0.0ProXaa: 0.0 ± 0.0
Gln
1.752GlnAla: 1.752 ± 1.036
0.584GlnCys: 0.584 ± 0.574
0.584GlnAsp: 0.584 ± 0.478
0.584GlnGlu: 0.584 ± 0.444
1.168GlnPhe: 1.168 ± 0.889
2.336GlnGly: 2.336 ± 1.604
0.584GlnHis: 0.584 ± 0.478
2.921GlnIle: 2.921 ± 0.776
0.584GlnLys: 0.584 ± 0.829
4.089GlnLeu: 4.089 ± 0.898
0.584GlnMet: 0.584 ± 0.574
2.921GlnAsn: 2.921 ± 2.407
2.921GlnPro: 2.921 ± 0.9
3.505GlnGln: 3.505 ± 2.072
1.168GlnArg: 1.168 ± 0.593
5.257GlnSer: 5.257 ± 2.408
1.752GlnThr: 1.752 ± 0.938
1.168GlnVal: 1.168 ± 0.455
1.168GlnTrp: 1.168 ± 0.904
1.752GlnTyr: 1.752 ± 0.621
0.0GlnXaa: 0.0 ± 0.0
Arg
1.752ArgAla: 1.752 ± 1.036
0.584ArgCys: 0.584 ± 0.478
3.505ArgAsp: 3.505 ± 0.802
1.168ArgGlu: 1.168 ± 0.904
1.752ArgPhe: 1.752 ± 1.333
1.168ArgGly: 1.168 ± 0.455
1.168ArgHis: 1.168 ± 0.756
2.921ArgIle: 2.921 ± 1.559
5.841ArgLys: 5.841 ± 2.804
4.673ArgLeu: 4.673 ± 2.67
0.584ArgMet: 0.584 ± 0.574
1.752ArgAsn: 1.752 ± 0.821
1.168ArgPro: 1.168 ± 0.889
1.752ArgGln: 1.752 ± 0.762
0.584ArgArg: 0.584 ± 0.444
2.921ArgSer: 2.921 ± 1.78
0.584ArgThr: 0.584 ± 0.444
0.584ArgVal: 0.584 ± 0.69
0.0ArgTrp: 0.0 ± 0.0
2.336ArgTyr: 2.336 ± 0.849
0.0ArgXaa: 0.0 ± 0.0
Ser
5.257SerAla: 5.257 ± 1.183
1.168SerCys: 1.168 ± 0.713
4.089SerAsp: 4.089 ± 1.348
3.505SerGlu: 3.505 ± 1.112
5.841SerPhe: 5.841 ± 1.611
1.168SerGly: 1.168 ± 0.713
1.752SerHis: 1.752 ± 0.762
5.257SerIle: 5.257 ± 0.48
6.425SerLys: 6.425 ± 2.885
5.257SerLeu: 5.257 ± 0.992
1.752SerMet: 1.752 ± 1.042
7.593SerAsn: 7.593 ± 1.408
4.673SerPro: 4.673 ± 1.203
3.505SerGln: 3.505 ± 0.964
3.505SerArg: 3.505 ± 1.545
8.178SerSer: 8.178 ± 1.613
2.921SerThr: 2.921 ± 1.2
2.921SerVal: 2.921 ± 1.331
0.0SerTrp: 0.0 ± 0.0
7.009SerTyr: 7.009 ± 1.54
0.0SerXaa: 0.0 ± 0.0
Thr
5.257ThrAla: 5.257 ± 2.642
0.584ThrCys: 0.584 ± 0.574
2.921ThrAsp: 2.921 ± 1.585
5.841ThrGlu: 5.841 ± 1.552
4.673ThrPhe: 4.673 ± 1.635
1.168ThrGly: 1.168 ± 0.553
1.168ThrHis: 1.168 ± 0.593
4.089ThrIle: 4.089 ± 1.203
5.257ThrLys: 5.257 ± 1.061
4.089ThrLeu: 4.089 ± 1.149
1.168ThrMet: 1.168 ± 0.866
4.673ThrAsn: 4.673 ± 1.362
4.673ThrPro: 4.673 ± 0.538
3.505ThrGln: 3.505 ± 1.647
1.752ThrArg: 1.752 ± 0.331
2.921ThrSer: 2.921 ± 1.725
6.425ThrThr: 6.425 ± 2.405
0.584ThrVal: 0.584 ± 0.444
0.0ThrTrp: 0.0 ± 0.0
2.336ThrTyr: 2.336 ± 0.91
0.0ThrXaa: 0.0 ± 0.0
Val
0.584ValAla: 0.584 ± 0.444
1.752ValCys: 1.752 ± 0.621
5.257ValAsp: 5.257 ± 2.532
1.752ValGlu: 1.752 ± 1.181
1.168ValPhe: 1.168 ± 0.455
2.921ValGly: 2.921 ± 0.551
0.0ValHis: 0.0 ± 0.0
3.505ValIle: 3.505 ± 0.82
4.089ValLys: 4.089 ± 0.602
3.505ValLeu: 3.505 ± 0.896
1.752ValMet: 1.752 ± 0.912
4.673ValAsn: 4.673 ± 1.805
2.921ValPro: 2.921 ± 0.858
0.584ValGln: 0.584 ± 0.69
2.336ValArg: 2.336 ± 1.24
4.673ValSer: 4.673 ± 1.398
4.089ValThr: 4.089 ± 0.955
1.752ValVal: 1.752 ± 1.636
0.0ValTrp: 0.0 ± 0.0
1.168ValTyr: 1.168 ± 0.889
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.584TrpGlu: 0.584 ± 0.574
1.752TrpPhe: 1.752 ± 1.036
0.0TrpGly: 0.0 ± 0.0
1.168TrpHis: 1.168 ± 0.455
0.584TrpIle: 0.584 ± 0.69
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.584TrpMet: 0.584 ± 0.777
0.584TrpAsn: 0.584 ± 0.574
0.0TrpPro: 0.0 ± 0.0
1.752TrpGln: 1.752 ± 0.821
0.584TrpArg: 0.584 ± 0.69
0.584TrpSer: 0.584 ± 0.574
0.584TrpThr: 0.584 ± 0.574
0.584TrpVal: 0.584 ± 0.478
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.921TyrAla: 2.921 ± 1.238
0.584TyrCys: 0.584 ± 0.829
4.673TyrAsp: 4.673 ± 1.321
2.921TyrGlu: 2.921 ± 0.839
1.168TyrPhe: 1.168 ± 0.455
2.921TyrGly: 2.921 ± 0.551
2.336TyrHis: 2.336 ± 1.161
5.257TyrIle: 5.257 ± 2.397
5.257TyrLys: 5.257 ± 1.686
2.336TyrLeu: 2.336 ± 1.913
0.584TyrMet: 0.584 ± 0.444
5.841TyrAsn: 5.841 ± 0.996
1.168TyrPro: 1.168 ± 0.713
1.168TyrGln: 1.168 ± 0.593
2.336TyrArg: 2.336 ± 1.33
2.336TyrSer: 2.336 ± 1.161
2.336TyrThr: 2.336 ± 1.326
7.009TyrVal: 7.009 ± 2.091
0.0TyrTrp: 0.0 ± 0.0
4.089TyrTyr: 4.089 ± 2.077
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1713 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski