Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_228

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
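As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used by Proteome-pI), the Python below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, the choice not to count pairs across protein boundaries, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs overlap and never span two proteins (an assumption of this
    sketch); any residue outside the 20 standard ones is mapped to
    'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Expectation if all 400 standard dipeptides were equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

freqs = dipeptide_frequencies(["MAAK", "AAL"])  # toy sequences, not real data
overrepresented = sorted(p for p, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With only five pairs in the toy input, every observed dipeptide exceeds the 2.5‰ expectation; on a real proteome the same comparison separates over- from underrepresented pairs.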

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
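The unit conversion can be illustrated with a short Python sketch. The helper names are hypothetical; the AlaAla value is taken from the table on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 12.074  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.012074
percent = per_mille_to_percent(ala_ala)    # about 1.2074
uniform = per_mille_to_percent(2.5)        # 0.25, the equal-probability expectation in %
```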

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.074 ± 3.812
AlaCys: 1.42 ± 1.1
AlaAsp: 8.523 ± 2.608
AlaGlu: 7.102 ± 2.776
AlaPhe: 5.682 ± 1.771
AlaGly: 4.261 ± 1.713
AlaHis: 0.0 ± 0.0
AlaIle: 2.131 ± 1.185
AlaLys: 2.131 ± 0.628
AlaLeu: 9.233 ± 3.401
AlaMet: 2.131 ± 1.011
AlaAsn: 5.682 ± 2.41
AlaPro: 2.841 ± 2.342
AlaGln: 3.551 ± 2.351
AlaArg: 5.682 ± 1.405
AlaSer: 2.841 ± 1.292
AlaThr: 7.102 ± 2.796
AlaVal: 6.392 ± 1.753
AlaTrp: 0.0 ± 0.0
AlaTyr: 6.392 ± 1.789
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.841 ± 1.92
CysCys: 0.0 ± 0.0
CysAsp: 1.42 ± 1.981
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.42 ± 1.327
CysHis: 0.0 ± 0.0
CysIle: 0.71 ± 0.664
CysLys: 0.0 ± 0.0
CysLeu: 0.71 ± 0.47
CysMet: 0.71 ± 0.664
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 1.42 ± 1.367
CysArg: 0.71 ± 0.664
CysSer: 0.71 ± 0.991
CysThr: 2.131 ± 2.017
CysVal: 0.71 ± 0.664
CysTrp: 0.0 ± 0.0
CysTyr: 0.71 ± 0.664
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.972 ± 1.9
AspCys: 0.71 ± 0.991
AspAsp: 1.42 ± 0.905
AspGlu: 5.682 ± 1.766
AspPhe: 2.841 ± 0.956
AspGly: 2.841 ± 1.239
AspHis: 0.71 ± 0.47
AspIle: 2.131 ± 1.917
AspLys: 1.42 ± 1.037
AspLeu: 4.972 ± 1.417
AspMet: 1.42 ± 1.45
AspAsn: 2.131 ± 0.867
AspPro: 2.131 ± 1.917
AspGln: 3.551 ± 2.108
AspArg: 1.42 ± 0.646
AspSer: 4.261 ± 1.223
AspThr: 4.972 ± 1.346
AspVal: 2.131 ± 0.867
AspTrp: 0.0 ± 0.0
AspTyr: 4.261 ± 1.437
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.261 ± 2.188
GluCys: 1.42 ± 1.037
GluAsp: 0.0 ± 0.0
GluGlu: 0.0 ± 0.0
GluPhe: 3.551 ± 2.014
GluGly: 0.71 ± 0.664
GluHis: 0.71 ± 0.47
GluIle: 3.551 ± 1.578
GluLys: 4.972 ± 2.591
GluLeu: 2.131 ± 1.576
GluMet: 0.71 ± 0.991
GluAsn: 2.131 ± 1.078
GluPro: 1.42 ± 1.037
GluGln: 2.841 ± 1.292
GluArg: 4.972 ± 1.721
GluSer: 3.551 ± 1.036
GluThr: 2.841 ± 2.07
GluVal: 0.71 ± 0.725
GluTrp: 2.131 ± 1.689
GluTyr: 2.131 ± 0.969
GluXaa: 0.0 ± 0.0
Phe
PheAla: 7.102 ± 3.092
PheCys: 0.71 ± 1.027
PheAsp: 4.972 ± 1.624
PheGlu: 3.551 ± 2.411
PhePhe: 4.261 ± 1.632
PheGly: 4.261 ± 1.733
PheHis: 1.42 ± 0.834
PheIle: 1.42 ± 1.036
PheLys: 1.42 ± 1.1
PheLeu: 2.131 ± 1.491
PheMet: 2.131 ± 1.23
PheAsn: 2.841 ± 0.749
PhePro: 1.42 ± 0.684
PheGln: 2.131 ± 1.291
PheArg: 3.551 ± 1.937
PheSer: 4.972 ± 2.634
PheThr: 1.42 ± 0.94
PheVal: 3.551 ± 1.611
PheTrp: 0.71 ± 0.47
PheTyr: 2.131 ± 1.264
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.392 ± 2.399
GlyCys: 1.42 ± 1.095
GlyAsp: 3.551 ± 1.29
GlyGlu: 4.261 ± 1.632
GlyPhe: 2.131 ± 1.354
GlyGly: 6.392 ± 3.178
GlyHis: 0.0 ± 0.0
GlyIle: 4.972 ± 1.565
GlyLys: 2.131 ± 0.628
GlyLeu: 4.261 ± 2.236
GlyMet: 0.71 ± 0.991
GlyAsn: 4.972 ± 1.28
GlyPro: 5.682 ± 1.09
GlyGln: 2.131 ± 1.322
GlyArg: 3.551 ± 1.753
GlySer: 7.812 ± 2.341
GlyThr: 4.972 ± 1.46
GlyVal: 8.523 ± 1.116
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.551 ± 1.79
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.71 ± 0.664
HisCys: 0.0 ± 0.0
HisAsp: 2.131 ± 1.01
HisGlu: 0.71 ± 0.664
HisPhe: 2.131 ± 1.41
HisGly: 1.42 ± 0.94
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.71 ± 0.664
HisLeu: 0.71 ± 0.47
HisMet: 0.0 ± 0.0
HisAsn: 0.71 ± 0.725
HisPro: 1.42 ± 0.905
HisGln: 1.42 ± 0.646
HisArg: 0.71 ± 0.47
HisSer: 1.42 ± 0.684
HisThr: 0.0 ± 0.0
HisVal: 0.71 ± 0.664
HisTrp: 0.0 ± 0.0
HisTyr: 1.42 ± 1.037
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.261 ± 2.208
IleCys: 0.0 ± 0.0
IleAsp: 2.131 ± 1.291
IleGlu: 1.42 ± 0.834
IlePhe: 1.42 ± 1.472
IleGly: 2.841 ± 1.668
IleHis: 0.0 ± 0.0
IleIle: 0.71 ± 0.47
IleLys: 1.42 ± 0.646
IleLeu: 1.42 ± 1.036
IleMet: 0.0 ± 0.0
IleAsn: 2.841 ± 1.436
IlePro: 4.972 ± 2.274
IleGln: 2.841 ± 0.731
IleArg: 1.42 ± 1.327
IleSer: 2.841 ± 3.018
IleThr: 2.131 ± 0.969
IleVal: 2.841 ± 2.51
IleTrp: 1.42 ± 0.684
IleTyr: 0.71 ± 0.47
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.551 ± 1.193
LysCys: 0.0 ± 0.0
LysAsp: 1.42 ± 0.834
LysGlu: 1.42 ± 0.646
LysPhe: 1.42 ± 1.327
LysGly: 3.551 ± 0.914
LysHis: 0.0 ± 0.0
LysIle: 2.131 ± 1.185
LysLys: 4.261 ± 2.643
LysLeu: 3.551 ± 1.788
LysMet: 2.841 ± 1.634
LysAsn: 2.131 ± 2.176
LysPro: 1.42 ± 0.646
LysGln: 0.71 ± 0.664
LysArg: 6.392 ± 3.812
LysSer: 3.551 ± 0.592
LysThr: 1.42 ± 0.646
LysVal: 4.972 ± 2.34
LysTrp: 0.0 ± 0.0
LysTyr: 0.71 ± 0.664
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.682 ± 1.538
LeuCys: 0.71 ± 1.027
LeuAsp: 2.841 ± 0.933
LeuGlu: 0.71 ± 0.664
LeuPhe: 2.131 ± 1.264
LeuGly: 4.972 ± 2.733
LeuHis: 0.71 ± 0.47
LeuIle: 2.131 ± 0.867
LeuLys: 4.972 ± 2.403
LeuLeu: 3.551 ± 1.316
LeuMet: 3.551 ± 1.526
LeuAsn: 4.261 ± 2.179
LeuPro: 4.261 ± 1.755
LeuGln: 4.261 ± 0.939
LeuArg: 4.972 ± 1.556
LeuSer: 7.812 ± 4.207
LeuThr: 4.261 ± 2.053
LeuVal: 7.102 ± 2.388
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.131 ± 1.01
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.131 ± 0.994
MetCys: 0.0 ± 0.0
MetAsp: 1.42 ± 0.646
MetGlu: 0.0 ± 0.0
MetPhe: 1.42 ± 0.94
MetGly: 0.71 ± 0.47
MetHis: 1.42 ± 1.327
MetIle: 0.0 ± 0.0
MetLys: 3.551 ± 1.773
MetLeu: 1.42 ± 1.45
MetMet: 0.0 ± 0.0
MetAsn: 1.42 ± 0.646
MetPro: 1.42 ± 0.684
MetGln: 0.0 ± 0.0
MetArg: 2.131 ± 1.856
MetSer: 4.972 ± 2.566
MetThr: 3.551 ± 1.743
MetVal: 1.42 ± 0.922
MetTrp: 0.0 ± 0.0
MetTyr: 1.42 ± 0.646
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.131 ± 1.291
AsnCys: 0.71 ± 0.664
AsnAsp: 1.42 ± 1.981
AsnGlu: 0.71 ± 0.725
AsnPhe: 2.131 ± 0.867
AsnGly: 3.551 ± 1.036
AsnHis: 0.71 ± 0.991
AsnIle: 4.261 ± 2.546
AsnLys: 1.42 ± 0.684
AsnLeu: 5.682 ± 2.471
AsnMet: 2.131 ± 0.86
AsnAsn: 0.71 ± 0.47
AsnPro: 4.261 ± 1.437
AsnGln: 2.131 ± 1.41
AsnArg: 3.551 ± 1.908
AsnSer: 4.261 ± 2.582
AsnThr: 1.42 ± 1.037
AsnVal: 2.841 ± 1.732
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.131 ± 0.969
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.682 ± 2.604
ProCys: 1.42 ± 1.327
ProAsp: 2.841 ± 0.933
ProGlu: 2.841 ± 1.672
ProPhe: 2.841 ± 2.175
ProGly: 6.392 ± 0.999
ProHis: 0.71 ± 0.664
ProIle: 2.131 ± 0.628
ProLys: 2.841 ± 1.236
ProLeu: 2.841 ± 1.369
ProMet: 1.42 ± 0.646
ProAsn: 1.42 ± 0.684
ProPro: 1.42 ± 0.922
ProGln: 1.42 ± 0.94
ProArg: 0.71 ± 0.664
ProSer: 1.42 ± 0.684
ProThr: 5.682 ± 2.401
ProVal: 5.682 ± 2.14
ProTrp: 0.71 ± 0.47
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.551 ± 1.088
GlnCys: 1.42 ± 1.095
GlnAsp: 2.841 ± 1.487
GlnGlu: 3.551 ± 0.592
GlnPhe: 1.42 ± 0.646
GlnGly: 4.972 ± 1.589
GlnHis: 0.0 ± 0.0
GlnIle: 2.841 ± 1.171
GlnLys: 1.42 ± 0.684
GlnLeu: 2.131 ± 0.628
GlnMet: 0.71 ± 0.725
GlnAsn: 5.682 ± 1.654
GlnPro: 0.71 ± 0.47
GlnGln: 2.131 ± 1.291
GlnArg: 3.551 ± 1.455
GlnSer: 2.131 ± 0.867
GlnThr: 2.841 ± 0.731
GlnVal: 2.131 ± 0.867
GlnTrp: 1.42 ± 0.684
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.551 ± 1.657
ArgCys: 1.42 ± 0.684
ArgAsp: 3.551 ± 1.088
ArgGlu: 4.972 ± 2.361
ArgPhe: 6.392 ± 2.563
ArgGly: 2.841 ± 1.251
ArgHis: 2.131 ± 1.01
ArgIle: 1.42 ± 1.367
ArgLys: 2.131 ± 1.991
ArgLeu: 7.102 ± 1.195
ArgMet: 1.42 ± 0.94
ArgAsn: 0.71 ± 0.725
ArgPro: 3.551 ± 2.549
ArgGln: 4.261 ± 1.827
ArgArg: 4.261 ± 2.396
ArgSer: 6.392 ± 2.106
ArgThr: 0.71 ± 0.903
ArgVal: 0.71 ± 0.903
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.261 ± 0.762
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.233 ± 5.018
SerCys: 1.42 ± 1.037
SerAsp: 3.551 ± 1.108
SerGlu: 1.42 ± 1.237
SerPhe: 4.261 ± 3.088
SerGly: 9.943 ± 3.871
SerHis: 3.551 ± 1.057
SerIle: 3.551 ± 2.909
SerLys: 0.71 ± 0.664
SerLeu: 6.392 ± 1.805
SerMet: 2.131 ± 1.291
SerAsn: 2.131 ± 1.264
SerPro: 4.261 ± 2.236
SerGln: 3.551 ± 1.455
SerArg: 4.261 ± 1.813
SerSer: 9.943 ± 2.864
SerThr: 5.682 ± 1.882
SerVal: 4.972 ± 2.027
SerTrp: 0.71 ± 0.903
SerTyr: 2.131 ± 1.078
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.812 ± 1.559
ThrCys: 0.0 ± 0.0
ThrAsp: 2.841 ± 1.845
ThrGlu: 1.42 ± 0.94
ThrPhe: 4.972 ± 1.556
ThrGly: 8.523 ± 2.072
ThrHis: 0.71 ± 0.47
ThrIle: 2.841 ± 0.731
ThrLys: 4.972 ± 2.361
ThrLeu: 2.131 ± 0.969
ThrMet: 1.42 ± 0.94
ThrAsn: 2.131 ± 0.867
ThrPro: 1.42 ± 0.94
ThrGln: 1.42 ± 0.94
ThrArg: 4.972 ± 2.172
ThrSer: 7.102 ± 2.489
ThrThr: 2.841 ± 1.36
ThrVal: 2.131 ± 0.969
ThrTrp: 0.71 ± 0.47
ThrTyr: 1.42 ± 1.327
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.131 ± 1.291
ValCys: 1.42 ± 1.036
ValAsp: 4.972 ± 1.669
ValGlu: 2.841 ± 3.963
ValPhe: 4.261 ± 2.02
ValGly: 4.972 ± 3.364
ValHis: 1.42 ± 0.905
ValIle: 0.71 ± 0.47
ValLys: 2.841 ± 1.555
ValLeu: 7.102 ± 2.216
ValMet: 3.551 ± 1.184
ValAsn: 0.71 ± 0.991
ValPro: 5.682 ± 3.058
ValGln: 2.841 ± 1.555
ValArg: 3.551 ± 2.632
ValSer: 4.261 ± 1.096
ValThr: 4.261 ± 2.236
ValVal: 2.131 ± 1.264
ValTrp: 0.71 ± 0.47
ValTyr: 1.42 ± 0.94
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.71 ± 0.664
TrpCys: 0.0 ± 0.0
TrpAsp: 0.71 ± 0.903
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.71 ± 0.47
TrpGly: 0.0 ± 0.0
TrpHis: 1.42 ± 0.94
TrpIle: 0.0 ± 0.0
TrpLys: 0.71 ± 0.991
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.42 ± 0.646
TrpPro: 1.42 ± 0.94
TrpGln: 0.71 ± 0.664
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.42 ± 0.646
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 6.392 ± 2.106
TyrCys: 0.0 ± 0.0
TyrAsp: 1.42 ± 0.834
TyrGlu: 2.131 ± 0.867
TyrPhe: 2.841 ± 1.36
TyrGly: 2.841 ± 1.899
TyrHis: 0.71 ± 0.664
TyrIle: 0.0 ± 0.0
TyrLys: 1.42 ± 0.94
TyrLeu: 3.551 ± 1.45
TyrMet: 0.71 ± 0.725
TyrAsn: 2.131 ± 0.965
TyrPro: 0.71 ± 0.47
TyrGln: 2.131 ± 0.867
TyrArg: 1.42 ± 0.684
TyrSer: 3.551 ± 1.872
TyrThr: 2.131 ± 0.628
TyrVal: 2.131 ± 1.366
TyrTrp: 0.71 ± 0.47
TyrTyr: 2.131 ± 1.322
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1409 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
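One way to read this note is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. This is an illustration under stated assumptions, not the database's documented procedure: reading "x100" as 100 resamples, using the standard deviation as the error measure, the fixed seed, and the function names `pair_per_mille` and `bootstrap_error` are all choices made here.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MAAK", "AAL", "MKV"]  # toy sequences, not real data
err = bootstrap_error(toy, "AA")
```

In this sketch a dipeptide that never occurs has frequency 0 in every resample and therefore an error of 0, which matches the form of the 0.0 ± 0.0 entries in the table.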

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski