Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_259

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
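A minimal sketch of how such a frequency could be computed is given below. It assumes one-letter amino acid codes, counts overlapping residue pairs within each protein (not across protein boundaries), and reports values in per mille, the unit used in the table. The function name `dipeptide_permille` is hypothetical, and the database's exact counting procedure is not described on this page.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each sequence only,
    and frequencies are normalised by the total number of pairs observed.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not taken from the proteome on this page.
    print(dipeptide_permille(["AAC", "CA"]))
```

Dipeptides that never occur are simply absent from the returned dictionary; they correspond to the 0.0 entries in the table.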

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
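The arithmetic behind these numbers can be checked with a short sketch. The helper names `expected_uniform` and `classify` are hypothetical, and comparing each value against the uniform expectation is an assumption about how the colouring is decided; the page does not state the exact rule.

```python
def expected_uniform(n_residues=20):
    """Return (number of dipeptides, expected %, expected per mille)
    if every ordered residue pair were equally likely."""
    n_dipeptides = n_residues ** 2
    return n_dipeptides, 100 / n_dipeptides, 1000 / n_dipeptides


def classify(value_permille, expected_permille=2.5):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    if value_permille > expected_permille:
        return "over"   # would be shown in red
    if value_permille < expected_permille:
        return "under"  # would be shown in blue
    return "as expected"


if __name__ == "__main__":
    print(expected_uniform(20))  # 400 dipeptides, 0.25 %, 2.5 per mille
    print(expected_uniform(21)[0])  # 441 when Xaa is included
    print(classify(12.539), classify(0.0))  # AlaAla and AlaCys from the table
```

Under this assumption, a table value above 2.5‰ counts as overrepresented and a value below it as underrepresented.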

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
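As a small illustration of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The function names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille / 1000


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10


if __name__ == "__main__":
    # AlaAla is listed as 12.539 per mille in the table.
    print(f"{permille_to_fraction(12.539):.6f}")  # 0.012539
    print(f"{permille_to_percent(12.539):.4f}")   # 1.2539
```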

For more information see sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± bootstrap error), grouped by first residue; second residues follow the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa.
Ala
AlaAla: 12.539 ± 7.555
AlaCys: 0.0 ± 0.0
AlaAsp: 7.053 ± 1.366
AlaGlu: 3.135 ± 1.545
AlaPhe: 3.135 ± 1.091
AlaGly: 2.351 ± 1.465
AlaHis: 0.784 ± 0.762
AlaIle: 1.567 ± 0.885
AlaLys: 4.702 ± 1.791
AlaLeu: 7.053 ± 2.898
AlaMet: 0.784 ± 0.475
AlaAsn: 2.351 ± 1.98
AlaPro: 2.351 ± 0.899
AlaGln: 6.27 ± 4.696
AlaArg: 2.351 ± 1.675
AlaSer: 10.972 ± 3.263
AlaThr: 5.486 ± 1.197
AlaVal: 2.351 ± 1.134
AlaTrp: 1.567 ± 1.656
AlaTyr: 3.135 ± 0.761
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.784 ± 0.762
CysGlu: 0.0 ± 0.0
CysPhe: 0.784 ± 0.762
CysGly: 1.567 ± 1.524
CysHis: 0.0 ± 0.0
CysIle: 0.784 ± 0.475
CysLys: 1.567 ± 1.524
CysLeu: 0.784 ± 0.762
CysMet: 0.0 ± 0.0
CysAsn: 0.784 ± 1.116
CysPro: 0.0 ± 0.0
CysGln: 0.784 ± 0.475
CysArg: 2.351 ± 1.792
CysSer: 1.567 ± 0.685
CysThr: 0.0 ± 0.0
CysVal: 0.784 ± 0.475
CysTrp: 0.784 ± 0.475
CysTyr: 0.784 ± 0.762
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.918 ± 2.147
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 3.135 ± 1.901
AspPhe: 6.27 ± 1.728
AspGly: 2.351 ± 1.556
AspHis: 0.784 ± 0.762
AspIle: 3.918 ± 0.872
AspLys: 5.486 ± 1.924
AspLeu: 7.053 ± 1.835
AspMet: 0.784 ± 0.719
AspAsn: 4.702 ± 2.695
AspPro: 3.135 ± 1.219
AspGln: 1.567 ± 0.951
AspArg: 6.27 ± 1.014
AspSer: 3.918 ± 1.229
AspThr: 4.702 ± 0.735
AspVal: 3.918 ± 1.14
AspTrp: 1.567 ± 0.685
AspTyr: 4.702 ± 1.093
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.837 ± 4.054
GluCys: 1.567 ± 0.685
GluAsp: 5.486 ± 2.432
GluGlu: 0.784 ± 0.475
GluPhe: 2.351 ± 1.209
GluGly: 3.918 ± 1.63
GluHis: 1.567 ± 0.885
GluIle: 1.567 ± 0.951
GluLys: 2.351 ± 1.13
GluLeu: 3.918 ± 1.28
GluMet: 0.784 ± 0.475
GluAsn: 2.351 ± 1.134
GluPro: 1.567 ± 0.951
GluGln: 1.567 ± 1.122
GluArg: 1.567 ± 0.951
GluSer: 3.135 ± 1.328
GluThr: 2.351 ± 0.876
GluVal: 2.351 ± 0.671
GluTrp: 0.0 ± 0.0
GluTyr: 0.784 ± 0.828
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.135 ± 1.249
PheCys: 1.567 ± 1.524
PheAsp: 3.918 ± 1.526
PheGlu: 3.135 ± 1.585
PhePhe: 2.351 ± 1.586
PheGly: 6.27 ± 1.865
PheHis: 0.784 ± 0.762
PheIle: 1.567 ± 0.951
PheLys: 2.351 ± 1.17
PheLeu: 8.621 ± 4.172
PheMet: 1.567 ± 0.825
PheAsn: 2.351 ± 0.899
PhePro: 2.351 ± 0.671
PheGln: 0.0 ± 0.0
PheArg: 3.135 ± 1.091
PheSer: 7.053 ± 0.833
PheThr: 3.135 ± 1.219
PheVal: 2.351 ± 1.093
PheTrp: 0.784 ± 0.475
PheTyr: 2.351 ± 1.368
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.784 ± 0.475
GlyCys: 0.0 ± 0.0
GlyAsp: 8.621 ± 2.036
GlyGlu: 2.351 ± 0.876
GlyPhe: 4.702 ± 1.757
GlyGly: 2.351 ± 1.368
GlyHis: 0.784 ± 0.475
GlyIle: 3.918 ± 1.073
GlyLys: 5.486 ± 1.309
GlyLeu: 8.621 ± 3.045
GlyMet: 0.0 ± 0.0
GlyAsn: 5.486 ± 1.669
GlyPro: 0.0 ± 0.0
GlyGln: 3.918 ± 0.578
GlyArg: 3.918 ± 1.44
GlySer: 0.784 ± 0.475
GlyThr: 0.784 ± 0.475
GlyVal: 6.27 ± 2.394
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.918 ± 1.526
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.784 ± 0.475
HisCys: 0.784 ± 0.762
HisAsp: 0.0 ± 0.0
HisGlu: 1.567 ± 1.711
HisPhe: 3.135 ± 1.265
HisGly: 2.351 ± 0.876
HisHis: 0.784 ± 0.762
HisIle: 0.784 ± 1.116
HisLys: 0.0 ± 0.0
HisLeu: 2.351 ± 1.368
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.784 ± 0.475
HisArg: 0.784 ± 0.762
HisSer: 2.351 ± 1.17
HisThr: 2.351 ± 0.87
HisVal: 0.784 ± 0.475
HisTrp: 0.0 ± 0.0
HisTyr: 3.135 ± 1.249
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.702 ± 1.567
IleCys: 0.0 ± 0.0
IleAsp: 1.567 ± 0.951
IleGlu: 3.918 ± 1.568
IlePhe: 3.918 ± 1.241
IleGly: 3.918 ± 1.52
IleHis: 0.784 ± 0.475
IleIle: 2.351 ± 1.093
IleLys: 1.567 ± 1.284
IleLeu: 0.784 ± 0.475
IleMet: 0.0 ± 0.0
IleAsn: 4.702 ± 1.593
IlePro: 3.135 ± 1.499
IleGln: 2.351 ± 0.671
IleArg: 3.918 ± 1.192
IleSer: 3.135 ± 1.328
IleThr: 3.918 ± 1.687
IleVal: 1.567 ± 0.685
IleTrp: 1.567 ± 0.685
IleTyr: 2.351 ± 1.426
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.053 ± 3.979
LysCys: 0.0 ± 0.0
LysAsp: 4.702 ± 0.873
LysGlu: 0.784 ± 1.116
LysPhe: 2.351 ± 1.209
LysGly: 2.351 ± 0.671
LysHis: 1.567 ± 0.986
LysIle: 6.27 ± 1.014
LysLys: 5.486 ± 3.614
LysLeu: 8.621 ± 2.065
LysMet: 0.0 ± 0.0
LysAsn: 3.135 ± 1.81
LysPro: 0.784 ± 0.856
LysGln: 0.0 ± 0.0
LysArg: 2.351 ± 1.368
LysSer: 2.351 ± 1.134
LysThr: 2.351 ± 0.948
LysVal: 1.567 ± 1.656
LysTrp: 0.784 ± 0.475
LysTyr: 4.702 ± 2.88
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.135 ± 1.415
LeuCys: 0.0 ± 0.0
LeuAsp: 5.486 ± 1.924
LeuGlu: 6.27 ± 1.637
LeuPhe: 7.053 ± 1.892
LeuGly: 4.702 ± 1.262
LeuHis: 0.784 ± 0.475
LeuIle: 3.135 ± 2.352
LeuLys: 6.27 ± 3.134
LeuLeu: 3.918 ± 1.958
LeuMet: 0.784 ± 1.116
LeuAsn: 6.27 ± 1.744
LeuPro: 4.702 ± 1.799
LeuGln: 2.351 ± 1.426
LeuArg: 8.621 ± 2.983
LeuSer: 7.837 ± 2.38
LeuThr: 5.486 ± 1.197
LeuVal: 8.621 ± 1.247
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.351 ± 1.426
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 1.567 ± 0.685
MetGly: 0.784 ± 0.828
MetHis: 0.0 ± 0.0
MetIle: 0.784 ± 0.475
MetLys: 1.567 ± 0.951
MetLeu: 2.351 ± 1.17
MetMet: 0.0 ± 0.0
MetAsn: 1.567 ± 1.405
MetPro: 0.784 ± 0.475
MetGln: 0.784 ± 0.828
MetArg: 0.0 ± 0.0
MetSer: 1.567 ± 0.986
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.784 ± 0.475
MetTyr: 1.567 ± 0.951
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.486 ± 4.947
AsnCys: 1.567 ± 0.685
AsnAsp: 3.918 ± 2.492
AsnGlu: 1.567 ± 0.885
AsnPhe: 3.918 ± 1.073
AsnGly: 5.486 ± 1.197
AsnHis: 3.135 ± 1.136
AsnIle: 3.918 ± 1.291
AsnLys: 1.567 ± 1.062
AsnLeu: 4.702 ± 2.931
AsnMet: 0.0 ± 0.0
AsnAsn: 3.918 ± 1.149
AsnPro: 3.135 ± 1.343
AsnGln: 0.784 ± 0.475
AsnArg: 2.351 ± 1.297
AsnSer: 3.918 ± 1.52
AsnThr: 4.702 ± 1.308
AsnVal: 3.918 ± 2.414
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.567 ± 1.524
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.567 ± 0.685
ProCys: 0.784 ± 0.762
ProAsp: 1.567 ± 0.951
ProGlu: 3.135 ± 1.901
ProPhe: 0.784 ± 0.828
ProGly: 0.784 ± 0.475
ProHis: 0.784 ± 0.475
ProIle: 3.135 ± 1.901
ProLys: 3.135 ± 1.369
ProLeu: 1.567 ± 0.685
ProMet: 2.351 ± 0.876
ProAsn: 1.567 ± 0.708
ProPro: 0.0 ± 0.0
ProGln: 0.784 ± 0.475
ProArg: 2.351 ± 0.876
ProSer: 3.918 ± 1.566
ProThr: 0.784 ± 0.475
ProVal: 1.567 ± 0.708
ProTrp: 0.784 ± 0.828
ProTyr: 1.567 ± 0.685
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.351 ± 1.465
GlnCys: 0.784 ± 0.762
GlnAsp: 2.351 ± 0.899
GlnGlu: 1.567 ± 0.685
GlnPhe: 0.0 ± 0.0
GlnGly: 3.918 ± 1.579
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 2.351 ± 1.465
GlnLeu: 3.918 ± 1.637
GlnMet: 0.784 ± 1.17
GlnAsn: 5.486 ± 3.734
GlnPro: 0.784 ± 0.475
GlnGln: 2.351 ± 1.134
GlnArg: 5.486 ± 1.795
GlnSer: 3.135 ± 1.357
GlnThr: 0.784 ± 0.475
GlnVal: 1.567 ± 1.022
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.567 ± 0.951
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.27 ± 1.077
ArgCys: 2.351 ± 1.093
ArgAsp: 5.486 ± 2.188
ArgGlu: 4.702 ± 2.032
ArgPhe: 5.486 ± 1.517
ArgGly: 1.567 ± 1.524
ArgHis: 1.567 ± 1.062
ArgIle: 3.135 ± 1.265
ArgLys: 3.135 ± 2.174
ArgLeu: 10.188 ± 1.666
ArgMet: 0.784 ± 0.475
ArgAsn: 0.784 ± 0.856
ArgPro: 0.0 ± 0.0
ArgGln: 3.135 ± 1.585
ArgArg: 2.351 ± 0.87
ArgSer: 3.918 ± 1.582
ArgThr: 2.351 ± 0.87
ArgVal: 2.351 ± 1.093
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.567 ± 0.685
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.837 ± 2.068
SerCys: 2.351 ± 1.368
SerAsp: 6.27 ± 1.744
SerGlu: 7.837 ± 2.967
SerPhe: 1.567 ± 1.062
SerGly: 8.621 ± 1.366
SerHis: 1.567 ± 1.022
SerIle: 1.567 ± 0.708
SerLys: 3.918 ± 1.52
SerLeu: 6.27 ± 2.268
SerMet: 1.567 ± 1.342
SerAsn: 3.135 ± 3.313
SerPro: 3.918 ± 1.52
SerGln: 4.702 ± 0.822
SerArg: 3.135 ± 1.376
SerSer: 8.621 ± 2.301
SerThr: 3.918 ± 1.867
SerVal: 4.702 ± 1.69
SerTrp: 0.0 ± 0.0
SerTyr: 3.135 ± 2.174
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.486 ± 1.669
ThrCys: 0.0 ± 0.0
ThrAsp: 3.135 ± 0.996
ThrGlu: 2.351 ± 1.134
ThrPhe: 2.351 ± 0.899
ThrGly: 4.702 ± 1.752
ThrHis: 0.784 ± 0.475
ThrIle: 1.567 ± 0.951
ThrLys: 3.135 ± 0.761
ThrLeu: 1.567 ± 1.284
ThrMet: 1.567 ± 0.708
ThrAsn: 2.351 ± 1.465
ThrPro: 2.351 ± 0.899
ThrGln: 2.351 ± 0.876
ThrArg: 5.486 ± 1.659
ThrSer: 3.135 ± 1.265
ThrThr: 5.486 ± 2.203
ThrVal: 3.135 ± 0.961
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.567 ± 0.685
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.918 ± 1.332
ValCys: 1.567 ± 1.284
ValAsp: 5.486 ± 1.362
ValGlu: 0.784 ± 0.475
ValPhe: 3.135 ± 2.568
ValGly: 1.567 ± 0.885
ValHis: 1.567 ± 0.885
ValIle: 3.918 ± 1.793
ValLys: 1.567 ± 1.022
ValLeu: 3.135 ± 1.265
ValMet: 0.784 ± 0.475
ValAsn: 5.486 ± 2.025
ValPro: 0.784 ± 0.475
ValGln: 3.135 ± 2.174
ValArg: 3.135 ± 0.961
ValSer: 5.486 ± 0.841
ValThr: 0.784 ± 0.475
ValVal: 2.351 ± 1.134
ValTrp: 0.784 ± 0.475
ValTyr: 1.567 ± 0.986
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.567 ± 0.986
TrpGly: 0.784 ± 0.475
TrpHis: 2.351 ± 0.671
TrpIle: 0.0 ± 0.0
TrpLys: 0.784 ± 0.475
TrpLeu: 0.784 ± 0.475
TrpMet: 0.0 ± 0.0
TrpAsn: 1.567 ± 0.708
TrpPro: 1.567 ± 0.951
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.784 ± 0.475
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.135 ± 1.265
TyrCys: 0.784 ± 0.475
TyrAsp: 2.351 ± 1.724
TyrGlu: 0.784 ± 0.475
TyrPhe: 2.351 ± 1.368
TyrGly: 2.351 ± 2.286
TyrHis: 2.351 ± 1.368
TyrIle: 6.27 ± 1.561
TyrLys: 0.784 ± 0.762
TyrLeu: 1.567 ± 0.685
TyrMet: 0.784 ± 0.475
TyrAsn: 1.567 ± 0.986
TyrPro: 1.567 ± 0.951
TyrGln: 2.351 ± 1.17
TyrArg: 1.567 ± 0.685
TyrSer: 7.053 ± 1.648
TyrThr: 3.135 ± 0.616
TyrVal: 0.784 ± 0.762
TyrTrp: 0.784 ± 0.475
TyrTyr: 3.135 ± 1.376
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1277 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
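One way such a protein-level bootstrap could look is sketched below. It is an assumption, not the database's documented procedure: whole proteins are resampled with replacement, dipeptide frequencies are recomputed for each replicate, and the sample standard deviation across replicates is reported as the error. The names `dipeptide_permille` and `bootstrap_error` are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_permille(sequences):
    """Dipeptide frequencies in per mille, counting pairs within each protein."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(sequences, n_rounds=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Assumption: the error is the standard deviation of the frequency
    across n_rounds replicates (n_rounds must be at least 2).
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(sequences) for _ in sequences]
        replicates.append(dipeptide_permille(resample))
    # A dipeptide missing from a replicate has frequency 0 in that replicate.
    keys = set().union(*replicates)
    return {dp: stdev([r.get(dp, 0.0) for r in replicates]) for dp in keys}


if __name__ == "__main__":
    # Toy sequences, not the proteome described on this page.
    toy = ["MKALLA", "MAAKLS", "MSLLKA"]
    errors = bootstrap_error(toy)
    for dp in sorted(errors):
        print(dp, round(errors[dp], 3))
```

With only a handful of proteins, as here, replicates differ a lot from one another, which is consistent with the fairly large errors relative to the values in the table.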

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski