Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_386

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
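As an illustration, the counting can be sketched in Python as below. This is a minimal sketch, not the database's own code: the function name dipeptide_permille is hypothetical, sequences are assumed to be given as one-letter strings, pairs are assumed not to span protein boundaries, and any letter outside the 20 standard amino acids is mapped to X (Xaa).

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source page): overlapping pairs are
    counted within each protein only, and non-standard letters become 'X'.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count every overlapping pair of adjacent residues.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

freqs = dipeptide_permille(["AAC", "ACB"])
```

Here the second toy sequence contains the non-standard letter B, so it is counted as the pairs AC and CX.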

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
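The uniform expectation and the over/under comparison can be sketched as follows. The page does not state the exact rule used for the red and blue marking, so the simple comparison against the uniform expectation is an assumption, and the names expected_permille and classify are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille.

    20 letters give 400 dipeptides; 21 (with Xaa) give 441.
    """
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a table value relative to the uniform expectation.

    Assumption: a plain greater/less comparison, with no tolerance band.
    """
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

# Two values taken from the table: LeuLeu (10.089) and AlaCys (0.593).
labels = [classify(10.089), classify(0.593)]
```

With 20 amino acids the expectation is 2.5‰, so LeuLeu is labelled "over" and AlaCys "under".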

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue (Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa); entries: frequency ± error (‰)

Ala
AlaAla: 1.78 ± 1.035
AlaCys: 0.593 ± 0.576
AlaAsp: 2.374 ± 0.841
AlaGlu: 3.561 ± 3.458
AlaPhe: 1.187 ± 0.45
AlaGly: 2.374 ± 1.177
AlaHis: 1.78 ± 1.289
AlaIle: 1.78 ± 1.083
AlaLys: 2.374 ± 1.632
AlaLeu: 3.561 ± 0.699
AlaMet: 0.593 ± 0.576
AlaAsn: 2.967 ± 0.563
AlaPro: 0.0 ± 0.0
AlaGln: 4.748 ± 2.287
AlaArg: 2.967 ± 0.563
AlaSer: 4.154 ± 2.115
AlaThr: 1.187 ± 0.562
AlaVal: 2.967 ± 1.159
AlaTrp: 1.187 ± 0.45
AlaTyr: 3.561 ± 1.042
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.593 ± 0.43
CysGlu: 0.593 ± 0.475
CysPhe: 1.78 ± 1.17
CysGly: 0.593 ± 0.905
CysHis: 0.0 ± 0.0
CysIle: 1.187 ± 0.562
CysLys: 0.593 ± 0.475
CysLeu: 1.187 ± 0.45
CysMet: 0.0 ± 0.0
CysAsn: 1.187 ± 0.932
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.593 ± 0.475
CysThr: 0.593 ± 0.475
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.593 ± 0.43
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.187 ± 1.153
AspCys: 1.187 ± 0.45
AspAsp: 1.187 ± 0.951
AspGlu: 2.967 ± 0.957
AspPhe: 4.154 ± 1.581
AspGly: 1.78 ± 0.35
AspHis: 1.187 ± 0.86
AspIle: 5.341 ± 1.447
AspLys: 4.154 ± 1.778
AspLeu: 2.374 ± 0.899
AspMet: 0.593 ± 0.43
AspAsn: 2.967 ± 0.715
AspPro: 2.374 ± 0.545
AspGln: 1.187 ± 0.562
AspArg: 1.78 ± 0.867
AspSer: 4.748 ± 1.766
AspThr: 3.561 ± 0.595
AspVal: 3.561 ± 1.507
AspTrp: 1.78 ± 0.35
AspTyr: 7.122 ± 1.658
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.967 ± 1.71
GluCys: 0.0 ± 0.0
GluAsp: 1.187 ± 0.951
GluGlu: 1.187 ± 0.562
GluPhe: 4.154 ± 1.308
GluGly: 3.561 ± 1.639
GluHis: 0.593 ± 0.905
GluIle: 1.78 ± 1.418
GluLys: 4.748 ± 1.346
GluLeu: 4.748 ± 1.602
GluMet: 2.374 ± 1.177
GluAsn: 1.187 ± 0.895
GluPro: 0.593 ± 0.43
GluGln: 3.561 ± 1.988
GluArg: 1.187 ± 0.86
GluSer: 8.309 ± 1.706
GluThr: 3.561 ± 1.472
GluVal: 3.561 ± 1.063
GluTrp: 0.0 ± 0.0
GluTyr: 2.967 ± 1.015
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.561 ± 0.888
PheCys: 1.187 ± 0.932
PheAsp: 5.341 ± 1.892
PheGlu: 2.374 ± 0.978
PhePhe: 5.935 ± 1.986
PheGly: 3.561 ± 1.325
PheHis: 0.0 ± 0.0
PheIle: 3.561 ± 1.083
PheLys: 2.374 ± 0.978
PheLeu: 4.748 ± 1.183
PheMet: 0.0 ± 0.0
PheAsn: 4.748 ± 1.435
PhePro: 2.967 ± 1.722
PheGln: 1.187 ± 0.589
PheArg: 3.561 ± 2.185
PheSer: 7.715 ± 2.145
PheThr: 1.78 ± 0.74
PheVal: 2.374 ± 0.997
PheTrp: 0.0 ± 0.0
PheTyr: 4.748 ± 1.435
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.374 ± 0.841
GlyCys: 0.0 ± 0.0
GlyAsp: 4.154 ± 1.743
GlyGlu: 4.154 ± 1.166
GlyPhe: 2.374 ± 0.899
GlyGly: 1.187 ± 0.45
GlyHis: 1.187 ± 0.589
GlyIle: 5.935 ± 1.713
GlyLys: 3.561 ± 2.189
GlyLeu: 2.374 ± 1.281
GlyMet: 0.593 ± 0.576
GlyAsn: 5.341 ± 1.993
GlyPro: 0.0 ± 0.0
GlyGln: 1.78 ± 1.289
GlyArg: 1.78 ± 0.82
GlySer: 4.154 ± 1.353
GlyThr: 1.78 ± 1.095
GlyVal: 1.78 ± 0.35
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.374 ± 1.719
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.187 ± 0.589
HisCys: 0.0 ± 0.0
HisAsp: 1.78 ± 0.82
HisGlu: 0.593 ± 0.43
HisPhe: 2.374 ± 1.394
HisGly: 1.187 ± 0.45
HisHis: 0.0 ± 0.0
HisIle: 2.374 ± 0.912
HisLys: 0.593 ± 0.43
HisLeu: 3.561 ± 0.936
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.593 ± 0.475
HisGln: 0.593 ± 0.43
HisArg: 0.593 ± 0.905
HisSer: 0.0 ± 0.0
HisThr: 1.187 ± 0.562
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.78 ± 0.82
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.561 ± 1.751
IleCys: 1.78 ± 1.775
IleAsp: 2.967 ± 1.604
IleGlu: 3.561 ± 1.731
IlePhe: 5.935 ± 1.576
IleGly: 2.967 ± 0.843
IleHis: 1.187 ± 0.932
IleIle: 2.374 ± 0.718
IleLys: 6.528 ± 1.485
IleLeu: 5.935 ± 1.235
IleMet: 0.593 ± 0.417
IleAsn: 5.341 ± 2.727
IlePro: 3.561 ± 0.808
IleGln: 3.561 ± 1.025
IleArg: 3.561 ± 0.808
IleSer: 9.496 ± 1.131
IleThr: 3.561 ± 1.85
IleVal: 1.78 ± 1.095
IleTrp: 0.0 ± 0.0
IleTyr: 1.78 ± 0.82
IleXaa: 0.0 ± 0.0

Lys
LysAla: 0.593 ± 0.576
LysCys: 0.593 ± 0.475
LysAsp: 3.561 ± 1.563
LysGlu: 2.374 ± 1.281
LysPhe: 1.78 ± 0.35
LysGly: 1.187 ± 0.45
LysHis: 0.593 ± 0.43
LysIle: 5.935 ± 2.313
LysLys: 5.341 ± 2.634
LysLeu: 5.341 ± 2.271
LysMet: 1.187 ± 0.793
LysAsn: 5.341 ± 1.777
LysPro: 2.374 ± 0.899
LysGln: 4.154 ± 2.144
LysArg: 5.341 ± 1.807
LysSer: 8.309 ± 1.331
LysThr: 5.935 ± 1.107
LysVal: 1.187 ± 1.092
LysTrp: 0.593 ± 0.576
LysTyr: 6.528 ± 2.518
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.935 ± 2.457
LeuCys: 0.0 ± 0.0
LeuAsp: 5.935 ± 0.659
LeuGlu: 3.561 ± 1.584
LeuPhe: 5.341 ± 2.982
LeuGly: 5.935 ± 2.386
LeuHis: 1.187 ± 0.86
LeuIle: 7.122 ± 1.046
LeuLys: 2.967 ± 1.015
LeuLeu: 10.089 ± 2.06
LeuMet: 1.78 ± 1.751
LeuAsn: 8.902 ± 2.209
LeuPro: 2.374 ± 1.262
LeuGln: 6.528 ± 2.11
LeuArg: 2.374 ± 0.899
LeuSer: 8.309 ± 2.125
LeuThr: 5.935 ± 1.121
LeuVal: 4.154 ± 1.237
LeuTrp: 0.593 ± 0.43
LeuTyr: 1.78 ± 0.74
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.593 ± 0.43
MetCys: 0.0 ± 0.0
MetAsp: 1.187 ± 0.589
MetGlu: 0.593 ± 0.43
MetPhe: 0.593 ± 0.576
MetGly: 1.78 ± 0.854
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.593 ± 0.475
MetLeu: 1.78 ± 1.489
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.593 ± 0.43
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 2.374 ± 1.059
MetThr: 0.593 ± 0.576
MetVal: 0.593 ± 0.576
MetTrp: 0.0 ± 0.0
MetTyr: 1.187 ± 0.895
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.561 ± 1.967
AsnCys: 0.593 ± 0.475
AsnAsp: 5.341 ± 2.047
AsnGlu: 4.748 ± 0.945
AsnPhe: 4.154 ± 2.035
AsnGly: 5.935 ± 1.273
AsnHis: 0.593 ± 0.43
AsnIle: 5.935 ± 1.902
AsnLys: 4.154 ± 1.167
AsnLeu: 7.715 ± 2.92
AsnMet: 0.0 ± 0.0
AsnAsn: 7.715 ± 3.125
AsnPro: 3.561 ± 2.111
AsnGln: 3.561 ± 1.887
AsnArg: 5.341 ± 1.521
AsnSer: 7.715 ± 1.297
AsnThr: 1.187 ± 0.932
AsnVal: 2.967 ± 1.147
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.374 ± 1.262
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.593 ± 0.43
ProCys: 0.593 ± 0.475
ProAsp: 1.187 ± 0.954
ProGlu: 2.374 ± 0.545
ProPhe: 1.78 ± 1.289
ProGly: 1.187 ± 0.45
ProHis: 1.187 ± 0.589
ProIle: 1.78 ± 0.82
ProLys: 2.374 ± 1.557
ProLeu: 4.748 ± 1.714
ProMet: 0.593 ± 0.43
ProAsn: 2.374 ± 1.524
ProPro: 0.593 ± 0.475
ProGln: 1.78 ± 1.083
ProArg: 1.187 ± 0.45
ProSer: 2.374 ± 0.787
ProThr: 0.593 ± 0.43
ProVal: 4.154 ± 1.5
ProTrp: 0.0 ± 0.0
ProTyr: 1.78 ± 0.82
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.374 ± 1.578
GlnCys: 0.0 ± 0.0
GlnAsp: 2.374 ± 1.79
GlnGlu: 1.187 ± 0.562
GlnPhe: 0.593 ± 0.905
GlnGly: 2.374 ± 0.592
GlnHis: 0.0 ± 0.0
GlnIle: 3.561 ± 1.916
GlnLys: 4.154 ± 2.539
GlnLeu: 4.748 ± 0.666
GlnMet: 1.187 ± 0.804
GlnAsn: 2.374 ± 1.255
GlnPro: 1.187 ± 0.562
GlnGln: 0.593 ± 0.475
GlnArg: 1.187 ± 0.589
GlnSer: 4.748 ± 1.928
GlnThr: 1.78 ± 1.729
GlnVal: 2.967 ± 0.832
GlnTrp: 0.593 ± 0.475
GlnTyr: 2.967 ± 0.761
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 0.0 ± 0.0
ArgCys: 0.593 ± 0.43
ArgAsp: 0.593 ± 0.576
ArgGlu: 2.374 ± 1.262
ArgPhe: 4.748 ± 1.189
ArgGly: 0.593 ± 0.475
ArgHis: 1.187 ± 0.951
ArgIle: 3.561 ± 0.595
ArgLys: 1.78 ± 1.426
ArgLeu: 5.341 ± 1.934
ArgMet: 0.593 ± 0.576
ArgAsn: 4.154 ± 0.715
ArgPro: 1.78 ± 1.067
ArgGln: 1.78 ± 0.74
ArgArg: 1.78 ± 0.82
ArgSer: 4.748 ± 0.772
ArgThr: 2.967 ± 1.645
ArgVal: 2.374 ± 1.218
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.154 ± 2.075
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 8.309 ± 3.342
SerCys: 0.593 ± 0.475
SerAsp: 4.748 ± 1.702
SerGlu: 5.341 ± 1.017
SerPhe: 4.748 ± 0.713
SerGly: 5.341 ± 1.993
SerHis: 1.78 ± 0.867
SerIle: 8.902 ± 3.436
SerLys: 8.309 ± 1.187
SerLeu: 11.276 ± 2.527
SerMet: 0.0 ± 0.0
SerAsn: 4.154 ± 1.377
SerPro: 5.341 ± 2.681
SerGln: 1.78 ± 1.083
SerArg: 4.154 ± 1.743
SerSer: 10.682 ± 4.319
SerThr: 5.341 ± 1.993
SerVal: 4.748 ± 1.184
SerTrp: 0.593 ± 0.43
SerTyr: 5.935 ± 2.682
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.374 ± 1.218
ThrCys: 0.593 ± 0.475
ThrAsp: 4.748 ± 0.822
ThrGlu: 4.748 ± 2.73
ThrPhe: 2.967 ± 0.832
ThrGly: 2.374 ± 1.218
ThrHis: 1.187 ± 0.908
ThrIle: 3.561 ± 1.025
ThrLys: 4.154 ± 0.805
ThrLeu: 2.374 ± 0.997
ThrMet: 0.0 ± 0.0
ThrAsn: 4.748 ± 1.681
ThrPro: 1.187 ± 0.895
ThrGln: 1.187 ± 1.153
ThrArg: 1.78 ± 0.867
ThrSer: 6.528 ± 2.845
ThrThr: 1.78 ± 0.854
ThrVal: 4.748 ± 1.77
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.78 ± 0.82
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.593 ± 0.912
ValCys: 1.187 ± 0.45
ValAsp: 4.748 ± 1.246
ValGlu: 1.187 ± 0.908
ValPhe: 2.374 ± 1.359
ValGly: 1.78 ± 1.489
ValHis: 1.78 ± 0.816
ValIle: 2.374 ± 2.086
ValLys: 5.341 ± 1.628
ValLeu: 3.561 ± 1.115
ValMet: 0.0 ± 0.0
ValAsn: 4.748 ± 1.569
ValPro: 2.374 ± 0.997
ValGln: 1.187 ± 0.589
ValArg: 2.967 ± 1.147
ValSer: 4.748 ± 1.468
ValThr: 5.935 ± 1.115
ValVal: 2.967 ± 1.698
ValTrp: 0.593 ± 0.43
ValTyr: 1.78 ± 0.82
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.593 ± 0.43
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.593 ± 0.43
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.593 ± 0.43
TrpIle: 0.0 ± 0.0
TrpLys: 1.187 ± 0.45
TrpLeu: 0.593 ± 0.576
TrpMet: 0.593 ± 0.43
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.78 ± 0.867
TrpArg: 0.593 ± 0.475
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.593 ± 0.475
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.561 ± 1.205
TyrCys: 0.0 ± 0.0
TyrAsp: 1.187 ± 0.86
TyrGlu: 4.154 ± 1.489
TyrPhe: 4.748 ± 2.524
TyrGly: 1.187 ± 0.951
TyrHis: 2.374 ± 1.262
TyrIle: 2.967 ± 1.33
TyrLys: 2.967 ± 1.234
TyrLeu: 4.748 ± 2.046
TyrMet: 1.187 ± 0.589
TyrAsn: 8.902 ± 1.923
TyrPro: 1.78 ± 0.74
TyrGln: 0.593 ± 0.576
TyrArg: 2.967 ± 0.957
TyrSer: 2.374 ± 0.899
TyrThr: 3.561 ± 0.808
TyrVal: 5.341 ± 2.271
TyrTrp: 1.187 ± 0.951
TyrTyr: 2.374 ± 1.124
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (1686 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
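One way to read this note is sketched below. The page gives no further detail, so the procedure is an assumption: whole proteins are resampled with replacement, dipeptide frequencies are recomputed for each replicate, and the error is taken as the standard deviation across replicates. The names pair_permille and bootstrap_error, and the fixed seed, are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, n_replicates=100, seed=0):
    """Standard deviation of each dipeptide frequency across replicates.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement; a dipeptide missing from a replicate counts as 0.
    Requires n_replicates >= 2.
    """
    rng = random.Random(seed)
    replicates = [
        pair_permille(rng.choices(proteins, k=len(proteins)))
        for _ in range(n_replicates)
    ]
    pairs = set().union(*replicates)
    return {
        pair: statistics.stdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }

errors = bootstrap_error(["AAAA", "CCCC"])
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is presumably why small proteomes such as this one (5 proteins) show large errors relative to the frequencies.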

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski