Amino acid dipepetide frequency for Capybara microvirus Cap1_SP_59

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.931AlaAla: 5.931 ± 6.187
1.186AlaCys: 1.186 ± 0.485
3.559AlaAsp: 3.559 ± 1.741
6.524AlaGlu: 6.524 ± 5.071
2.966AlaPhe: 2.966 ± 1.389
2.966AlaGly: 2.966 ± 2.964
2.966AlaHis: 2.966 ± 1.414
3.559AlaIle: 3.559 ± 1.534
3.559AlaLys: 3.559 ± 0.646
2.966AlaLeu: 2.966 ± 0.793
0.0AlaMet: 0.0 ± 0.0
2.372AlaAsn: 2.372 ± 1.837
1.779AlaPro: 1.779 ± 1.183
5.338AlaGln: 5.338 ± 5.193
2.966AlaArg: 2.966 ± 0.793
7.711AlaSer: 7.711 ± 2.838
0.593AlaThr: 0.593 ± 0.434
2.966AlaVal: 2.966 ± 1.292
0.593AlaTrp: 0.593 ± 0.676
2.966AlaTyr: 2.966 ± 1.736
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.593CysGlu: 0.593 ± 0.526
1.186CysPhe: 1.186 ± 0.485
0.593CysGly: 0.593 ± 0.526
0.0CysHis: 0.0 ± 0.0
0.593CysIle: 0.593 ± 0.526
0.593CysLys: 0.593 ± 0.526
2.966CysLeu: 2.966 ± 1.924
0.593CysMet: 0.593 ± 0.526
0.593CysAsn: 0.593 ± 1.281
2.966CysPro: 2.966 ± 2.632
1.186CysGln: 1.186 ± 0.867
0.593CysArg: 0.593 ± 0.434
1.186CysSer: 1.186 ± 0.867
0.593CysThr: 0.593 ± 0.526
0.0CysVal: 0.0 ± 0.0
0.593CysTrp: 0.593 ± 0.434
1.186CysTyr: 1.186 ± 0.485
0.0CysXaa: 0.0 ± 0.0
Asp
4.745AspAla: 4.745 ± 1.427
0.0AspCys: 0.0 ± 0.0
2.372AspAsp: 2.372 ± 0.969
3.559AspGlu: 3.559 ± 0.683
3.559AspPhe: 3.559 ± 1.508
4.152AspGly: 4.152 ± 2.186
0.0AspHis: 0.0 ± 0.0
6.524AspIle: 6.524 ± 2.087
1.779AspLys: 1.779 ± 0.77
10.083AspLeu: 10.083 ± 2.199
2.372AspMet: 2.372 ± 1.34
3.559AspAsn: 3.559 ± 1.508
1.186AspPro: 1.186 ± 1.253
0.0AspGln: 0.0 ± 0.0
2.372AspArg: 2.372 ± 0.377
5.931AspSer: 5.931 ± 1.234
5.931AspThr: 5.931 ± 3.593
2.966AspVal: 2.966 ± 1.924
0.0AspTrp: 0.0 ± 0.0
3.559AspTyr: 3.559 ± 1.149
0.0AspXaa: 0.0 ± 0.0
Glu
2.966GluAla: 2.966 ± 2.554
0.593GluCys: 0.593 ± 0.434
3.559GluAsp: 3.559 ± 2.371
2.966GluGlu: 2.966 ± 2.503
2.372GluPhe: 2.372 ± 1.131
2.372GluGly: 2.372 ± 0.377
1.186GluHis: 1.186 ± 0.58
2.966GluIle: 2.966 ± 1.903
0.593GluLys: 0.593 ± 0.676
4.745GluLeu: 4.745 ± 0.754
2.372GluMet: 2.372 ± 1.76
1.779GluAsn: 1.779 ± 1.254
0.0GluPro: 0.0 ± 0.0
5.338GluGln: 5.338 ± 2.045
1.186GluArg: 1.186 ± 1.306
5.338GluSer: 5.338 ± 2.127
2.372GluThr: 2.372 ± 1.642
1.779GluVal: 1.779 ± 1.369
1.186GluTrp: 1.186 ± 0.485
6.524GluTyr: 6.524 ± 1.384
0.0GluXaa: 0.0 ± 0.0
Phe
2.966PheAla: 2.966 ± 1.537
1.186PheCys: 1.186 ± 1.053
5.338PheAsp: 5.338 ± 2.128
2.372PheGlu: 2.372 ± 0.672
2.372PhePhe: 2.372 ± 1.411
1.779PheGly: 1.779 ± 0.754
1.186PheHis: 1.186 ± 0.684
2.966PheIle: 2.966 ± 1.537
3.559PheLys: 3.559 ± 0.646
4.745PheLeu: 4.745 ± 0.7
0.593PheMet: 0.593 ± 1.281
3.559PheAsn: 3.559 ± 1.672
1.779PhePro: 1.779 ± 0.914
1.779PheGln: 1.779 ± 0.754
1.186PheArg: 1.186 ± 0.485
2.372PheSer: 2.372 ± 0.969
2.966PheThr: 2.966 ± 0.737
4.745PheVal: 4.745 ± 1.602
0.0PheTrp: 0.0 ± 0.0
2.966PheTyr: 2.966 ± 1.191
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
0.0GlyCys: 0.0 ± 0.0
2.966GlyAsp: 2.966 ± 1.191
3.559GlyGlu: 3.559 ± 1.207
1.779GlyPhe: 1.779 ± 1.016
1.779GlyGly: 1.779 ± 1.254
1.186GlyHis: 1.186 ± 0.485
5.338GlyIle: 5.338 ± 1.818
3.559GlyLys: 3.559 ± 1.149
4.152GlyLeu: 4.152 ± 2.085
0.0GlyMet: 0.0 ± 0.0
1.779GlyAsn: 1.779 ± 0.77
1.186GlyPro: 1.186 ± 1.053
0.593GlyGln: 0.593 ± 0.676
1.186GlyArg: 1.186 ± 0.867
7.711GlySer: 7.711 ± 2.137
2.372GlyThr: 2.372 ± 0.969
3.559GlyVal: 3.559 ± 0.646
0.0GlyTrp: 0.0 ± 0.0
5.931GlyTyr: 5.931 ± 1.746
0.0GlyXaa: 0.0 ± 0.0
His
1.186HisAla: 1.186 ± 0.684
0.593HisCys: 0.593 ± 0.526
0.593HisAsp: 0.593 ± 0.526
0.0HisGlu: 0.0 ± 0.0
1.186HisPhe: 1.186 ± 1.053
1.186HisGly: 1.186 ± 0.485
1.186HisHis: 1.186 ± 1.053
1.186HisIle: 1.186 ± 0.485
2.372HisLys: 2.372 ± 1.467
1.779HisLeu: 1.779 ± 1.016
0.0HisMet: 0.0 ± 0.0
1.779HisAsn: 1.779 ± 0.754
0.0HisPro: 0.0 ± 0.0
1.779HisGln: 1.779 ± 0.77
1.779HisArg: 1.779 ± 0.914
1.779HisSer: 1.779 ± 0.754
1.779HisThr: 1.779 ± 1.525
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.186HisTyr: 1.186 ± 0.58
0.0HisXaa: 0.0 ± 0.0
Ile
2.372IleAla: 2.372 ± 1.837
1.186IleCys: 1.186 ± 1.053
3.559IleAsp: 3.559 ± 1.207
0.593IleGlu: 0.593 ± 0.526
0.593IlePhe: 0.593 ± 0.434
1.779IleGly: 1.779 ± 1.185
0.593IleHis: 0.593 ± 0.434
2.966IleIle: 2.966 ± 0.793
3.559IleLys: 3.559 ± 2.443
2.372IleLeu: 2.372 ± 0.377
1.186IleMet: 1.186 ± 0.485
5.338IleAsn: 5.338 ± 0.816
2.372IlePro: 2.372 ± 0.969
1.186IleGln: 1.186 ± 0.485
6.524IleArg: 6.524 ± 2.087
5.338IleSer: 5.338 ± 1.025
1.779IleThr: 1.779 ± 1.301
4.745IleVal: 4.745 ± 2.433
0.593IleTrp: 0.593 ± 0.526
2.372IleTyr: 2.372 ± 0.969
0.0IleXaa: 0.0 ± 0.0
Lys
7.117LysAla: 7.117 ± 1.7
1.186LysCys: 1.186 ± 1.053
4.152LysAsp: 4.152 ± 1.455
1.779LysGlu: 1.779 ± 1.016
4.152LysPhe: 4.152 ± 0.919
5.338LysGly: 5.338 ± 1.818
0.593LysHis: 0.593 ± 0.676
3.559LysIle: 3.559 ± 2.371
5.338LysLys: 5.338 ± 1.025
2.372LysLeu: 2.372 ± 1.16
1.186LysMet: 1.186 ± 0.58
2.372LysAsn: 2.372 ± 1.16
2.372LysPro: 2.372 ± 2.106
3.559LysGln: 3.559 ± 0.683
2.372LysArg: 2.372 ± 0.672
5.338LysSer: 5.338 ± 1.785
3.559LysThr: 3.559 ± 1.425
2.966LysVal: 2.966 ± 1.191
0.593LysTrp: 0.593 ± 0.434
5.338LysTyr: 5.338 ± 3.333
0.0LysXaa: 0.0 ± 0.0
Leu
7.117LeuAla: 7.117 ± 5.444
1.186LeuCys: 1.186 ± 0.485
6.524LeuAsp: 6.524 ± 1.067
4.152LeuGlu: 4.152 ± 2.36
5.338LeuPhe: 5.338 ± 0.706
5.931LeuGly: 5.931 ± 2.862
2.966LeuHis: 2.966 ± 1.924
1.779LeuIle: 1.779 ± 0.342
6.524LeuLys: 6.524 ± 2.074
7.117LeuLeu: 7.117 ± 3.616
2.372LeuMet: 2.372 ± 1.837
5.931LeuAsn: 5.931 ± 3.239
8.304LeuPro: 8.304 ± 2.329
4.152LeuGln: 4.152 ± 0.574
5.338LeuArg: 5.338 ± 0.797
4.152LeuSer: 4.152 ± 0.903
5.338LeuThr: 5.338 ± 2.664
2.966LeuVal: 2.966 ± 0.99
0.593LeuTrp: 0.593 ± 0.526
2.372LeuTyr: 2.372 ± 0.969
0.0LeuXaa: 0.0 ± 0.0
Met
2.966MetAla: 2.966 ± 2.212
1.186MetCys: 1.186 ± 1.053
0.593MetAsp: 0.593 ± 0.676
0.0MetGlu: 0.0 ± 0.0
0.593MetPhe: 0.593 ± 0.434
1.186MetGly: 1.186 ± 1.352
0.593MetHis: 0.593 ± 0.526
0.593MetIle: 0.593 ± 0.434
0.593MetLys: 0.593 ± 0.676
2.372MetLeu: 2.372 ± 0.955
1.186MetMet: 1.186 ± 0.867
0.593MetAsn: 0.593 ± 1.281
1.186MetPro: 1.186 ± 0.867
1.186MetGln: 1.186 ± 0.684
1.779MetArg: 1.779 ± 1.362
2.372MetSer: 2.372 ± 1.106
2.372MetThr: 2.372 ± 1.34
0.593MetVal: 0.593 ± 0.434
0.0MetTrp: 0.0 ± 0.0
0.593MetTyr: 0.593 ± 0.526
0.0MetXaa: 0.0 ± 0.0
Asn
4.152AsnAla: 4.152 ± 3.846
0.593AsnCys: 0.593 ± 0.526
4.745AsnAsp: 4.745 ± 1.374
2.966AsnGlu: 2.966 ± 1.292
6.524AsnPhe: 6.524 ± 1.324
1.779AsnGly: 1.779 ± 0.754
0.593AsnHis: 0.593 ± 0.434
2.966AsnIle: 2.966 ± 0.737
4.745AsnLys: 4.745 ± 1.314
4.745AsnLeu: 4.745 ± 1.975
1.186AsnMet: 1.186 ± 0.684
4.745AsnAsn: 4.745 ± 2.256
0.593AsnPro: 0.593 ± 1.281
2.966AsnGln: 2.966 ± 1.169
1.186AsnArg: 1.186 ± 0.58
6.524AsnSer: 6.524 ± 0.77
4.745AsnThr: 4.745 ± 2.728
3.559AsnVal: 3.559 ± 0.683
0.593AsnTrp: 0.593 ± 0.526
1.779AsnTyr: 1.779 ± 1.301
0.0AsnXaa: 0.0 ± 0.0
Pro
1.779ProAla: 1.779 ± 1.185
1.186ProCys: 1.186 ± 1.053
3.559ProAsp: 3.559 ± 1.364
1.779ProGlu: 1.779 ± 1.185
2.966ProPhe: 2.966 ± 1.924
0.593ProGly: 0.593 ± 0.676
1.186ProHis: 1.186 ± 1.053
2.372ProIle: 2.372 ± 1.294
0.593ProLys: 0.593 ± 0.434
3.559ProLeu: 3.559 ± 1.829
0.593ProMet: 0.593 ± 0.434
1.779ProAsn: 1.779 ± 0.914
0.593ProPro: 0.593 ± 0.526
1.779ProGln: 1.779 ± 0.754
2.372ProArg: 2.372 ± 1.411
5.338ProSer: 5.338 ± 2.48
4.152ProThr: 4.152 ± 1.903
1.779ProVal: 1.779 ± 1.369
0.0ProTrp: 0.0 ± 0.0
3.559ProTyr: 3.559 ± 1.454
0.0ProXaa: 0.0 ± 0.0
Gln
2.966GlnAla: 2.966 ± 1.619
1.186GlnCys: 1.186 ± 0.485
3.559GlnAsp: 3.559 ± 0.683
5.338GlnGlu: 5.338 ± 2.045
0.593GlnPhe: 0.593 ± 0.434
1.186GlnGly: 1.186 ± 0.58
1.186GlnHis: 1.186 ± 0.485
1.779GlnIle: 1.779 ± 0.342
2.966GlnLys: 2.966 ± 1.903
6.524GlnLeu: 6.524 ± 1.645
1.186GlnMet: 1.186 ± 1.352
3.559GlnAsn: 3.559 ± 3.22
1.779GlnPro: 1.779 ± 1.301
4.745GlnGln: 4.745 ± 4.561
2.372GlnArg: 2.372 ± 0.672
1.779GlnSer: 1.779 ± 0.77
2.372GlnThr: 2.372 ± 1.734
3.559GlnVal: 3.559 ± 1.672
0.0GlnTrp: 0.0 ± 0.0
2.966GlnTyr: 2.966 ± 1.903
0.0GlnXaa: 0.0 ± 0.0
Arg
1.186ArgAla: 1.186 ± 0.684
1.186ArgCys: 1.186 ± 1.253
2.372ArgAsp: 2.372 ± 0.969
1.779ArgGlu: 1.779 ± 0.77
2.966ArgPhe: 2.966 ± 1.901
2.372ArgGly: 2.372 ± 0.672
1.186ArgHis: 1.186 ± 0.485
2.966ArgIle: 2.966 ± 1.191
2.372ArgLys: 2.372 ± 1.34
6.524ArgLeu: 6.524 ± 1.885
0.0ArgMet: 0.0 ± 0.398
3.559ArgAsn: 3.559 ± 0.951
2.966ArgPro: 2.966 ± 2.457
2.372ArgGln: 2.372 ± 1.837
1.186ArgArg: 1.186 ± 1.053
4.152ArgSer: 4.152 ± 1.455
1.186ArgThr: 1.186 ± 0.58
0.593ArgVal: 0.593 ± 0.434
0.593ArgTrp: 0.593 ± 0.676
2.966ArgTyr: 2.966 ± 1.366
0.0ArgXaa: 0.0 ± 0.0
Ser
7.117SerAla: 7.117 ± 4.927
1.186SerCys: 1.186 ± 0.867
5.931SerAsp: 5.931 ± 2.137
4.745SerGlu: 4.745 ± 1.374
4.152SerPhe: 4.152 ± 0.574
5.338SerGly: 5.338 ± 2.593
1.779SerHis: 1.779 ± 0.914
2.966SerIle: 2.966 ± 1.414
7.711SerLys: 7.711 ± 3.122
8.897SerLeu: 8.897 ± 2.171
2.372SerMet: 2.372 ± 2.283
2.966SerAsn: 2.966 ± 1.901
4.152SerPro: 4.152 ± 2.474
3.559SerGln: 3.559 ± 1.741
5.931SerArg: 5.931 ± 2.621
4.745SerSer: 4.745 ± 2.046
5.931SerThr: 5.931 ± 1.976
4.152SerVal: 4.152 ± 1.573
0.0SerTrp: 0.0 ± 0.0
1.779SerTyr: 1.779 ± 0.914
0.0SerXaa: 0.0 ± 0.0
Thr
5.931ThrAla: 5.931 ± 1.975
0.0ThrCys: 0.0 ± 0.0
5.931ThrAsp: 5.931 ± 0.94
1.186ThrGlu: 1.186 ± 0.867
2.966ThrPhe: 2.966 ± 1.191
1.186ThrGly: 1.186 ± 0.867
0.593ThrHis: 0.593 ± 0.526
1.186ThrIle: 1.186 ± 0.867
4.152ThrLys: 4.152 ± 0.919
7.117ThrLeu: 7.117 ± 0.641
0.593ThrMet: 0.593 ± 0.676
4.745ThrAsn: 4.745 ± 1.602
3.559ThrPro: 3.559 ± 1.364
3.559ThrGln: 3.559 ± 1.425
1.186ThrArg: 1.186 ± 0.867
2.966ThrSer: 2.966 ± 1.901
3.559ThrThr: 3.559 ± 1.901
0.0ThrVal: 0.0 ± 0.0
2.372ThrTrp: 2.372 ± 0.377
4.745ThrTyr: 4.745 ± 1.934
0.0ThrXaa: 0.0 ± 0.0
Val
1.779ValAla: 1.779 ± 0.77
1.186ValCys: 1.186 ± 0.485
1.779ValAsp: 1.779 ± 1.362
3.559ValGlu: 3.559 ± 2.272
1.186ValPhe: 1.186 ± 0.485
4.745ValGly: 4.745 ± 2.001
0.0ValHis: 0.0 ± 0.0
1.186ValIle: 1.186 ± 0.867
3.559ValLys: 3.559 ± 1.362
2.966ValLeu: 2.966 ± 0.737
1.186ValMet: 1.186 ± 0.58
1.779ValAsn: 1.779 ± 0.914
4.745ValPro: 4.745 ± 2.807
2.372ValGln: 2.372 ± 0.672
1.186ValArg: 1.186 ± 1.253
5.338ValSer: 5.338 ± 2.701
2.372ValThr: 2.372 ± 1.734
2.966ValVal: 2.966 ± 1.366
1.779ValTrp: 1.779 ± 0.914
1.186ValTyr: 1.186 ± 1.053
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.593TrpCys: 0.593 ± 0.526
0.0TrpAsp: 0.0 ± 0.0
1.186TrpGlu: 1.186 ± 0.58
0.0TrpPhe: 0.0 ± 0.0
0.593TrpGly: 0.593 ± 0.434
0.593TrpHis: 0.593 ± 0.526
0.0TrpIle: 0.0 ± 0.0
1.779TrpLys: 1.779 ± 0.77
0.593TrpLeu: 0.593 ± 0.676
0.593TrpMet: 0.593 ± 0.434
1.186TrpAsn: 1.186 ± 1.053
0.0TrpPro: 0.0 ± 0.0
1.186TrpGln: 1.186 ± 1.053
0.0TrpArg: 0.0 ± 0.0
0.593TrpSer: 0.593 ± 0.526
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.593TrpTyr: 0.593 ± 0.434
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.186TyrAla: 1.186 ± 0.485
0.593TyrCys: 0.593 ± 0.434
3.559TyrAsp: 3.559 ± 1.508
3.559TyrGlu: 3.559 ± 1.266
2.966TyrPhe: 2.966 ± 1.537
1.779TyrGly: 1.779 ± 1.016
1.779TyrHis: 1.779 ± 0.914
3.559TyrIle: 3.559 ± 1.829
5.338TyrLys: 5.338 ± 1.025
3.559TyrLeu: 3.559 ± 1.956
2.372TyrMet: 2.372 ± 1.993
7.711TyrAsn: 7.711 ± 2.643
0.0TyrPro: 0.0 ± 0.0
2.966TyrGln: 2.966 ± 0.496
2.372TyrArg: 2.372 ± 1.116
4.745TyrSer: 4.745 ± 1.934
3.559TyrThr: 3.559 ± 1.508
2.966TyrVal: 2.966 ± 1.537
0.0TyrTrp: 0.0 ± 0.0
1.186TyrTyr: 1.186 ± 0.867
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1687 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski