Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_217

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
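The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function names `dipeptide_permille` and `classify` are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein (never across protein boundaries), and the uniform 2.5‰ expectation is assumed to be the threshold for over- and underrepresentation.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted inside each protein only, and anything
    outside the 20 standard residues is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input, not the Cap1_SP_217 proteome.
toy = ["MKKLA", "AKKA"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With only a handful of short sequences almost every observed dipeptide lands far above 2.5‰, which is why such a classification is only meaningful when read together with the error estimates.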

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
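As a small worked example of the unit conversion, the snippet below converts one table value to a fraction and to a percentage. The helper names `permille_to_fraction` and `permille_to_percent` are hypothetical and used here only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


leu_gln = 10.046  # LeuGln entry from the table, in per mille
fraction = permille_to_fraction(leu_gln)
percent = permille_to_percent(leu_gln)
print(f"{leu_gln} per mille = {fraction:.6f} = {percent:.4f} %")
```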

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.091 ± 5.23
AlaCys: 0.773 ± 0.826
AlaAsp: 7.728 ± 4.469
AlaGlu: 6.955 ± 5.274
AlaPhe: 0.0 ± 0.0
AlaGly: 2.318 ± 1.311
AlaHis: 1.546 ± 1.222
AlaIle: 3.091 ± 0.711
AlaLys: 2.318 ± 1.962
AlaLeu: 6.182 ± 2.7
AlaMet: 1.546 ± 0.703
AlaAsn: 6.182 ± 1.847
AlaPro: 3.864 ± 2.297
AlaGln: 6.955 ± 4.305
AlaArg: 6.182 ± 1.127
AlaSer: 3.091 ± 1.515
AlaThr: 3.091 ± 1.693
AlaVal: 3.864 ± 1.129
AlaTrp: 1.546 ± 1.085
AlaTyr: 3.864 ± 1.451
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.773 ± 0.826
CysCys: 0.0 ± 0.0
CysAsp: 0.773 ± 1.307
CysGlu: 0.0 ± 0.0
CysPhe: 0.773 ± 0.826
CysGly: 0.773 ± 0.826
CysHis: 0.0 ± 0.0
CysIle: 0.773 ± 0.543
CysLys: 1.546 ± 0.732
CysLeu: 3.091 ± 1.464
CysMet: 0.773 ± 0.543
CysAsn: 0.773 ± 0.543
CysPro: 0.773 ± 0.826
CysGln: 0.0 ± 0.0
CysArg: 0.773 ± 0.826
CysSer: 0.0 ± 0.0
CysThr: 0.773 ± 0.826
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.637 ± 1.527
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 5.41 ± 1.963
AspPhe: 6.182 ± 3.488
AspGly: 9.274 ± 3.723
AspHis: 0.0 ± 0.0
AspIle: 3.091 ± 2.261
AspLys: 3.864 ± 1.144
AspLeu: 4.637 ± 1.918
AspMet: 0.773 ± 1.307
AspAsn: 0.0 ± 0.0
AspPro: 1.546 ± 1.448
AspGln: 0.773 ± 0.543
AspArg: 0.773 ± 0.716
AspSer: 3.091 ± 1.571
AspThr: 1.546 ± 1.227
AspVal: 0.0 ± 0.0
AspTrp: 2.318 ± 1.463
AspTyr: 4.637 ± 1.538
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.637 ± 1.056
GluCys: 0.773 ± 0.826
GluAsp: 3.864 ± 1.654
GluGlu: 9.274 ± 4.017
GluPhe: 3.864 ± 1.458
GluGly: 3.091 ± 1.492
GluHis: 3.091 ± 2.261
GluIle: 3.091 ± 1.624
GluLys: 2.318 ± 2.442
GluLeu: 3.864 ± 2.056
GluMet: 0.773 ± 1.164
GluAsn: 2.318 ± 0.612
GluPro: 2.318 ± 1.376
GluGln: 5.41 ± 1.585
GluArg: 4.637 ± 2.376
GluSer: 3.091 ± 1.407
GluThr: 1.546 ± 1.651
GluVal: 4.637 ± 2.512
GluTrp: 0.0 ± 0.0
GluTyr: 3.091 ± 0.987
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.637 ± 1.26
PheCys: 0.773 ± 0.826
PheAsp: 4.637 ± 2.032
PheGlu: 0.773 ± 0.543
PhePhe: 1.546 ± 0.703
PheGly: 6.182 ± 2.046
PheHis: 0.773 ± 0.543
PheIle: 3.864 ± 1.107
PheLys: 1.546 ± 0.732
PheLeu: 3.091 ± 1.314
PheMet: 0.773 ± 0.702
PheAsn: 2.318 ± 1.206
PhePro: 2.318 ± 1.628
PheGln: 2.318 ± 1.032
PheArg: 2.318 ± 1.628
PheSer: 2.318 ± 1.64
PheThr: 3.864 ± 1.391
PheVal: 2.318 ± 1.628
PheTrp: 0.0 ± 0.0
PheTyr: 1.546 ± 0.732
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.182 ± 1.57
GlyCys: 0.0 ± 0.0
GlyAsp: 3.864 ± 1.783
GlyGlu: 3.091 ± 1.183
GlyPhe: 3.091 ± 1.464
GlyGly: 3.864 ± 1.681
GlyHis: 1.546 ± 1.085
GlyIle: 5.41 ± 2.465
GlyLys: 2.318 ± 1.417
GlyLeu: 6.182 ± 0.675
GlyMet: 1.546 ± 0.703
GlyAsn: 4.637 ± 1.622
GlyPro: 0.773 ± 0.543
GlyGln: 1.546 ± 1.432
GlyArg: 5.41 ± 1.729
GlySer: 4.637 ± 2.512
GlyThr: 6.182 ± 2.63
GlyVal: 0.773 ± 0.826
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.091 ± 1.663
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.773 ± 0.543
HisCys: 0.773 ± 0.826
HisAsp: 0.0 ± 0.0
HisGlu: 0.773 ± 0.826
HisPhe: 2.318 ± 1.628
HisGly: 3.091 ± 0.711
HisHis: 0.0 ± 0.0
HisIle: 1.546 ± 0.732
HisLys: 0.773 ± 0.826
HisLeu: 3.091 ± 1.726
HisMet: 0.773 ± 0.511
HisAsn: 2.318 ± 0.99
HisPro: 0.773 ± 0.826
HisGln: 0.773 ± 0.543
HisArg: 0.773 ± 0.826
HisSer: 0.773 ± 0.543
HisThr: 0.773 ± 0.716
HisVal: 1.546 ± 1.651
HisTrp: 0.0 ± 0.0
HisTyr: 2.318 ± 1.463
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.773 ± 1.307
IleCys: 0.0 ± 0.0
IleAsp: 4.637 ± 1.538
IleGlu: 1.546 ± 1.432
IlePhe: 3.091 ± 2.171
IleGly: 4.637 ± 1.538
IleHis: 1.546 ± 1.085
IleIle: 3.091 ± 1.992
IleLys: 6.182 ± 1.321
IleLeu: 4.637 ± 1.312
IleMet: 0.773 ± 0.543
IleAsn: 3.091 ± 1.647
IlePro: 1.546 ± 0.732
IleGln: 1.546 ± 1.558
IleArg: 0.773 ± 0.543
IleSer: 1.546 ± 1.824
IleThr: 4.637 ± 1.812
IleVal: 1.546 ± 1.651
IleTrp: 2.318 ± 0.612
IleTyr: 0.773 ± 0.543
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.41 ± 1.22
LysCys: 0.773 ± 0.826
LysAsp: 3.091 ± 1.183
LysGlu: 6.182 ± 2.046
LysPhe: 3.864 ± 1.107
LysGly: 0.773 ± 0.716
LysHis: 2.318 ± 1.463
LysIle: 1.546 ± 0.703
LysLys: 6.955 ± 2.971
LysLeu: 1.546 ± 1.227
LysMet: 4.637 ± 1.624
LysAsn: 2.318 ± 1.573
LysPro: 0.773 ± 0.716
LysGln: 3.864 ± 2.041
LysArg: 3.864 ± 2.609
LysSer: 3.091 ± 2.437
LysThr: 4.637 ± 1.056
LysVal: 0.0 ± 0.0
LysTrp: 0.0 ± 0.0
LysTyr: 1.546 ± 1.085
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.182 ± 3.015
LeuCys: 1.546 ± 0.732
LeuAsp: 4.637 ± 1.918
LeuGlu: 9.274 ± 3.232
LeuPhe: 4.637 ± 1.825
LeuGly: 6.182 ± 1.973
LeuHis: 0.0 ± 0.0
LeuIle: 0.773 ± 0.716
LeuLys: 3.864 ± 2.746
LeuLeu: 8.501 ± 2.562
LeuMet: 3.091 ± 1.183
LeuAsn: 6.182 ± 3.294
LeuPro: 3.864 ± 2.713
LeuGln: 10.046 ± 3.478
LeuArg: 5.41 ± 1.646
LeuSer: 6.182 ± 1.839
LeuThr: 3.091 ± 0.987
LeuVal: 3.091 ± 1.492
LeuTrp: 1.546 ± 0.732
LeuTyr: 2.318 ± 0.612
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.318 ± 1.311
MetCys: 0.0 ± 0.0
MetAsp: 3.091 ± 0.987
MetGlu: 3.864 ± 2.056
MetPhe: 0.773 ± 0.716
MetGly: 0.773 ± 0.543
MetHis: 3.091 ± 1.464
MetIle: 0.0 ± 0.0
MetLys: 0.773 ± 1.307
MetLeu: 2.318 ± 0.612
MetMet: 0.0 ± 0.0
MetAsn: 1.546 ± 1.222
MetPro: 0.773 ± 0.826
MetGln: 2.318 ± 0.612
MetArg: 1.546 ± 0.703
MetSer: 1.546 ± 1.227
MetThr: 1.546 ± 1.558
MetVal: 0.773 ± 0.543
MetTrp: 0.773 ± 0.716
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.955 ± 3.035
AsnCys: 0.0 ± 0.0
AsnAsp: 3.864 ± 1.507
AsnGlu: 1.546 ± 0.703
AsnPhe: 0.773 ± 0.543
AsnGly: 3.091 ± 1.272
AsnHis: 0.773 ± 0.543
AsnIle: 3.091 ± 1.726
AsnLys: 4.637 ± 3.217
AsnLeu: 9.274 ± 3.568
AsnMet: 0.773 ± 0.543
AsnAsn: 3.864 ± 1.027
AsnPro: 3.864 ± 1.768
AsnGln: 3.091 ± 2.051
AsnArg: 4.637 ± 1.056
AsnSer: 3.864 ± 1.027
AsnThr: 2.318 ± 1.311
AsnVal: 2.318 ± 1.032
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.091 ± 1.992
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.864 ± 3.792
ProCys: 1.546 ± 0.732
ProAsp: 3.091 ± 1.183
ProGlu: 2.318 ± 0.99
ProPhe: 2.318 ± 1.463
ProGly: 0.773 ± 0.543
ProHis: 1.546 ± 0.732
ProIle: 1.546 ± 1.085
ProLys: 1.546 ± 0.703
ProLeu: 2.318 ± 1.178
ProMet: 0.773 ± 0.543
ProAsn: 3.864 ± 2.056
ProPro: 5.41 ± 3.576
ProGln: 1.546 ± 0.732
ProArg: 2.318 ± 0.99
ProSer: 3.864 ± 1.979
ProThr: 1.546 ± 1.227
ProVal: 3.864 ± 1.654
ProTrp: 2.318 ± 0.612
ProTyr: 1.546 ± 1.651
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.091 ± 1.515
GlnCys: 0.0 ± 0.0
GlnAsp: 2.318 ± 0.99
GlnGlu: 3.091 ± 1.887
GlnPhe: 2.318 ± 1.311
GlnGly: 3.091 ± 1.647
GlnHis: 0.0 ± 0.0
GlnIle: 2.318 ± 2.442
GlnLys: 3.864 ± 1.434
GlnLeu: 4.637 ± 2.522
GlnMet: 2.318 ± 1.767
GlnAsn: 6.955 ± 4.574
GlnPro: 1.546 ± 1.227
GlnGln: 3.091 ± 2.863
GlnArg: 3.091 ± 1.361
GlnSer: 5.41 ± 1.21
GlnThr: 3.091 ± 1.492
GlnVal: 1.546 ± 0.912
GlnTrp: 1.546 ± 0.703
GlnTyr: 6.182 ± 2.777
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.955 ± 3.18
ArgCys: 0.773 ± 0.543
ArgAsp: 0.773 ± 0.543
ArgGlu: 1.546 ± 1.432
ArgPhe: 2.318 ± 0.99
ArgGly: 3.864 ± 1.458
ArgHis: 0.773 ± 0.826
ArgIle: 3.864 ± 2.332
ArgLys: 6.182 ± 1.527
ArgLeu: 6.955 ± 0.718
ArgMet: 1.546 ± 1.558
ArgAsn: 2.318 ± 2.477
ArgPro: 3.864 ± 1.455
ArgGln: 2.318 ± 0.612
ArgArg: 1.546 ± 1.227
ArgSer: 0.773 ± 0.826
ArgThr: 0.773 ± 0.826
ArgVal: 2.318 ± 1.032
ArgTrp: 0.773 ± 0.543
ArgTyr: 3.864 ± 1.107
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.182 ± 1.736
SerCys: 0.773 ± 1.307
SerAsp: 0.773 ± 0.826
SerGlu: 3.864 ± 1.107
SerPhe: 2.318 ± 2.442
SerGly: 2.318 ± 1.311
SerHis: 1.546 ± 0.703
SerIle: 3.091 ± 1.571
SerLys: 2.318 ± 1.573
SerLeu: 3.864 ± 1.144
SerMet: 3.091 ± 1.515
SerAsn: 4.637 ± 1.592
SerPro: 3.864 ± 2.508
SerGln: 4.637 ± 2.258
SerArg: 3.864 ± 1.681
SerSer: 3.864 ± 1.027
SerThr: 3.091 ± 1.407
SerVal: 4.637 ± 3.256
SerTrp: 0.0 ± 0.0
SerTyr: 2.318 ± 0.612
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.318 ± 1.178
ThrCys: 2.318 ± 0.99
ThrAsp: 3.091 ± 1.257
ThrGlu: 2.318 ± 2.477
ThrPhe: 4.637 ± 1.26
ThrGly: 5.41 ± 1.963
ThrHis: 1.546 ± 0.703
ThrIle: 1.546 ± 1.085
ThrLys: 2.318 ± 1.032
ThrLeu: 3.864 ± 1.107
ThrMet: 0.773 ± 0.543
ThrAsn: 2.318 ± 1.376
ThrPro: 3.091 ± 0.711
ThrGln: 2.318 ± 1.767
ThrArg: 2.318 ± 1.586
ThrSer: 3.864 ± 2.105
ThrThr: 2.318 ± 1.032
ThrVal: 0.773 ± 1.279
ThrTrp: 0.773 ± 0.543
ThrTyr: 1.546 ± 1.448
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.546 ± 1.222
ValCys: 0.0 ± 0.0
ValAsp: 0.773 ± 0.543
ValGlu: 2.318 ± 1.628
ValPhe: 0.0 ± 0.0
ValGly: 2.318 ± 1.628
ValHis: 0.773 ± 0.543
ValIle: 1.546 ± 1.432
ValLys: 3.091 ± 1.418
ValLeu: 3.864 ± 1.107
ValMet: 0.773 ± 0.826
ValAsn: 0.773 ± 0.543
ValPro: 5.41 ± 2.877
ValGln: 2.318 ± 0.612
ValArg: 1.546 ± 1.085
ValSer: 6.182 ± 1.973
ValThr: 2.318 ± 0.99
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 2.318 ± 1.586
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.546 ± 0.703
TrpCys: 0.773 ± 0.826
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.546 ± 1.085
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.773 ± 0.543
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.546 ± 0.732
TrpMet: 0.0 ± 0.0
TrpAsn: 1.546 ± 0.703
TrpPro: 1.546 ± 0.732
TrpGln: 0.773 ± 0.543
TrpArg: 2.318 ± 1.463
TrpSer: 0.773 ± 0.543
TrpThr: 0.773 ± 0.716
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.546 ± 0.703
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.318 ± 1.628
TyrCys: 0.773 ± 0.543
TyrAsp: 1.546 ± 1.448
TyrGlu: 0.773 ± 0.826
TyrPhe: 3.091 ± 1.693
TyrGly: 2.318 ± 0.612
TyrHis: 2.318 ± 2.477
TyrIle: 5.41 ± 3.718
TyrLys: 0.773 ± 0.543
TyrLeu: 6.955 ± 1.834
TyrMet: 1.546 ± 0.732
TyrAsn: 3.864 ± 3.579
TyrPro: 0.0 ± 0.0
TyrGln: 3.864 ± 1.768
TyrArg: 0.0 ± 0.0
TyrSer: 3.091 ± 1.515
TyrThr: 1.546 ± 1.38
TyrVal: 3.864 ± 2.162
TyrTrp: 1.546 ± 1.085
TyrTyr: 2.318 ± 1.586
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1295 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
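One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption-laden illustration rather than the database's actual procedure; the function names, the fixed seed, and the use of the sample standard deviation as the error measure are all choices made here.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumption: each round draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not the Cap1_SP_217 proteome.
toy = ["MKKLA", "AKKA", "MLLKA"]
err_kk = bootstrap_error(toy, "KK")
err_ww = bootstrap_error(toy, "WW")
print(f"KK: {pair_permille(toy, 'KK'):.3f} ± {err_kk:.3f}")
print(f"WW: {pair_permille(toy, 'WW'):.3f} ± {err_ww:.3f}")
```

Under this reading, a dipeptide that occurs in none of the proteins has zero frequency in every resample, so its error is zero as well, which matches the 0.0 ± 0.0 entries in the table. With only 5 proteins, the resamples differ strongly from one another, so errors as large as the values themselves are to be expected.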

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski