Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_581

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red in the table, and under-represented ones are marked in blue.
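The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_per_mille` and `classify` are hypothetical, pairs are assumed to be counted only within each protein, and the over/under split simply compares against the uniform 2.5‰ expectation (the exact rule used for the red and blue marking is not stated on the page). Keys are one-letter pairs such as "GL" rather than the three-letter "GlyLeu" used in the table.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table header; "X" stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each protein only,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map any symbol outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

With this threshold, GlyLeu at 11.291‰ would be labelled "over" and AlaHis at 0.706‰ "under".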

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
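As a small worked example of the unit conversion, the helpers below (hypothetical names, written only for illustration) turn a per mille value from the table into a fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0
```

For instance, the GlyLeu entry of 11.291‰ corresponds to a fraction of about 0.011291, or roughly 1.13%, while the uniform expectation of 0.25% equals 2.5‰.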

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.411 ± 1.472
AlaCys: 2.117 ± 1.525
AlaAsp: 4.234 ± 1.397
AlaGlu: 5.646 ± 1.728
AlaPhe: 2.823 ± 1.412
AlaGly: 4.234 ± 2.712
AlaHis: 0.706 ± 0.736
AlaIle: 3.529 ± 1.524
AlaLys: 2.823 ± 0.763
AlaLeu: 7.057 ± 2.016
AlaMet: 0.706 ± 0.878
AlaAsn: 4.94 ± 2.619
AlaPro: 2.117 ± 1.155
AlaGln: 3.529 ± 2.036
AlaArg: 2.823 ± 0.903
AlaSer: 0.706 ± 0.736
AlaThr: 4.234 ± 1.88
AlaVal: 6.351 ± 1.911
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.117 ± 0.669
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.117 ± 1.201
CysGlu: 0.0 ± 0.0
CysPhe: 2.117 ± 1.525
CysGly: 2.117 ± 1.608
CysHis: 0.0 ± 0.0
CysIle: 0.706 ± 0.878
CysLys: 1.411 ± 0.982
CysLeu: 0.706 ± 1.086
CysMet: 2.117 ± 1.628
CysAsn: 0.706 ± 0.878
CysPro: 0.706 ± 0.491
CysGln: 0.0 ± 0.0
CysArg: 0.706 ± 0.614
CysSer: 2.117 ± 0.98
CysThr: 0.0 ± 0.0
CysVal: 1.411 ± 1.14
CysTrp: 0.0 ± 0.0
CysTyr: 0.706 ± 1.086
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.117 ± 0.955
AspCys: 2.117 ± 2.175
AspAsp: 4.94 ± 2.553
AspGlu: 5.646 ± 1.626
AspPhe: 4.94 ± 1.71
AspGly: 0.706 ± 0.614
AspHis: 0.706 ± 0.491
AspIle: 3.529 ± 1.961
AspLys: 4.94 ± 3.374
AspLeu: 8.469 ± 2.301
AspMet: 0.706 ± 0.491
AspAsn: 5.646 ± 1.804
AspPro: 1.411 ± 1.229
AspGln: 2.117 ± 0.968
AspArg: 3.529 ± 1.712
AspSer: 3.529 ± 1.712
AspThr: 3.529 ± 1.18
AspVal: 6.351 ± 1.322
AspTrp: 0.0 ± 0.0
AspTyr: 6.351 ± 2.097
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.706 ± 0.736
GluCys: 1.411 ± 2.172
GluAsp: 5.646 ± 3.013
GluGlu: 3.529 ± 1.248
GluPhe: 3.529 ± 1.172
GluGly: 1.411 ± 0.706
GluHis: 1.411 ± 1.229
GluIle: 2.823 ± 1.014
GluLys: 5.646 ± 2.014
GluLeu: 5.646 ± 2.864
GluMet: 2.117 ± 0.906
GluAsn: 4.94 ± 1.758
GluPro: 2.823 ± 1.778
GluGln: 2.823 ± 0.763
GluArg: 3.529 ± 1.231
GluSer: 2.117 ± 1.463
GluThr: 2.117 ± 0.669
GluVal: 4.234 ± 1.712
GluTrp: 2.117 ± 0.928
GluTyr: 2.823 ± 1.073
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.823 ± 1.363
PheCys: 1.411 ± 1.14
PheAsp: 3.529 ± 1.892
PheGlu: 2.823 ± 1.784
PhePhe: 2.117 ± 1.473
PheGly: 3.529 ± 1.196
PheHis: 1.411 ± 0.56
PheIle: 4.234 ± 2.137
PheLys: 1.411 ± 1.229
PheLeu: 2.117 ± 0.856
PheMet: 2.117 ± 1.396
PheAsn: 4.94 ± 1.384
PhePro: 2.117 ± 0.992
PheGln: 1.411 ± 0.982
PheArg: 2.117 ± 1.536
PheSer: 2.823 ± 0.931
PheThr: 2.823 ± 0.931
PheVal: 2.823 ± 1.123
PheTrp: 2.117 ± 0.856
PheTyr: 2.117 ± 1.356
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.823 ± 1.363
GlyCys: 2.117 ± 0.856
GlyAsp: 4.234 ± 1.735
GlyGlu: 3.529 ± 1.712
GlyPhe: 4.234 ± 1.15
GlyGly: 2.117 ± 1.473
GlyHis: 2.117 ± 1.199
GlyIle: 4.234 ± 1.359
GlyLys: 2.823 ± 1.363
GlyLeu: 11.291 ± 2.971
GlyMet: 1.411 ± 0.706
GlyAsn: 4.234 ± 1.315
GlyPro: 0.0 ± 0.0
GlyGln: 0.706 ± 0.491
GlyArg: 1.411 ± 0.56
GlySer: 4.234 ± 1.019
GlyThr: 4.234 ± 1.359
GlyVal: 2.823 ± 1.278
GlyTrp: 0.706 ± 0.491
GlyTyr: 2.117 ± 1.068
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.706 ± 0.614
HisCys: 0.0 ± 0.0
HisAsp: 2.117 ± 1.608
HisGlu: 0.706 ± 0.614
HisPhe: 2.117 ± 1.199
HisGly: 1.411 ± 0.706
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 2.117 ± 0.849
HisLeu: 2.823 ± 1.121
HisMet: 1.411 ± 1.088
HisAsn: 2.823 ± 1.014
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.411 ± 0.967
HisThr: 2.823 ± 1.278
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.706 ± 0.614
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.351 ± 3.138
IleCys: 0.0 ± 0.0
IleAsp: 4.94 ± 2.374
IleGlu: 1.411 ± 1.377
IlePhe: 2.823 ± 0.931
IleGly: 4.234 ± 2.21
IleHis: 0.706 ± 0.878
IleIle: 2.823 ± 2.021
IleLys: 4.234 ± 2.137
IleLeu: 4.234 ± 1.843
IleMet: 2.117 ± 1.249
IleAsn: 3.529 ± 0.991
IlePro: 2.117 ± 0.669
IleGln: 3.529 ± 0.621
IleArg: 2.117 ± 1.914
IleSer: 4.234 ± 2.583
IleThr: 2.823 ± 0.903
IleVal: 2.823 ± 0.903
IleTrp: 1.411 ± 0.706
IleTyr: 5.646 ± 1.881
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.234 ± 1.879
LysCys: 0.706 ± 0.878
LysAsp: 2.823 ± 1.121
LysGlu: 2.117 ± 0.849
LysPhe: 0.706 ± 0.491
LysGly: 2.823 ± 1.173
LysHis: 2.117 ± 1.473
LysIle: 4.94 ± 2.618
LysLys: 3.529 ± 1.313
LysLeu: 8.469 ± 1.933
LysMet: 3.529 ± 1.36
LysAsn: 4.234 ± 2.687
LysPro: 0.706 ± 0.491
LysGln: 2.117 ± 1.343
LysArg: 2.823 ± 1.123
LysSer: 2.823 ± 1.802
LysThr: 0.706 ± 0.614
LysVal: 4.94 ± 1.625
LysTrp: 0.706 ± 0.491
LysTyr: 2.823 ± 1.659
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.646 ± 2.143
LeuCys: 1.411 ± 1.517
LeuAsp: 6.351 ± 3.036
LeuGlu: 7.763 ± 3.832
LeuPhe: 4.234 ± 1.516
LeuGly: 5.646 ± 1.821
LeuHis: 1.411 ± 1.052
LeuIle: 6.351 ± 1.236
LeuLys: 5.646 ± 2.947
LeuLeu: 2.823 ± 1.65
LeuMet: 0.706 ± 0.614
LeuAsn: 7.763 ± 3.156
LeuPro: 5.646 ± 2.453
LeuGln: 4.94 ± 1.891
LeuArg: 4.234 ± 1.681
LeuSer: 8.469 ± 1.432
LeuThr: 2.117 ± 0.856
LeuVal: 2.117 ± 1.199
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.706 ± 1.086
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.823 ± 0.997
MetCys: 0.706 ± 0.491
MetAsp: 0.706 ± 0.491
MetGlu: 1.411 ± 1.14
MetPhe: 0.706 ± 0.491
MetGly: 0.706 ± 0.491
MetHis: 0.706 ± 0.878
MetIle: 0.0 ± 0.0
MetLys: 0.706 ± 0.878
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 3.529 ± 1.172
MetPro: 0.706 ± 0.736
MetGln: 0.0 ± 0.0
MetArg: 2.823 ± 1.347
MetSer: 3.529 ± 1.49
MetThr: 1.411 ± 1.273
MetVal: 2.117 ± 1.343
MetTrp: 0.0 ± 0.0
MetTyr: 2.117 ± 1.473
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.351 ± 2.583
AsnCys: 2.823 ± 1.357
AsnAsp: 3.529 ± 2.378
AsnGlu: 4.94 ± 1.177
AsnPhe: 5.646 ± 1.35
AsnGly: 4.234 ± 2.137
AsnHis: 1.411 ± 0.706
AsnIle: 4.234 ± 1.42
AsnLys: 2.823 ± 2.002
AsnLeu: 4.234 ± 2.058
AsnMet: 1.411 ± 0.745
AsnAsn: 1.411 ± 1.034
AsnPro: 3.529 ± 1.892
AsnGln: 1.411 ± 0.982
AsnArg: 4.234 ± 2.227
AsnSer: 4.94 ± 1.908
AsnThr: 2.823 ± 1.363
AsnVal: 4.234 ± 1.359
AsnTrp: 0.706 ± 0.614
AsnTyr: 3.529 ± 1.737
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.706 ± 0.491
ProCys: 0.706 ± 0.614
ProAsp: 4.234 ± 2.103
ProGlu: 2.117 ± 1.473
ProPhe: 1.411 ± 1.052
ProGly: 1.411 ± 0.982
ProHis: 2.117 ± 1.068
ProIle: 2.823 ± 1.209
ProLys: 3.529 ± 1.343
ProLeu: 4.94 ± 2.785
ProMet: 2.117 ± 0.856
ProAsn: 0.706 ± 0.491
ProPro: 2.117 ± 1.117
ProGln: 1.411 ± 0.56
ProArg: 2.117 ± 0.856
ProSer: 2.117 ± 1.068
ProThr: 1.411 ± 0.706
ProVal: 0.706 ± 0.491
ProTrp: 0.706 ± 0.491
ProTyr: 1.411 ± 0.967
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.823 ± 2.944
GlnCys: 0.706 ± 0.491
GlnAsp: 0.706 ± 0.614
GlnGlu: 1.411 ± 0.706
GlnPhe: 0.706 ± 0.614
GlnGly: 7.057 ± 1.854
GlnHis: 0.706 ± 0.736
GlnIle: 1.411 ± 0.967
GlnLys: 4.234 ± 1.892
GlnLeu: 2.823 ± 0.763
GlnMet: 0.0 ± 0.0
GlnAsn: 2.117 ± 0.955
GlnPro: 0.706 ± 0.614
GlnGln: 2.117 ± 0.955
GlnArg: 2.117 ± 0.968
GlnSer: 1.411 ± 0.982
GlnThr: 2.823 ± 1.014
GlnVal: 1.411 ± 0.982
GlnTrp: 0.706 ± 0.491
GlnTyr: 1.411 ± 0.56
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.411 ± 0.967
ArgCys: 0.0 ± 0.0
ArgAsp: 2.823 ± 1.488
ArgGlu: 4.94 ± 1.053
ArgPhe: 4.94 ± 2.134
ArgGly: 2.823 ± 1.964
ArgHis: 1.411 ± 0.982
ArgIle: 3.529 ± 1.712
ArgLys: 2.823 ± 1.278
ArgLeu: 4.234 ± 2.272
ArgMet: 0.0 ± 0.0
ArgAsn: 0.706 ± 0.878
ArgPro: 4.94 ± 1.441
ArgGln: 1.411 ± 0.56
ArgArg: 2.117 ± 1.343
ArgSer: 3.529 ± 2.236
ArgThr: 1.411 ± 1.052
ArgVal: 1.411 ± 1.132
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.823 ± 1.015
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.646 ± 2.52
SerCys: 0.0 ± 0.0
SerAsp: 7.057 ± 3.857
SerGlu: 2.117 ± 1.348
SerPhe: 0.0 ± 0.0
SerGly: 7.057 ± 3.212
SerHis: 1.411 ± 1.052
SerIle: 6.351 ± 2.526
SerLys: 3.529 ± 1.05
SerLeu: 4.234 ± 1.391
SerMet: 2.117 ± 0.968
SerAsn: 4.234 ± 1.455
SerPro: 3.529 ± 1.592
SerGln: 2.823 ± 1.278
SerArg: 2.823 ± 1.121
SerSer: 8.469 ± 1.036
SerThr: 4.94 ± 1.031
SerVal: 2.117 ± 0.955
SerTrp: 0.706 ± 0.491
SerTyr: 2.117 ± 1.199
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.529 ± 1.768
ThrCys: 0.706 ± 0.491
ThrAsp: 2.823 ± 1.014
ThrGlu: 3.529 ± 1.014
ThrPhe: 2.823 ± 1.278
ThrGly: 1.411 ± 0.56
ThrHis: 0.706 ± 1.009
ThrIle: 4.234 ± 1.826
ThrLys: 1.411 ± 1.052
ThrLeu: 4.234 ± 1.826
ThrMet: 0.0 ± 0.0
ThrAsn: 4.234 ± 2.117
ThrPro: 1.411 ± 0.706
ThrGln: 1.411 ± 0.892
ThrArg: 2.823 ± 1.123
ThrSer: 7.057 ± 2.714
ThrThr: 2.117 ± 0.955
ThrVal: 0.0 ± 0.0
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.529 ± 1.592
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.351 ± 2.383
ValCys: 0.706 ± 0.878
ValAsp: 4.94 ± 1.901
ValGlu: 4.234 ± 2.058
ValPhe: 2.117 ± 0.928
ValGly: 3.529 ± 1.892
ValHis: 0.0 ± 0.0
ValIle: 4.234 ± 0.998
ValLys: 2.117 ± 0.856
ValLeu: 1.411 ± 0.706
ValMet: 1.411 ± 0.706
ValAsn: 1.411 ± 0.982
ValPro: 2.823 ± 0.903
ValGln: 2.823 ± 0.763
ValArg: 2.823 ± 1.488
ValSer: 4.94 ± 1.205
ValThr: 1.411 ± 1.229
ValVal: 6.351 ± 1.896
ValTrp: 0.0 ± 0.0
ValTyr: 2.117 ± 0.856
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.706 ± 0.491
TrpGlu: 0.706 ± 0.491
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.706 ± 0.491
TrpIle: 0.706 ± 0.491
TrpLys: 0.706 ± 0.614
TrpLeu: 1.411 ± 0.798
TrpMet: 0.0 ± 0.0
TrpAsn: 0.706 ± 0.736
TrpPro: 1.411 ± 0.982
TrpGln: 0.0 ± 0.0
TrpArg: 1.411 ± 0.56
TrpSer: 0.706 ± 0.614
TrpThr: 0.706 ± 0.491
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.706 ± 0.491
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.646 ± 1.434
TyrCys: 0.0 ± 0.0
TyrAsp: 2.823 ± 0.951
TyrGlu: 2.823 ± 1.209
TyrPhe: 2.823 ± 1.015
TyrGly: 4.94 ± 1.946
TyrHis: 1.411 ± 1.229
TyrIle: 2.117 ± 0.849
TyrLys: 1.411 ± 1.034
TyrLeu: 2.823 ± 1.401
TyrMet: 0.0 ± 0.0
TyrAsn: 5.646 ± 1.672
TyrPro: 0.0 ± 0.0
TyrGln: 2.823 ± 2.21
TyrArg: 0.706 ± 0.491
TyrSer: 2.117 ± 0.856
TyrThr: 3.529 ± 0.791
TyrVal: 3.529 ± 1.307
TyrTrp: 0.706 ± 0.491
TyrTyr: 2.117 ± 1.301
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1418 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
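A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the sample standard deviation across replicates. The page does not say which spread measure is reported, and the names `dipeptide_freqs` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Per mille frequency of every dipeptide observed in the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over n_boot
    replicates, each drawn with replacement from the protein list.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(n_boot):
        resample = [rng.choice(proteins) for _ in proteins]
        samples.append(dipeptide_freqs(resample).get(dipeptide, 0.0))
    return statistics.stdev(samples)
```

With only 6 proteins in this proteome, each replicate can differ substantially from the original set, which is consistent with errors in the table that are often comparable in size to the frequencies themselves.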

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski