Amino acid dipepetide frequency for Citrobacter virus HCF1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.407AlaAla: 5.407 ± 0.895
1.004AlaCys: 1.004 ± 0.373
3.939AlaAsp: 3.939 ± 0.573
4.326AlaGlu: 4.326 ± 0.746
3.785AlaPhe: 3.785 ± 0.794
6.102AlaGly: 6.102 ± 1.077
1.699AlaHis: 1.699 ± 0.43
4.789AlaIle: 4.789 ± 0.628
5.021AlaLys: 5.021 ± 0.595
6.257AlaLeu: 6.257 ± 0.982
2.935AlaMet: 2.935 ± 0.772
3.399AlaAsn: 3.399 ± 0.888
2.24AlaPro: 2.24 ± 0.478
3.862AlaGln: 3.862 ± 0.726
5.407AlaArg: 5.407 ± 0.799
4.48AlaSer: 4.48 ± 0.786
4.171AlaThr: 4.171 ± 0.757
4.635AlaVal: 4.635 ± 0.674
0.772AlaTrp: 0.772 ± 0.322
3.09AlaTyr: 3.09 ± 0.462
0.0AlaXaa: 0.0 ± 0.0
Cys
1.313CysAla: 1.313 ± 0.413
0.232CysCys: 0.232 ± 0.167
1.004CysAsp: 1.004 ± 0.343
0.463CysGlu: 0.463 ± 0.203
0.541CysPhe: 0.541 ± 0.184
1.854CysGly: 1.854 ± 0.547
0.154CysHis: 0.154 ± 0.106
0.463CysIle: 0.463 ± 0.249
1.236CysLys: 1.236 ± 0.348
1.159CysLeu: 1.159 ± 0.338
0.772CysMet: 0.772 ± 0.433
1.081CysAsn: 1.081 ± 0.331
0.463CysPro: 0.463 ± 0.233
0.463CysGln: 0.463 ± 0.199
1.159CysArg: 1.159 ± 0.428
0.772CysSer: 0.772 ± 0.239
1.313CysThr: 1.313 ± 0.392
0.772CysVal: 0.772 ± 0.314
0.463CysTrp: 0.463 ± 0.333
1.313CysTyr: 1.313 ± 0.503
0.0CysXaa: 0.0 ± 0.0
Asp
4.48AspAla: 4.48 ± 0.779
0.695AspCys: 0.695 ± 0.301
3.09AspAsp: 3.09 ± 0.561
3.708AspGlu: 3.708 ± 0.594
3.167AspPhe: 3.167 ± 0.7
5.484AspGly: 5.484 ± 0.757
1.236AspHis: 1.236 ± 0.39
3.167AspIle: 3.167 ± 0.559
4.171AspLys: 4.171 ± 0.934
4.017AspLeu: 4.017 ± 0.563
1.545AspMet: 1.545 ± 0.393
3.167AspAsn: 3.167 ± 0.506
2.24AspPro: 2.24 ± 0.535
2.24AspGln: 2.24 ± 0.412
3.09AspArg: 3.09 ± 0.558
2.935AspSer: 2.935 ± 0.51
3.399AspThr: 3.399 ± 0.671
3.476AspVal: 3.476 ± 0.6
0.618AspTrp: 0.618 ± 0.269
2.935AspTyr: 2.935 ± 0.801
0.0AspXaa: 0.0 ± 0.0
Glu
5.407GluAla: 5.407 ± 0.919
1.159GluCys: 1.159 ± 0.446
3.244GluAsp: 3.244 ± 0.532
3.321GluGlu: 3.321 ± 0.687
3.939GluPhe: 3.939 ± 0.586
4.557GluGly: 4.557 ± 0.708
1.236GluHis: 1.236 ± 0.306
4.171GluIle: 4.171 ± 0.605
4.866GluLys: 4.866 ± 0.791
5.716GluLeu: 5.716 ± 0.852
2.086GluMet: 2.086 ± 0.413
2.626GluAsn: 2.626 ± 0.493
1.777GluPro: 1.777 ± 0.709
2.163GluGln: 2.163 ± 0.559
1.545GluArg: 1.545 ± 0.35
4.248GluSer: 4.248 ± 0.567
3.63GluThr: 3.63 ± 0.594
4.017GluVal: 4.017 ± 0.712
1.159GluTrp: 1.159 ± 0.332
2.858GluTyr: 2.858 ± 0.762
0.0GluXaa: 0.0 ± 0.0
Phe
3.785PheAla: 3.785 ± 0.767
0.309PheCys: 0.309 ± 0.168
2.549PheAsp: 2.549 ± 0.599
2.781PheGlu: 2.781 ± 0.677
0.927PhePhe: 0.927 ± 0.238
3.862PheGly: 3.862 ± 0.593
0.772PheHis: 0.772 ± 0.284
3.013PheIle: 3.013 ± 0.556
3.244PheLys: 3.244 ± 0.643
3.321PheLeu: 3.321 ± 0.524
0.85PheMet: 0.85 ± 0.264
2.317PheAsn: 2.317 ± 0.497
1.699PhePro: 1.699 ± 0.466
0.85PheGln: 0.85 ± 0.24
2.24PheArg: 2.24 ± 0.477
2.317PheSer: 2.317 ± 0.446
2.395PheThr: 2.395 ± 0.486
2.317PheVal: 2.317 ± 0.454
0.618PheTrp: 0.618 ± 0.195
1.004PheTyr: 1.004 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
5.639GlyAla: 5.639 ± 0.873
1.004GlyCys: 1.004 ± 0.279
4.403GlyAsp: 4.403 ± 0.687
5.562GlyGlu: 5.562 ± 0.644
2.549GlyPhe: 2.549 ± 0.529
5.484GlyGly: 5.484 ± 1.348
1.081GlyHis: 1.081 ± 0.252
4.171GlyIle: 4.171 ± 0.829
6.566GlyLys: 6.566 ± 0.852
4.557GlyLeu: 4.557 ± 0.791
3.244GlyMet: 3.244 ± 0.584
3.476GlyAsn: 3.476 ± 0.46
0.0GlyPro: 0.0 ± 0.0
1.081GlyGln: 1.081 ± 0.269
3.476GlyArg: 3.476 ± 0.56
4.017GlySer: 4.017 ± 0.63
2.781GlyThr: 2.781 ± 0.861
6.18GlyVal: 6.18 ± 0.853
1.468GlyTrp: 1.468 ± 0.412
2.549GlyTyr: 2.549 ± 0.463
0.0GlyXaa: 0.0 ± 0.0
His
1.081HisAla: 1.081 ± 0.279
0.232HisCys: 0.232 ± 0.132
1.545HisAsp: 1.545 ± 0.447
0.927HisGlu: 0.927 ± 0.292
0.85HisPhe: 0.85 ± 0.368
1.931HisGly: 1.931 ± 0.513
1.236HisHis: 1.236 ± 0.684
0.927HisIle: 0.927 ± 0.27
1.081HisLys: 1.081 ± 0.356
1.39HisLeu: 1.39 ± 0.328
0.695HisMet: 0.695 ± 0.294
0.541HisAsn: 0.541 ± 0.196
0.85HisPro: 0.85 ± 0.345
1.004HisGln: 1.004 ± 0.298
0.618HisArg: 0.618 ± 0.233
0.695HisSer: 0.695 ± 0.252
1.159HisThr: 1.159 ± 0.352
1.159HisVal: 1.159 ± 0.333
0.0HisTrp: 0.0 ± 0.0
0.618HisTyr: 0.618 ± 0.234
0.0HisXaa: 0.0 ± 0.0
Ile
5.33IleAla: 5.33 ± 0.832
0.85IleCys: 0.85 ± 0.297
5.021IleAsp: 5.021 ± 0.899
4.635IleGlu: 4.635 ± 0.594
2.086IlePhe: 2.086 ± 0.435
3.708IleGly: 3.708 ± 0.626
1.39IleHis: 1.39 ± 0.526
3.244IleIle: 3.244 ± 0.469
4.171IleLys: 4.171 ± 0.659
3.553IleLeu: 3.553 ± 0.71
1.159IleMet: 1.159 ± 0.3
3.244IleAsn: 3.244 ± 0.733
2.24IlePro: 2.24 ± 0.516
2.395IleGln: 2.395 ± 0.459
3.708IleArg: 3.708 ± 0.496
4.171IleSer: 4.171 ± 0.829
6.411IleThr: 6.411 ± 0.81
4.712IleVal: 4.712 ± 0.562
1.004IleTrp: 1.004 ± 0.273
2.317IleTyr: 2.317 ± 0.506
0.0IleXaa: 0.0 ± 0.0
Lys
7.415LysAla: 7.415 ± 1.273
1.004LysCys: 1.004 ± 0.409
4.094LysAsp: 4.094 ± 0.592
4.557LysGlu: 4.557 ± 0.776
3.321LysPhe: 3.321 ± 0.54
4.866LysGly: 4.866 ± 0.78
1.545LysHis: 1.545 ± 0.396
4.789LysIle: 4.789 ± 0.634
3.013LysLys: 3.013 ± 0.685
5.407LysLeu: 5.407 ± 0.772
2.626LysMet: 2.626 ± 0.48
2.163LysAsn: 2.163 ± 0.54
2.781LysPro: 2.781 ± 0.567
3.708LysGln: 3.708 ± 0.741
3.785LysArg: 3.785 ± 0.629
4.171LysSer: 4.171 ± 0.629
3.63LysThr: 3.63 ± 0.604
4.557LysVal: 4.557 ± 0.653
2.008LysTrp: 2.008 ± 0.467
2.781LysTyr: 2.781 ± 0.546
0.0LysXaa: 0.0 ± 0.0
Leu
5.253LeuAla: 5.253 ± 0.914
1.313LeuCys: 1.313 ± 0.458
3.862LeuAsp: 3.862 ± 0.725
4.712LeuGlu: 4.712 ± 0.702
1.777LeuPhe: 1.777 ± 0.399
3.553LeuGly: 3.553 ± 0.776
1.004LeuHis: 1.004 ± 0.313
4.635LeuIle: 4.635 ± 0.623
5.175LeuLys: 5.175 ± 0.753
4.557LeuLeu: 4.557 ± 0.661
2.472LeuMet: 2.472 ± 0.473
3.708LeuAsn: 3.708 ± 0.643
3.244LeuPro: 3.244 ± 0.569
3.244LeuGln: 3.244 ± 0.63
5.33LeuArg: 5.33 ± 0.748
6.102LeuSer: 6.102 ± 0.774
5.33LeuThr: 5.33 ± 0.835
4.094LeuVal: 4.094 ± 0.69
0.618LeuTrp: 0.618 ± 0.197
2.317LeuTyr: 2.317 ± 0.562
0.0LeuXaa: 0.0 ± 0.0
Met
2.395MetAla: 2.395 ± 0.586
0.232MetCys: 0.232 ± 0.132
1.236MetAsp: 1.236 ± 0.474
1.081MetGlu: 1.081 ± 0.444
1.081MetPhe: 1.081 ± 0.241
1.699MetGly: 1.699 ± 0.593
0.386MetHis: 0.386 ± 0.151
2.472MetIle: 2.472 ± 0.515
2.395MetLys: 2.395 ± 0.443
2.008MetLeu: 2.008 ± 0.435
1.236MetMet: 1.236 ± 0.367
1.313MetAsn: 1.313 ± 0.315
0.386MetPro: 0.386 ± 0.172
1.081MetGln: 1.081 ± 0.342
3.013MetArg: 3.013 ± 0.658
2.395MetSer: 2.395 ± 0.465
2.24MetThr: 2.24 ± 0.475
1.699MetVal: 1.699 ± 0.394
0.309MetTrp: 0.309 ± 0.142
0.927MetTyr: 0.927 ± 0.411
0.0MetXaa: 0.0 ± 0.0
Asn
3.399AsnAla: 3.399 ± 0.655
0.927AsnCys: 0.927 ± 0.434
2.472AsnAsp: 2.472 ± 0.599
2.472AsnGlu: 2.472 ± 0.483
1.468AsnPhe: 1.468 ± 0.465
3.553AsnGly: 3.553 ± 0.59
0.309AsnHis: 0.309 ± 0.141
2.858AsnIle: 2.858 ± 0.514
3.708AsnLys: 3.708 ± 0.517
1.854AsnLeu: 1.854 ± 0.389
0.85AsnMet: 0.85 ± 0.305
2.086AsnAsn: 2.086 ± 0.417
1.468AsnPro: 1.468 ± 0.443
2.008AsnGln: 2.008 ± 0.438
3.167AsnArg: 3.167 ± 0.557
3.708AsnSer: 3.708 ± 0.716
2.549AsnThr: 2.549 ± 0.632
4.248AsnVal: 4.248 ± 0.71
1.081AsnTrp: 1.081 ± 0.314
1.777AsnTyr: 1.777 ± 0.431
0.0AsnXaa: 0.0 ± 0.0
Pro
2.24ProAla: 2.24 ± 0.517
0.618ProCys: 0.618 ± 0.226
3.013ProAsp: 3.013 ± 0.678
3.013ProGlu: 3.013 ± 0.753
1.622ProPhe: 1.622 ± 0.438
0.618ProGly: 0.618 ± 0.237
0.772ProHis: 0.772 ± 0.369
2.163ProIle: 2.163 ± 0.468
1.236ProLys: 1.236 ± 0.322
2.472ProLeu: 2.472 ± 0.454
0.232ProMet: 0.232 ± 0.135
2.086ProAsn: 2.086 ± 0.506
1.004ProPro: 1.004 ± 0.332
1.004ProGln: 1.004 ± 0.309
1.545ProArg: 1.545 ± 0.411
1.777ProSer: 1.777 ± 0.395
1.468ProThr: 1.468 ± 0.422
3.013ProVal: 3.013 ± 0.575
0.232ProTrp: 0.232 ± 0.138
1.236ProTyr: 1.236 ± 0.302
0.0ProXaa: 0.0 ± 0.0
Gln
1.854GlnAla: 1.854 ± 0.374
1.081GlnCys: 1.081 ± 0.347
1.777GlnAsp: 1.777 ± 0.498
2.24GlnGlu: 2.24 ± 0.55
1.622GlnPhe: 1.622 ± 0.487
0.772GlnGly: 0.772 ± 0.271
0.463GlnHis: 0.463 ± 0.194
3.476GlnIle: 3.476 ± 0.535
3.553GlnLys: 3.553 ± 0.86
3.244GlnLeu: 3.244 ± 0.683
0.927GlnMet: 0.927 ± 0.256
1.854GlnAsn: 1.854 ± 0.482
1.236GlnPro: 1.236 ± 0.438
1.854GlnGln: 1.854 ± 0.416
3.167GlnArg: 3.167 ± 0.655
2.704GlnSer: 2.704 ± 0.584
1.081GlnThr: 1.081 ± 0.342
2.472GlnVal: 2.472 ± 0.541
0.309GlnTrp: 0.309 ± 0.166
1.159GlnTyr: 1.159 ± 0.334
0.0GlnXaa: 0.0 ± 0.0
Arg
4.017ArgAla: 4.017 ± 0.655
1.313ArgCys: 1.313 ± 0.497
3.321ArgAsp: 3.321 ± 0.717
3.63ArgGlu: 3.63 ± 0.625
2.24ArgPhe: 2.24 ± 0.484
4.094ArgGly: 4.094 ± 1.015
0.85ArgHis: 0.85 ± 0.258
3.244ArgIle: 3.244 ± 0.596
7.415ArgLys: 7.415 ± 1.036
5.175ArgLeu: 5.175 ± 1.127
1.777ArgMet: 1.777 ± 0.431
2.472ArgAsn: 2.472 ± 0.54
1.081ArgPro: 1.081 ± 0.281
1.39ArgGln: 1.39 ± 0.39
4.326ArgArg: 4.326 ± 0.722
3.399ArgSer: 3.399 ± 0.578
2.395ArgThr: 2.395 ± 0.479
4.248ArgVal: 4.248 ± 0.806
0.618ArgTrp: 0.618 ± 0.246
3.013ArgTyr: 3.013 ± 0.714
0.0ArgXaa: 0.0 ± 0.0
Ser
4.248SerAla: 4.248 ± 0.577
1.236SerCys: 1.236 ± 0.414
4.171SerAsp: 4.171 ± 0.789
4.48SerGlu: 4.48 ± 0.688
2.24SerPhe: 2.24 ± 0.462
5.484SerGly: 5.484 ± 0.744
1.236SerHis: 1.236 ± 0.423
4.094SerIle: 4.094 ± 0.726
3.862SerLys: 3.862 ± 0.576
4.789SerLeu: 4.789 ± 0.646
1.236SerMet: 1.236 ± 0.348
2.008SerAsn: 2.008 ± 0.345
2.472SerPro: 2.472 ± 0.491
2.472SerGln: 2.472 ± 0.708
3.553SerArg: 3.553 ± 0.605
2.781SerSer: 2.781 ± 0.777
3.785SerThr: 3.785 ± 0.552
5.098SerVal: 5.098 ± 0.716
0.695SerTrp: 0.695 ± 0.2
2.549SerTyr: 2.549 ± 0.534
0.0SerXaa: 0.0 ± 0.0
Thr
6.102ThrAla: 6.102 ± 0.866
1.004ThrCys: 1.004 ± 0.36
2.935ThrAsp: 2.935 ± 0.625
3.244ThrGlu: 3.244 ± 0.535
3.399ThrPhe: 3.399 ± 0.712
5.484ThrGly: 5.484 ± 0.527
1.545ThrHis: 1.545 ± 0.397
3.09ThrIle: 3.09 ± 0.496
2.781ThrLys: 2.781 ± 0.898
4.789ThrLeu: 4.789 ± 0.73
1.39ThrMet: 1.39 ± 0.379
3.013ThrAsn: 3.013 ± 0.485
2.472ThrPro: 2.472 ± 0.573
1.622ThrGln: 1.622 ± 0.447
2.858ThrArg: 2.858 ± 0.653
2.781ThrSer: 2.781 ± 0.397
2.395ThrThr: 2.395 ± 0.639
4.557ThrVal: 4.557 ± 0.773
0.618ThrTrp: 0.618 ± 0.285
1.545ThrTyr: 1.545 ± 0.413
0.0ThrXaa: 0.0 ± 0.0
Val
3.862ValAla: 3.862 ± 0.724
1.699ValCys: 1.699 ± 0.509
3.862ValAsp: 3.862 ± 0.92
5.871ValGlu: 5.871 ± 0.711
2.24ValPhe: 2.24 ± 0.479
3.476ValGly: 3.476 ± 0.562
0.541ValHis: 0.541 ± 0.16
6.488ValIle: 6.488 ± 1.086
5.484ValLys: 5.484 ± 0.824
5.098ValLeu: 5.098 ± 0.957
1.854ValMet: 1.854 ± 0.434
3.321ValAsn: 3.321 ± 0.595
1.931ValPro: 1.931 ± 0.531
2.163ValGln: 2.163 ± 0.504
4.403ValArg: 4.403 ± 0.67
4.944ValSer: 4.944 ± 0.675
4.712ValThr: 4.712 ± 0.837
4.557ValVal: 4.557 ± 0.766
0.695ValTrp: 0.695 ± 0.243
1.699ValTyr: 1.699 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
0.772TrpAla: 0.772 ± 0.314
0.463TrpCys: 0.463 ± 0.234
0.463TrpAsp: 0.463 ± 0.176
0.541TrpGlu: 0.541 ± 0.362
0.618TrpPhe: 0.618 ± 0.242
0.85TrpGly: 0.85 ± 0.363
0.463TrpHis: 0.463 ± 0.195
1.468TrpIle: 1.468 ± 0.42
1.159TrpLys: 1.159 ± 0.371
1.159TrpLeu: 1.159 ± 0.342
0.85TrpMet: 0.85 ± 0.311
0.85TrpAsn: 0.85 ± 0.369
0.232TrpPro: 0.232 ± 0.13
0.232TrpGln: 0.232 ± 0.112
1.468TrpArg: 1.468 ± 0.43
0.85TrpSer: 0.85 ± 0.336
0.618TrpThr: 0.618 ± 0.255
0.618TrpVal: 0.618 ± 0.196
0.232TrpTrp: 0.232 ± 0.137
0.541TrpTyr: 0.541 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.321TyrAla: 3.321 ± 0.649
0.618TyrCys: 0.618 ± 0.291
2.858TyrAsp: 2.858 ± 0.515
2.163TyrGlu: 2.163 ± 0.462
1.699TyrPhe: 1.699 ± 0.511
1.854TyrGly: 1.854 ± 0.522
0.618TyrHis: 0.618 ± 0.222
2.626TyrIle: 2.626 ± 0.651
2.086TyrLys: 2.086 ± 0.48
1.777TyrLeu: 1.777 ± 0.349
0.772TyrMet: 0.772 ± 0.26
1.236TyrAsn: 1.236 ± 0.409
1.545TyrPro: 1.545 ± 0.415
2.008TyrGln: 2.008 ± 0.577
2.163TyrArg: 2.163 ± 0.546
3.244TyrSer: 3.244 ± 0.584
2.24TyrThr: 2.24 ± 0.418
2.472TyrVal: 2.472 ± 0.533
0.927TyrTrp: 0.927 ± 0.425
0.772TyrTyr: 0.772 ± 0.232
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (12947 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski