Amino acid dipepetide frequency for Caldibacillus phage CBP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.325AlaAla: 4.325 ± 0.809
0.688AlaCys: 0.688 ± 0.253
3.833AlaAsp: 3.833 ± 0.723
4.62AlaGlu: 4.62 ± 0.775
2.457AlaPhe: 2.457 ± 0.583
2.949AlaGly: 2.949 ± 0.571
0.59AlaHis: 0.59 ± 0.222
4.816AlaIle: 4.816 ± 0.709
5.897AlaLys: 5.897 ± 0.797
5.504AlaLeu: 5.504 ± 0.988
1.868AlaMet: 1.868 ± 0.499
4.423AlaAsn: 4.423 ± 0.655
2.064AlaPro: 2.064 ± 0.54
2.064AlaGln: 2.064 ± 0.427
3.44AlaArg: 3.44 ± 0.588
2.654AlaSer: 2.654 ± 0.704
2.752AlaThr: 2.752 ± 0.523
3.637AlaVal: 3.637 ± 0.556
0.688AlaTrp: 0.688 ± 0.281
2.064AlaTyr: 2.064 ± 0.443
0.0AlaXaa: 0.0 ± 0.0
Cys
0.393CysAla: 0.393 ± 0.201
0.0CysCys: 0.0 ± 0.0
0.393CysAsp: 0.393 ± 0.194
0.885CysGlu: 0.885 ± 0.355
0.098CysPhe: 0.098 ± 0.101
1.081CysGly: 1.081 ± 0.32
0.295CysHis: 0.295 ± 0.187
0.59CysIle: 0.59 ± 0.23
0.885CysLys: 0.885 ± 0.302
0.885CysLeu: 0.885 ± 0.337
0.393CysMet: 0.393 ± 0.204
0.098CysAsn: 0.098 ± 0.089
0.393CysPro: 0.393 ± 0.208
0.393CysGln: 0.393 ± 0.179
0.59CysArg: 0.59 ± 0.218
0.786CysSer: 0.786 ± 0.33
0.197CysThr: 0.197 ± 0.146
0.688CysVal: 0.688 ± 0.216
0.197CysTrp: 0.197 ± 0.127
0.295CysTyr: 0.295 ± 0.24
0.0CysXaa: 0.0 ± 0.0
Asp
3.637AspAla: 3.637 ± 0.591
0.295AspCys: 0.295 ± 0.16
4.718AspAsp: 4.718 ± 0.757
6.291AspGlu: 6.291 ± 0.996
2.949AspPhe: 2.949 ± 0.526
4.718AspGly: 4.718 ± 0.686
0.491AspHis: 0.491 ± 0.201
5.406AspIle: 5.406 ± 0.802
4.62AspLys: 4.62 ± 0.843
4.816AspLeu: 4.816 ± 0.671
1.474AspMet: 1.474 ± 0.399
1.474AspAsn: 1.474 ± 0.423
2.556AspPro: 2.556 ± 0.53
1.868AspGln: 1.868 ± 0.369
2.949AspArg: 2.949 ± 0.534
1.769AspSer: 1.769 ± 0.514
2.064AspThr: 2.064 ± 0.388
3.145AspVal: 3.145 ± 0.545
0.59AspTrp: 0.59 ± 0.233
2.261AspTyr: 2.261 ± 0.508
0.0AspXaa: 0.0 ± 0.0
Glu
5.504GluAla: 5.504 ± 0.741
0.491GluCys: 0.491 ± 0.21
3.637GluAsp: 3.637 ± 0.687
8.06GluGlu: 8.06 ± 0.94
3.145GluPhe: 3.145 ± 0.556
5.209GluGly: 5.209 ± 0.756
1.671GluHis: 1.671 ± 0.437
7.667GluIle: 7.667 ± 0.999
8.551GluLys: 8.551 ± 0.853
9.534GluLeu: 9.534 ± 0.89
2.064GluMet: 2.064 ± 0.38
5.209GluAsn: 5.209 ± 0.743
3.145GluPro: 3.145 ± 0.512
3.932GluGln: 3.932 ± 0.573
4.423GluArg: 4.423 ± 0.746
4.816GluSer: 4.816 ± 0.56
4.128GluThr: 4.128 ± 0.607
5.996GluVal: 5.996 ± 0.926
1.179GluTrp: 1.179 ± 0.305
3.637GluTyr: 3.637 ± 0.514
0.0GluXaa: 0.0 ± 0.0
Phe
2.162PheAla: 2.162 ± 0.505
0.393PheCys: 0.393 ± 0.173
2.752PheAsp: 2.752 ± 0.495
3.833PheGlu: 3.833 ± 0.634
1.474PhePhe: 1.474 ± 0.411
4.128PheGly: 4.128 ± 0.495
0.983PheHis: 0.983 ± 0.292
2.752PheIle: 2.752 ± 0.553
3.342PheLys: 3.342 ± 0.632
3.637PheLeu: 3.637 ± 0.511
1.376PheMet: 1.376 ± 0.342
2.359PheAsn: 2.359 ± 0.492
0.885PhePro: 0.885 ± 0.246
1.769PheGln: 1.769 ± 0.402
1.573PheArg: 1.573 ± 0.338
1.573PheSer: 1.573 ± 0.453
2.261PheThr: 2.261 ± 0.667
2.359PheVal: 2.359 ± 0.496
0.491PheTrp: 0.491 ± 0.213
0.885PheTyr: 0.885 ± 0.34
0.0PheXaa: 0.0 ± 0.0
Gly
2.457GlyAla: 2.457 ± 0.443
0.688GlyCys: 0.688 ± 0.29
4.423GlyAsp: 4.423 ± 0.831
5.308GlyGlu: 5.308 ± 0.892
2.85GlyPhe: 2.85 ± 0.543
4.226GlyGly: 4.226 ± 0.763
1.081GlyHis: 1.081 ± 0.338
5.504GlyIle: 5.504 ± 0.718
5.111GlyLys: 5.111 ± 0.615
5.897GlyLeu: 5.897 ± 0.751
1.769GlyMet: 1.769 ± 0.515
3.145GlyAsn: 3.145 ± 0.464
1.573GlyPro: 1.573 ± 0.327
2.064GlyGln: 2.064 ± 0.526
2.949GlyArg: 2.949 ± 0.555
2.85GlySer: 2.85 ± 0.563
3.047GlyThr: 3.047 ± 0.5
3.932GlyVal: 3.932 ± 0.587
0.59GlyTrp: 0.59 ± 0.226
3.538GlyTyr: 3.538 ± 0.465
0.0GlyXaa: 0.0 ± 0.0
His
1.081HisAla: 1.081 ± 0.33
0.098HisCys: 0.098 ± 0.094
0.885HisAsp: 0.885 ± 0.276
1.769HisGlu: 1.769 ± 0.486
0.786HisPhe: 0.786 ± 0.257
0.786HisGly: 0.786 ± 0.306
0.393HisHis: 0.393 ± 0.167
0.885HisIle: 0.885 ± 0.311
1.278HisLys: 1.278 ± 0.419
1.278HisLeu: 1.278 ± 0.304
0.393HisMet: 0.393 ± 0.185
0.688HisAsn: 0.688 ± 0.254
0.885HisPro: 0.885 ± 0.284
0.491HisGln: 0.491 ± 0.206
0.885HisArg: 0.885 ± 0.323
1.081HisSer: 1.081 ± 0.323
0.295HisThr: 0.295 ± 0.175
1.278HisVal: 1.278 ± 0.343
0.295HisTrp: 0.295 ± 0.2
0.491HisTyr: 0.491 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
5.111IleAla: 5.111 ± 0.768
1.081IleCys: 1.081 ± 0.301
4.423IleAsp: 4.423 ± 0.769
7.765IleGlu: 7.765 ± 1.091
3.244IlePhe: 3.244 ± 0.699
5.111IleGly: 5.111 ± 0.83
1.278IleHis: 1.278 ± 0.307
6.487IleIle: 6.487 ± 0.927
7.077IleLys: 7.077 ± 0.751
5.799IleLeu: 5.799 ± 0.702
1.671IleMet: 1.671 ± 0.397
4.226IleAsn: 4.226 ± 0.705
3.342IlePro: 3.342 ± 0.473
1.868IleGln: 1.868 ± 0.482
4.62IleArg: 4.62 ± 0.749
4.62IleSer: 4.62 ± 0.751
2.752IleThr: 2.752 ± 0.436
4.226IleVal: 4.226 ± 0.493
0.885IleTrp: 0.885 ± 0.301
3.342IleTyr: 3.342 ± 0.583
0.0IleXaa: 0.0 ± 0.0
Lys
6.094LysAla: 6.094 ± 0.743
0.983LysCys: 0.983 ± 0.297
5.406LysAsp: 5.406 ± 0.725
10.124LysGlu: 10.124 ± 0.919
2.949LysPhe: 2.949 ± 0.538
4.325LysGly: 4.325 ± 0.584
1.474LysHis: 1.474 ± 0.395
7.568LysIle: 7.568 ± 0.733
9.534LysLys: 9.534 ± 1.205
6.192LysLeu: 6.192 ± 0.714
2.752LysMet: 2.752 ± 0.499
4.914LysAsn: 4.914 ± 0.858
3.44LysPro: 3.44 ± 0.647
3.932LysGln: 3.932 ± 0.699
5.603LysArg: 5.603 ± 0.7
3.637LysSer: 3.637 ± 0.495
4.718LysThr: 4.718 ± 0.622
4.128LysVal: 4.128 ± 0.565
1.376LysTrp: 1.376 ± 0.398
2.457LysTyr: 2.457 ± 0.632
0.0LysXaa: 0.0 ± 0.0
Leu
5.701LeuAla: 5.701 ± 0.795
0.786LeuCys: 0.786 ± 0.292
5.308LeuAsp: 5.308 ± 0.785
7.863LeuGlu: 7.863 ± 0.92
5.013LeuPhe: 5.013 ± 0.885
4.325LeuGly: 4.325 ± 0.65
1.179LeuHis: 1.179 ± 0.332
5.603LeuIle: 5.603 ± 0.88
8.256LeuLys: 8.256 ± 0.852
5.701LeuLeu: 5.701 ± 0.672
1.868LeuMet: 1.868 ± 0.488
3.932LeuAsn: 3.932 ± 0.65
2.85LeuPro: 2.85 ± 0.659
2.752LeuGln: 2.752 ± 0.577
3.833LeuArg: 3.833 ± 0.605
4.816LeuSer: 4.816 ± 0.586
3.833LeuThr: 3.833 ± 0.567
4.62LeuVal: 4.62 ± 0.591
0.688LeuTrp: 0.688 ± 0.302
3.145LeuTyr: 3.145 ± 0.692
0.0LeuXaa: 0.0 ± 0.0
Met
2.85MetAla: 2.85 ± 0.518
0.295MetCys: 0.295 ± 0.182
1.278MetAsp: 1.278 ± 0.392
2.162MetGlu: 2.162 ± 0.526
0.295MetPhe: 0.295 ± 0.143
2.752MetGly: 2.752 ± 0.486
0.393MetHis: 0.393 ± 0.193
1.868MetIle: 1.868 ± 0.418
1.966MetLys: 1.966 ± 0.424
1.573MetLeu: 1.573 ± 0.349
0.491MetMet: 0.491 ± 0.191
1.868MetAsn: 1.868 ± 0.501
0.786MetPro: 0.786 ± 0.26
0.59MetGln: 0.59 ± 0.247
1.081MetArg: 1.081 ± 0.309
0.983MetSer: 0.983 ± 0.358
1.474MetThr: 1.474 ± 0.413
2.162MetVal: 2.162 ± 0.434
0.295MetTrp: 0.295 ± 0.162
0.59MetTyr: 0.59 ± 0.268
0.0MetXaa: 0.0 ± 0.0
Asn
2.85AsnAla: 2.85 ± 0.586
0.59AsnCys: 0.59 ± 0.253
2.261AsnAsp: 2.261 ± 0.407
5.603AsnGlu: 5.603 ± 0.779
1.769AsnPhe: 1.769 ± 0.365
4.325AsnGly: 4.325 ± 0.903
1.179AsnHis: 1.179 ± 0.339
3.833AsnIle: 3.833 ± 0.615
3.342AsnLys: 3.342 ± 0.543
3.637AsnLeu: 3.637 ± 0.571
1.671AsnMet: 1.671 ± 0.446
3.047AsnAsn: 3.047 ± 0.602
2.556AsnPro: 2.556 ± 0.613
2.457AsnGln: 2.457 ± 0.507
2.261AsnArg: 2.261 ± 0.546
1.966AsnSer: 1.966 ± 0.387
2.457AsnThr: 2.457 ± 0.525
3.44AsnVal: 3.44 ± 0.511
0.688AsnTrp: 0.688 ± 0.229
1.769AsnTyr: 1.769 ± 0.402
0.0AsnXaa: 0.0 ± 0.0
Pro
1.769ProAla: 1.769 ± 0.501
0.491ProCys: 0.491 ± 0.263
2.261ProAsp: 2.261 ± 0.487
2.752ProGlu: 2.752 ± 0.459
1.769ProPhe: 1.769 ± 0.429
2.654ProGly: 2.654 ± 0.491
0.295ProHis: 0.295 ± 0.158
2.556ProIle: 2.556 ± 0.57
4.521ProLys: 4.521 ± 0.786
2.556ProLeu: 2.556 ± 0.501
0.688ProMet: 0.688 ± 0.226
2.162ProAsn: 2.162 ± 0.518
1.081ProPro: 1.081 ± 0.322
0.786ProGln: 0.786 ± 0.267
2.162ProArg: 2.162 ± 0.539
1.474ProSer: 1.474 ± 0.384
1.671ProThr: 1.671 ± 0.449
2.359ProVal: 2.359 ± 0.5
0.59ProTrp: 0.59 ± 0.212
1.966ProTyr: 1.966 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
1.966GlnAla: 1.966 ± 0.4
0.295GlnCys: 0.295 ± 0.148
1.573GlnAsp: 1.573 ± 0.364
3.244GlnGlu: 3.244 ± 0.575
1.671GlnPhe: 1.671 ± 0.328
1.868GlnGly: 1.868 ± 0.388
0.393GlnHis: 0.393 ± 0.204
3.833GlnIle: 3.833 ± 0.468
3.145GlnLys: 3.145 ± 0.586
4.03GlnLeu: 4.03 ± 0.734
0.688GlnMet: 0.688 ± 0.232
2.064GlnAsn: 2.064 ± 0.477
0.983GlnPro: 0.983 ± 0.338
1.376GlnGln: 1.376 ± 0.335
2.556GlnArg: 2.556 ± 0.336
1.376GlnSer: 1.376 ± 0.278
2.359GlnThr: 2.359 ± 0.503
1.573GlnVal: 1.573 ± 0.378
0.59GlnTrp: 0.59 ± 0.221
1.179GlnTyr: 1.179 ± 0.369
0.0GlnXaa: 0.0 ± 0.0
Arg
1.966ArgAla: 1.966 ± 0.48
0.393ArgCys: 0.393 ± 0.222
3.145ArgAsp: 3.145 ± 0.546
6.291ArgGlu: 6.291 ± 0.846
1.573ArgPhe: 1.573 ± 0.419
3.047ArgGly: 3.047 ± 0.553
0.59ArgHis: 0.59 ± 0.231
4.816ArgIle: 4.816 ± 0.657
7.47ArgLys: 7.47 ± 0.833
4.718ArgLeu: 4.718 ± 0.697
1.573ArgMet: 1.573 ± 0.382
1.671ArgAsn: 1.671 ± 0.407
1.868ArgPro: 1.868 ± 0.495
2.752ArgGln: 2.752 ± 0.584
3.637ArgArg: 3.637 ± 0.66
1.474ArgSer: 1.474 ± 0.354
2.064ArgThr: 2.064 ± 0.482
2.654ArgVal: 2.654 ± 0.601
0.59ArgTrp: 0.59 ± 0.227
2.064ArgTyr: 2.064 ± 0.477
0.0ArgXaa: 0.0 ± 0.0
Ser
3.145SerAla: 3.145 ± 0.543
0.098SerCys: 0.098 ± 0.105
3.145SerAsp: 3.145 ± 0.621
3.932SerGlu: 3.932 ± 0.576
3.047SerPhe: 3.047 ± 0.482
2.752SerGly: 2.752 ± 0.634
0.983SerHis: 0.983 ± 0.32
3.342SerIle: 3.342 ± 0.554
3.244SerLys: 3.244 ± 0.631
3.047SerLeu: 3.047 ± 0.572
1.278SerMet: 1.278 ± 0.301
2.261SerAsn: 2.261 ± 0.381
1.769SerPro: 1.769 ± 0.379
1.671SerGln: 1.671 ± 0.364
2.85SerArg: 2.85 ± 0.476
3.047SerSer: 3.047 ± 0.603
1.474SerThr: 1.474 ± 0.41
3.538SerVal: 3.538 ± 0.494
0.59SerTrp: 0.59 ± 0.212
2.064SerTyr: 2.064 ± 0.474
0.0SerXaa: 0.0 ± 0.0
Thr
3.244ThrAla: 3.244 ± 0.639
0.0ThrCys: 0.0 ± 0.0
2.654ThrAsp: 2.654 ± 0.457
3.047ThrGlu: 3.047 ± 0.525
1.573ThrPhe: 1.573 ± 0.4
3.047ThrGly: 3.047 ± 0.561
0.688ThrHis: 0.688 ± 0.268
3.932ThrIle: 3.932 ± 0.724
4.03ThrLys: 4.03 ± 0.798
3.735ThrLeu: 3.735 ± 0.47
1.179ThrMet: 1.179 ± 0.357
2.064ThrAsn: 2.064 ± 0.456
1.769ThrPro: 1.769 ± 0.434
2.261ThrGln: 2.261 ± 0.486
2.261ThrArg: 2.261 ± 0.455
2.752ThrSer: 2.752 ± 0.507
2.752ThrThr: 2.752 ± 0.426
3.047ThrVal: 3.047 ± 0.577
0.983ThrTrp: 0.983 ± 0.296
1.671ThrTyr: 1.671 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
4.325ValAla: 4.325 ± 0.65
1.081ValCys: 1.081 ± 0.317
3.244ValAsp: 3.244 ± 0.602
4.325ValGlu: 4.325 ± 0.698
2.064ValPhe: 2.064 ± 0.43
2.261ValGly: 2.261 ± 0.376
1.081ValHis: 1.081 ± 0.34
4.521ValIle: 4.521 ± 0.732
5.209ValLys: 5.209 ± 0.697
5.013ValLeu: 5.013 ± 0.782
1.081ValMet: 1.081 ± 0.299
2.949ValAsn: 2.949 ± 0.48
2.359ValPro: 2.359 ± 0.546
2.261ValGln: 2.261 ± 0.467
3.637ValArg: 3.637 ± 0.51
2.85ValSer: 2.85 ± 0.598
3.735ValThr: 3.735 ± 0.571
2.752ValVal: 2.752 ± 0.619
0.688ValTrp: 0.688 ± 0.238
2.457ValTyr: 2.457 ± 0.393
0.0ValXaa: 0.0 ± 0.0
Trp
0.491TrpAla: 0.491 ± 0.181
0.197TrpCys: 0.197 ± 0.15
0.393TrpAsp: 0.393 ± 0.2
1.081TrpGlu: 1.081 ± 0.302
0.491TrpPhe: 0.491 ± 0.216
0.786TrpGly: 0.786 ± 0.336
0.098TrpHis: 0.098 ± 0.101
0.983TrpIle: 0.983 ± 0.254
1.081TrpLys: 1.081 ± 0.352
1.671TrpLeu: 1.671 ± 0.409
0.197TrpMet: 0.197 ± 0.128
1.179TrpAsn: 1.179 ± 0.35
0.295TrpPro: 0.295 ± 0.169
0.295TrpGln: 0.295 ± 0.175
0.688TrpArg: 0.688 ± 0.227
0.885TrpSer: 0.885 ± 0.259
0.688TrpThr: 0.688 ± 0.254
0.688TrpVal: 0.688 ± 0.245
0.0TrpTrp: 0.0 ± 0.0
0.393TrpTyr: 0.393 ± 0.195
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.261TyrAla: 2.261 ± 0.459
0.59TyrCys: 0.59 ± 0.245
2.359TyrAsp: 2.359 ± 0.563
3.047TyrGlu: 3.047 ± 0.575
1.769TyrPhe: 1.769 ± 0.417
2.654TyrGly: 2.654 ± 0.543
0.885TyrHis: 0.885 ± 0.303
1.966TyrIle: 1.966 ± 0.471
3.047TyrLys: 3.047 ± 0.498
2.85TyrLeu: 2.85 ± 0.426
1.179TyrMet: 1.179 ± 0.297
1.966TyrAsn: 1.966 ± 0.533
1.966TyrPro: 1.966 ± 0.36
1.278TyrGln: 1.278 ± 0.332
2.359TyrArg: 2.359 ± 0.475
1.868TyrSer: 1.868 ± 0.478
1.966TyrThr: 1.966 ± 0.486
1.671TyrVal: 1.671 ± 0.326
0.59TyrTrp: 0.59 ± 0.249
0.59TyrTyr: 0.59 ± 0.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (10175 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski