Amino acid dipepetide frequency for Escherichia phage C130_2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.121AlaAla: 14.121 ± 2.616
1.228AlaCys: 1.228 ± 0.293
6.523AlaAsp: 6.523 ± 0.87
6.907AlaGlu: 6.907 ± 0.918
3.3AlaPhe: 3.3 ± 0.47
9.056AlaGly: 9.056 ± 0.826
1.995AlaHis: 1.995 ± 0.418
6.216AlaIle: 6.216 ± 0.732
6.293AlaLys: 6.293 ± 0.673
8.058AlaLeu: 8.058 ± 0.975
3.3AlaMet: 3.3 ± 0.558
4.298AlaAsn: 4.298 ± 0.585
3.377AlaPro: 3.377 ± 0.779
5.142AlaGln: 5.142 ± 0.655
6.754AlaArg: 6.754 ± 0.821
5.756AlaSer: 5.756 ± 0.692
6.447AlaThr: 6.447 ± 1.026
6.447AlaVal: 6.447 ± 0.666
1.842AlaTrp: 1.842 ± 0.339
3.377AlaTyr: 3.377 ± 0.497
0.0AlaXaa: 0.0 ± 0.0
Cys
0.998CysAla: 0.998 ± 0.354
0.077CysCys: 0.077 ± 0.07
0.691CysAsp: 0.691 ± 0.328
0.767CysGlu: 0.767 ± 0.256
0.46CysPhe: 0.46 ± 0.211
1.612CysGly: 1.612 ± 0.464
0.46CysHis: 0.46 ± 0.252
0.23CysIle: 0.23 ± 0.122
0.921CysLys: 0.921 ± 0.432
0.384CysLeu: 0.384 ± 0.144
0.46CysMet: 0.46 ± 0.162
0.537CysAsn: 0.537 ± 0.27
0.153CysPro: 0.153 ± 0.098
0.307CysGln: 0.307 ± 0.16
1.228CysArg: 1.228 ± 0.367
0.537CysSer: 0.537 ± 0.194
0.537CysThr: 0.537 ± 0.25
0.998CysVal: 0.998 ± 0.232
0.23CysTrp: 0.23 ± 0.142
0.077CysTyr: 0.077 ± 0.072
0.0CysXaa: 0.0 ± 0.0
Asp
7.675AspAla: 7.675 ± 0.808
0.767AspCys: 0.767 ± 0.341
3.454AspAsp: 3.454 ± 0.484
3.454AspGlu: 3.454 ± 0.536
1.765AspPhe: 1.765 ± 0.351
5.756AspGly: 5.756 ± 0.673
1.688AspHis: 1.688 ± 0.362
2.993AspIle: 2.993 ± 0.401
3.223AspLys: 3.223 ± 0.439
5.142AspLeu: 5.142 ± 0.647
1.765AspMet: 1.765 ± 0.329
2.84AspAsn: 2.84 ± 0.507
4.298AspPro: 4.298 ± 0.509
3.377AspGln: 3.377 ± 0.41
3.914AspArg: 3.914 ± 0.606
2.686AspSer: 2.686 ± 0.513
1.995AspThr: 1.995 ± 0.414
3.377AspVal: 3.377 ± 0.427
1.151AspTrp: 1.151 ± 0.298
2.533AspTyr: 2.533 ± 0.438
0.0AspXaa: 0.0 ± 0.0
Glu
6.754GluAla: 6.754 ± 0.924
0.691GluCys: 0.691 ± 0.267
3.607GluAsp: 3.607 ± 0.761
4.375GluGlu: 4.375 ± 0.814
1.995GluPhe: 1.995 ± 0.479
4.682GluGly: 4.682 ± 0.54
1.381GluHis: 1.381 ± 0.35
3.454GluIle: 3.454 ± 0.543
3.607GluLys: 3.607 ± 0.63
4.605GluLeu: 4.605 ± 0.506
1.458GluMet: 1.458 ± 0.268
2.226GluAsn: 2.226 ± 0.421
1.151GluPro: 1.151 ± 0.382
3.454GluGln: 3.454 ± 0.728
3.454GluArg: 3.454 ± 0.576
3.914GluSer: 3.914 ± 0.548
3.223GluThr: 3.223 ± 0.554
3.684GluVal: 3.684 ± 0.524
1.151GluTrp: 1.151 ± 0.269
3.223GluTyr: 3.223 ± 0.506
0.0GluXaa: 0.0 ± 0.0
Phe
3.761PheAla: 3.761 ± 0.584
0.921PheCys: 0.921 ± 0.254
3.53PheAsp: 3.53 ± 0.472
1.919PheGlu: 1.919 ± 0.401
0.844PhePhe: 0.844 ± 0.217
2.456PheGly: 2.456 ± 0.407
0.691PheHis: 0.691 ± 0.299
1.535PheIle: 1.535 ± 0.279
1.151PheLys: 1.151 ± 0.301
2.379PheLeu: 2.379 ± 0.358
1.305PheMet: 1.305 ± 0.354
1.688PheAsn: 1.688 ± 0.301
0.921PhePro: 0.921 ± 0.277
0.691PheGln: 0.691 ± 0.196
1.919PheArg: 1.919 ± 0.406
1.612PheSer: 1.612 ± 0.277
1.995PheThr: 1.995 ± 0.485
1.919PheVal: 1.919 ± 0.431
0.384PheTrp: 0.384 ± 0.164
0.844PheTyr: 0.844 ± 0.238
0.0PheXaa: 0.0 ± 0.0
Gly
8.058GlyAla: 8.058 ± 0.966
1.228GlyCys: 1.228 ± 0.313
4.758GlyAsp: 4.758 ± 0.61
5.142GlyGlu: 5.142 ± 0.696
2.686GlyPhe: 2.686 ± 0.455
7.214GlyGly: 7.214 ± 0.887
1.535GlyHis: 1.535 ± 0.515
5.295GlyIle: 5.295 ± 0.83
5.449GlyLys: 5.449 ± 0.614
5.372GlyLeu: 5.372 ± 0.756
1.688GlyMet: 1.688 ± 0.375
2.686GlyAsn: 2.686 ± 0.536
3.07GlyPro: 3.07 ± 0.591
3.53GlyGln: 3.53 ± 0.73
5.219GlyArg: 5.219 ± 0.687
4.144GlySer: 4.144 ± 0.576
4.758GlyThr: 4.758 ± 0.925
5.142GlyVal: 5.142 ± 0.604
1.305GlyTrp: 1.305 ± 0.296
2.379GlyTyr: 2.379 ± 0.531
0.0GlyXaa: 0.0 ± 0.0
His
1.842HisAla: 1.842 ± 0.45
0.153HisCys: 0.153 ± 0.093
1.305HisAsp: 1.305 ± 0.378
0.998HisGlu: 0.998 ± 0.355
0.614HisPhe: 0.614 ± 0.236
1.305HisGly: 1.305 ± 0.388
0.307HisHis: 0.307 ± 0.197
0.998HisIle: 0.998 ± 0.324
0.767HisLys: 0.767 ± 0.27
1.535HisLeu: 1.535 ± 0.293
0.23HisMet: 0.23 ± 0.139
0.614HisAsn: 0.614 ± 0.224
1.228HisPro: 1.228 ± 0.422
0.384HisGln: 0.384 ± 0.152
1.612HisArg: 1.612 ± 0.349
0.691HisSer: 0.691 ± 0.275
0.998HisThr: 0.998 ± 0.248
0.921HisVal: 0.921 ± 0.281
0.844HisTrp: 0.844 ± 0.315
0.614HisTyr: 0.614 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
5.219IleAla: 5.219 ± 0.714
0.614IleCys: 0.614 ± 0.233
4.298IleAsp: 4.298 ± 0.597
4.144IleGlu: 4.144 ± 0.593
1.919IlePhe: 1.919 ± 0.335
3.607IleGly: 3.607 ± 0.583
1.305IleHis: 1.305 ± 0.295
2.302IleIle: 2.302 ± 0.503
2.533IleLys: 2.533 ± 0.619
2.533IleLeu: 2.533 ± 0.524
1.535IleMet: 1.535 ± 0.312
2.763IleAsn: 2.763 ± 0.48
3.223IlePro: 3.223 ± 0.534
2.533IleGln: 2.533 ± 0.421
3.3IleArg: 3.3 ± 0.424
2.072IleSer: 2.072 ± 0.406
3.223IleThr: 3.223 ± 0.471
3.761IleVal: 3.761 ± 0.565
0.691IleTrp: 0.691 ± 0.208
1.919IleTyr: 1.919 ± 0.365
0.0IleXaa: 0.0 ± 0.0
Lys
6.754LysAla: 6.754 ± 0.638
0.077LysCys: 0.077 ± 0.077
4.298LysAsp: 4.298 ± 0.704
3.761LysGlu: 3.761 ± 0.62
1.381LysPhe: 1.381 ± 0.294
4.221LysGly: 4.221 ± 0.494
0.691LysHis: 0.691 ± 0.241
2.226LysIle: 2.226 ± 0.382
4.298LysLys: 4.298 ± 0.646
3.223LysLeu: 3.223 ± 0.484
1.919LysMet: 1.919 ± 0.421
2.302LysAsn: 2.302 ± 0.458
2.226LysPro: 2.226 ± 0.504
2.916LysGln: 2.916 ± 0.452
4.682LysArg: 4.682 ± 0.65
2.916LysSer: 2.916 ± 0.416
2.84LysThr: 2.84 ± 0.48
2.993LysVal: 2.993 ± 0.513
1.228LysTrp: 1.228 ± 0.318
1.381LysTyr: 1.381 ± 0.303
0.0LysXaa: 0.0 ± 0.0
Leu
7.982LeuAla: 7.982 ± 1.032
1.074LeuCys: 1.074 ± 0.358
4.221LeuAsp: 4.221 ± 0.622
4.144LeuGlu: 4.144 ± 0.587
2.226LeuPhe: 2.226 ± 0.41
4.758LeuGly: 4.758 ± 0.882
0.691LeuHis: 0.691 ± 0.298
4.375LeuIle: 4.375 ± 0.421
4.221LeuLys: 4.221 ± 0.669
5.065LeuLeu: 5.065 ± 0.84
2.302LeuMet: 2.302 ± 0.375
3.761LeuAsn: 3.761 ± 0.553
3.454LeuPro: 3.454 ± 0.503
3.454LeuGln: 3.454 ± 0.552
5.295LeuArg: 5.295 ± 0.706
3.3LeuSer: 3.3 ± 0.475
4.758LeuThr: 4.758 ± 0.598
3.147LeuVal: 3.147 ± 0.469
0.998LeuTrp: 0.998 ± 0.347
1.612LeuTyr: 1.612 ± 0.36
0.0LeuXaa: 0.0 ± 0.0
Met
2.763MetAla: 2.763 ± 0.4
0.23MetCys: 0.23 ± 0.149
1.074MetAsp: 1.074 ± 0.479
1.074MetGlu: 1.074 ± 0.32
0.384MetPhe: 0.384 ± 0.161
2.379MetGly: 2.379 ± 0.342
0.23MetHis: 0.23 ± 0.138
2.226MetIle: 2.226 ± 0.498
1.305MetLys: 1.305 ± 0.359
2.072MetLeu: 2.072 ± 0.339
0.46MetMet: 0.46 ± 0.198
1.074MetAsn: 1.074 ± 0.287
1.919MetPro: 1.919 ± 0.435
1.688MetGln: 1.688 ± 0.302
2.149MetArg: 2.149 ± 0.421
1.458MetSer: 1.458 ± 0.4
1.305MetThr: 1.305 ± 0.341
1.765MetVal: 1.765 ± 0.369
0.307MetTrp: 0.307 ± 0.172
0.691MetTyr: 0.691 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
5.986AsnAla: 5.986 ± 0.603
0.23AsnCys: 0.23 ± 0.129
2.609AsnAsp: 2.609 ± 0.444
2.149AsnGlu: 2.149 ± 0.41
1.228AsnPhe: 1.228 ± 0.301
3.991AsnGly: 3.991 ± 0.711
0.691AsnHis: 0.691 ± 0.219
1.919AsnIle: 1.919 ± 0.344
2.84AsnLys: 2.84 ± 0.52
3.07AsnLeu: 3.07 ± 0.463
0.844AsnMet: 0.844 ± 0.278
1.765AsnAsn: 1.765 ± 0.432
3.07AsnPro: 3.07 ± 0.483
2.379AsnGln: 2.379 ± 0.494
2.302AsnArg: 2.302 ± 0.375
1.765AsnSer: 1.765 ± 0.354
1.995AsnThr: 1.995 ± 0.447
2.916AsnVal: 2.916 ± 0.683
0.46AsnTrp: 0.46 ± 0.166
0.767AsnTyr: 0.767 ± 0.254
0.0AsnXaa: 0.0 ± 0.0
Pro
5.065ProAla: 5.065 ± 0.548
0.537ProCys: 0.537 ± 0.224
3.684ProAsp: 3.684 ± 0.539
4.605ProGlu: 4.605 ± 0.65
1.305ProPhe: 1.305 ± 0.348
2.763ProGly: 2.763 ± 0.548
0.998ProHis: 0.998 ± 0.281
2.686ProIle: 2.686 ± 0.436
2.916ProLys: 2.916 ± 0.416
2.84ProLeu: 2.84 ± 0.386
0.921ProMet: 0.921 ± 0.333
1.151ProAsn: 1.151 ± 0.298
2.609ProPro: 2.609 ± 0.516
3.147ProGln: 3.147 ± 0.689
1.919ProArg: 1.919 ± 0.304
1.919ProSer: 1.919 ± 0.384
2.456ProThr: 2.456 ± 0.41
2.84ProVal: 2.84 ± 0.461
0.614ProTrp: 0.614 ± 0.208
2.072ProTyr: 2.072 ± 0.419
0.0ProXaa: 0.0 ± 0.0
Gln
4.988GlnAla: 4.988 ± 0.924
0.384GlnCys: 0.384 ± 0.159
3.223GlnAsp: 3.223 ± 0.52
2.609GlnGlu: 2.609 ± 0.375
1.919GlnPhe: 1.919 ± 0.387
3.147GlnGly: 3.147 ± 0.507
0.46GlnHis: 0.46 ± 0.148
2.609GlnIle: 2.609 ± 0.38
2.302GlnLys: 2.302 ± 0.38
3.914GlnLeu: 3.914 ± 0.626
1.228GlnMet: 1.228 ± 0.266
2.149GlnAsn: 2.149 ± 0.481
2.456GlnPro: 2.456 ± 0.502
5.065GlnGln: 5.065 ± 1.363
3.454GlnArg: 3.454 ± 0.537
2.533GlnSer: 2.533 ± 0.385
3.223GlnThr: 3.223 ± 0.581
2.686GlnVal: 2.686 ± 0.46
0.767GlnTrp: 0.767 ± 0.171
1.151GlnTyr: 1.151 ± 0.352
0.0GlnXaa: 0.0 ± 0.0
Arg
7.291ArgAla: 7.291 ± 0.983
0.691ArgCys: 0.691 ± 0.315
3.377ArgAsp: 3.377 ± 0.405
4.605ArgGlu: 4.605 ± 0.767
2.609ArgPhe: 2.609 ± 0.505
4.682ArgGly: 4.682 ± 0.579
0.998ArgHis: 0.998 ± 0.332
4.221ArgIle: 4.221 ± 0.494
3.377ArgLys: 3.377 ± 0.564
4.835ArgLeu: 4.835 ± 0.675
1.842ArgMet: 1.842 ± 0.355
3.3ArgAsn: 3.3 ± 0.525
2.686ArgPro: 2.686 ± 0.509
3.223ArgGln: 3.223 ± 0.584
4.375ArgArg: 4.375 ± 0.7
2.379ArgSer: 2.379 ± 0.42
2.533ArgThr: 2.533 ± 0.275
3.53ArgVal: 3.53 ± 0.513
0.998ArgTrp: 0.998 ± 0.317
1.458ArgTyr: 1.458 ± 0.306
0.0ArgXaa: 0.0 ± 0.0
Ser
4.528SerAla: 4.528 ± 0.566
0.384SerCys: 0.384 ± 0.185
3.147SerAsp: 3.147 ± 0.442
2.302SerGlu: 2.302 ± 0.405
2.072SerPhe: 2.072 ± 0.359
4.912SerGly: 4.912 ± 0.533
0.844SerHis: 0.844 ± 0.24
1.381SerIle: 1.381 ± 0.348
3.147SerLys: 3.147 ± 0.511
3.147SerLeu: 3.147 ± 0.381
1.458SerMet: 1.458 ± 0.393
2.993SerAsn: 2.993 ± 0.488
1.612SerPro: 1.612 ± 0.304
2.149SerGln: 2.149 ± 0.474
3.223SerArg: 3.223 ± 0.6
2.609SerSer: 2.609 ± 0.331
3.377SerThr: 3.377 ± 0.633
2.993SerVal: 2.993 ± 0.46
0.23SerTrp: 0.23 ± 0.137
1.151SerTyr: 1.151 ± 0.289
0.0SerXaa: 0.0 ± 0.0
Thr
6.293ThrAla: 6.293 ± 1.036
0.614ThrCys: 0.614 ± 0.231
2.916ThrAsp: 2.916 ± 0.442
3.377ThrGlu: 3.377 ± 0.456
1.842ThrPhe: 1.842 ± 0.346
5.833ThrGly: 5.833 ± 0.823
0.46ThrHis: 0.46 ± 0.196
2.84ThrIle: 2.84 ± 0.398
2.149ThrLys: 2.149 ± 0.417
4.375ThrLeu: 4.375 ± 0.649
1.535ThrMet: 1.535 ± 0.273
1.765ThrAsn: 1.765 ± 0.426
4.221ThrPro: 4.221 ± 0.64
2.533ThrGln: 2.533 ± 0.546
2.763ThrArg: 2.763 ± 0.469
2.456ThrSer: 2.456 ± 0.456
4.298ThrThr: 4.298 ± 0.456
4.528ThrVal: 4.528 ± 0.639
0.767ThrTrp: 0.767 ± 0.24
1.458ThrTyr: 1.458 ± 0.313
0.0ThrXaa: 0.0 ± 0.0
Val
6.6ValAla: 6.6 ± 0.792
0.921ValCys: 0.921 ± 0.35
3.914ValAsp: 3.914 ± 0.674
3.607ValGlu: 3.607 ± 0.513
2.533ValPhe: 2.533 ± 0.4
4.068ValGly: 4.068 ± 0.568
1.151ValHis: 1.151 ± 0.291
3.684ValIle: 3.684 ± 0.442
3.377ValLys: 3.377 ± 0.516
3.991ValLeu: 3.991 ± 0.552
1.074ValMet: 1.074 ± 0.236
3.454ValAsn: 3.454 ± 0.588
3.147ValPro: 3.147 ± 0.516
1.688ValGln: 1.688 ± 0.381
3.147ValArg: 3.147 ± 0.522
2.916ValSer: 2.916 ± 0.484
4.068ValThr: 4.068 ± 0.57
3.223ValVal: 3.223 ± 0.572
0.614ValTrp: 0.614 ± 0.203
2.533ValTyr: 2.533 ± 0.532
0.0ValXaa: 0.0 ± 0.0
Trp
1.151TrpAla: 1.151 ± 0.301
0.23TrpCys: 0.23 ± 0.127
1.074TrpAsp: 1.074 ± 0.257
0.767TrpGlu: 0.767 ± 0.199
0.537TrpPhe: 0.537 ± 0.235
1.765TrpGly: 1.765 ± 0.415
0.384TrpHis: 0.384 ± 0.164
0.767TrpIle: 0.767 ± 0.251
1.074TrpLys: 1.074 ± 0.268
1.381TrpLeu: 1.381 ± 0.31
0.23TrpMet: 0.23 ± 0.121
0.691TrpAsn: 0.691 ± 0.251
0.537TrpPro: 0.537 ± 0.18
0.537TrpGln: 0.537 ± 0.171
0.921TrpArg: 0.921 ± 0.298
0.691TrpSer: 0.691 ± 0.231
0.767TrpThr: 0.767 ± 0.247
1.228TrpVal: 1.228 ± 0.359
0.23TrpTrp: 0.23 ± 0.141
0.384TrpTyr: 0.384 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.609TyrAla: 2.609 ± 0.442
0.614TyrCys: 0.614 ± 0.247
2.149TyrAsp: 2.149 ± 0.339
1.381TyrGlu: 1.381 ± 0.262
0.844TyrPhe: 0.844 ± 0.253
2.456TyrGly: 2.456 ± 0.465
0.998TyrHis: 0.998 ± 0.232
1.535TyrIle: 1.535 ± 0.353
1.381TyrLys: 1.381 ± 0.342
2.916TyrLeu: 2.916 ± 0.4
0.921TyrMet: 0.921 ± 0.321
0.921TyrAsn: 0.921 ± 0.243
1.688TyrPro: 1.688 ± 0.417
2.072TyrGln: 2.072 ± 0.4
1.458TyrArg: 1.458 ± 0.37
1.305TyrSer: 1.305 ± 0.267
2.226TyrThr: 2.226 ± 0.436
1.612TyrVal: 1.612 ± 0.296
0.537TyrTrp: 0.537 ± 0.223
0.998TyrTyr: 0.998 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (13031 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski