Amino acid dipepetide frequency for Escherichia phage APC_JM3.2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.456AlaAla: 9.456 ± 1.49
0.681AlaCys: 0.681 ± 0.205
5.963AlaAsp: 5.963 ± 0.653
7.582AlaGlu: 7.582 ± 1.632
2.47AlaPhe: 2.47 ± 0.465
5.452AlaGly: 5.452 ± 0.96
1.278AlaHis: 1.278 ± 0.289
5.537AlaIle: 5.537 ± 0.712
6.304AlaLys: 6.304 ± 1.0
6.389AlaLeu: 6.389 ± 0.884
3.748AlaMet: 3.748 ± 0.59
5.452AlaAsn: 5.452 ± 0.882
1.959AlaPro: 1.959 ± 0.342
4.344AlaGln: 4.344 ± 0.721
5.282AlaArg: 5.282 ± 0.742
4.856AlaSer: 4.856 ± 0.612
5.367AlaThr: 5.367 ± 0.784
5.622AlaVal: 5.622 ± 0.825
1.193AlaTrp: 1.193 ± 0.32
2.726AlaTyr: 2.726 ± 0.573
0.0AlaXaa: 0.0 ± 0.0
Cys
1.107CysAla: 1.107 ± 0.336
0.341CysCys: 0.341 ± 0.162
0.852CysAsp: 0.852 ± 0.27
0.937CysGlu: 0.937 ± 0.303
0.426CysPhe: 0.426 ± 0.227
1.107CysGly: 1.107 ± 0.333
0.426CysHis: 0.426 ± 0.241
0.681CysIle: 0.681 ± 0.268
0.767CysLys: 0.767 ± 0.216
0.767CysLeu: 0.767 ± 0.296
0.256CysMet: 0.256 ± 0.156
0.852CysAsn: 0.852 ± 0.321
0.256CysPro: 0.256 ± 0.132
0.256CysGln: 0.256 ± 0.118
0.767CysArg: 0.767 ± 0.307
1.022CysSer: 1.022 ± 0.276
0.341CysThr: 0.341 ± 0.184
0.511CysVal: 0.511 ± 0.183
0.0CysTrp: 0.0 ± 0.0
0.426CysTyr: 0.426 ± 0.216
0.0CysXaa: 0.0 ± 0.0
Asp
7.156AspAla: 7.156 ± 0.834
0.596AspCys: 0.596 ± 0.263
3.237AspAsp: 3.237 ± 0.561
4.174AspGlu: 4.174 ± 0.589
2.385AspPhe: 2.385 ± 0.435
6.133AspGly: 6.133 ± 0.765
1.363AspHis: 1.363 ± 0.376
3.833AspIle: 3.833 ± 0.545
3.578AspLys: 3.578 ± 0.36
4.515AspLeu: 4.515 ± 0.67
1.619AspMet: 1.619 ± 0.297
2.641AspAsn: 2.641 ± 0.572
1.959AspPro: 1.959 ± 0.471
1.448AspGln: 1.448 ± 0.348
3.152AspArg: 3.152 ± 0.501
2.556AspSer: 2.556 ± 0.466
2.896AspThr: 2.896 ± 0.531
4.259AspVal: 4.259 ± 0.686
1.022AspTrp: 1.022 ± 0.337
2.896AspTyr: 2.896 ± 0.398
0.0AspXaa: 0.0 ± 0.0
Glu
6.9GluAla: 6.9 ± 1.636
0.937GluCys: 0.937 ± 0.301
2.13GluAsp: 2.13 ± 0.388
6.133GluGlu: 6.133 ± 1.223
2.896GluPhe: 2.896 ± 0.434
3.237GluGly: 3.237 ± 0.458
1.789GluHis: 1.789 ± 0.394
5.452GluIle: 5.452 ± 0.836
5.111GluLys: 5.111 ± 0.762
6.474GluLeu: 6.474 ± 0.89
1.704GluMet: 1.704 ± 0.458
2.982GluAsn: 2.982 ± 0.423
2.896GluPro: 2.896 ± 0.844
4.089GluGln: 4.089 ± 0.617
4.515GluArg: 4.515 ± 1.118
3.663GluSer: 3.663 ± 0.691
2.3GluThr: 2.3 ± 0.372
4.344GluVal: 4.344 ± 0.664
1.107GluTrp: 1.107 ± 0.254
2.556GluTyr: 2.556 ± 0.545
0.0GluXaa: 0.0 ± 0.0
Phe
2.556PheAla: 2.556 ± 0.432
0.596PheCys: 0.596 ± 0.305
3.237PheAsp: 3.237 ± 0.489
1.874PheGlu: 1.874 ± 0.364
0.596PhePhe: 0.596 ± 0.25
2.13PheGly: 2.13 ± 0.496
0.596PheHis: 0.596 ± 0.235
2.47PheIle: 2.47 ± 0.561
1.874PheLys: 1.874 ± 0.383
1.533PheLeu: 1.533 ± 0.354
1.107PheMet: 1.107 ± 0.298
1.278PheAsn: 1.278 ± 0.276
1.278PhePro: 1.278 ± 0.285
1.533PheGln: 1.533 ± 0.462
1.959PheArg: 1.959 ± 0.376
2.3PheSer: 2.3 ± 0.4
2.385PheThr: 2.385 ± 0.442
1.107PheVal: 1.107 ± 0.191
1.022PheTrp: 1.022 ± 0.261
0.937PheTyr: 0.937 ± 0.35
0.0PheXaa: 0.0 ± 0.0
Gly
6.645GlyAla: 6.645 ± 1.014
0.767GlyCys: 0.767 ± 0.27
4.515GlyAsp: 4.515 ± 0.618
3.833GlyGlu: 3.833 ± 0.651
2.044GlyPhe: 2.044 ± 0.387
4.515GlyGly: 4.515 ± 0.747
1.022GlyHis: 1.022 ± 0.321
4.43GlyIle: 4.43 ± 0.716
5.367GlyLys: 5.367 ± 0.758
4.856GlyLeu: 4.856 ± 0.773
1.959GlyMet: 1.959 ± 0.406
3.919GlyAsn: 3.919 ± 0.455
1.363GlyPro: 1.363 ± 0.399
3.493GlyGln: 3.493 ± 0.676
4.344GlyArg: 4.344 ± 0.672
3.919GlySer: 3.919 ± 0.651
4.004GlyThr: 4.004 ± 0.496
5.111GlyVal: 5.111 ± 0.814
1.959GlyTrp: 1.959 ± 0.424
2.896GlyTyr: 2.896 ± 0.62
0.0GlyXaa: 0.0 ± 0.0
His
1.533HisAla: 1.533 ± 0.325
0.256HisCys: 0.256 ± 0.127
1.022HisAsp: 1.022 ± 0.331
1.619HisGlu: 1.619 ± 0.355
0.852HisPhe: 0.852 ± 0.235
1.874HisGly: 1.874 ± 0.371
0.341HisHis: 0.341 ± 0.201
0.681HisIle: 0.681 ± 0.215
0.937HisLys: 0.937 ± 0.289
1.533HisLeu: 1.533 ± 0.361
0.256HisMet: 0.256 ± 0.141
0.426HisAsn: 0.426 ± 0.148
1.278HisPro: 1.278 ± 0.307
0.767HisGln: 0.767 ± 0.265
0.767HisArg: 0.767 ± 0.238
0.767HisSer: 0.767 ± 0.205
0.341HisThr: 0.341 ± 0.177
0.852HisVal: 0.852 ± 0.253
0.256HisTrp: 0.256 ± 0.167
0.596HisTyr: 0.596 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
5.452IleAla: 5.452 ± 0.649
0.767IleCys: 0.767 ± 0.243
4.344IleAsp: 4.344 ± 0.579
4.941IleGlu: 4.941 ± 0.692
1.704IlePhe: 1.704 ± 0.36
4.259IleGly: 4.259 ± 0.626
0.681IleHis: 0.681 ± 0.22
3.833IleIle: 3.833 ± 0.698
4.259IleLys: 4.259 ± 0.664
3.407IleLeu: 3.407 ± 0.5
1.448IleMet: 1.448 ± 0.337
3.748IleAsn: 3.748 ± 0.668
2.811IlePro: 2.811 ± 0.579
2.385IleGln: 2.385 ± 0.442
3.407IleArg: 3.407 ± 0.519
4.941IleSer: 4.941 ± 0.727
3.578IleThr: 3.578 ± 0.543
2.896IleVal: 2.896 ± 0.582
0.937IleTrp: 0.937 ± 0.311
1.533IleTyr: 1.533 ± 0.355
0.0IleXaa: 0.0 ± 0.0
Lys
6.9LysAla: 6.9 ± 1.169
1.022LysCys: 1.022 ± 0.269
3.748LysAsp: 3.748 ± 0.539
5.282LysGlu: 5.282 ± 0.844
2.13LysPhe: 2.13 ± 0.451
4.174LysGly: 4.174 ± 0.503
0.852LysHis: 0.852 ± 0.226
3.067LysIle: 3.067 ± 0.579
4.259LysLys: 4.259 ± 0.773
4.515LysLeu: 4.515 ± 0.635
1.959LysMet: 1.959 ± 0.448
2.13LysAsn: 2.13 ± 0.383
3.663LysPro: 3.663 ± 0.578
3.407LysGln: 3.407 ± 0.501
4.856LysArg: 4.856 ± 0.764
3.407LysSer: 3.407 ± 0.523
3.322LysThr: 3.322 ± 0.519
2.641LysVal: 2.641 ± 0.498
0.681LysTrp: 0.681 ± 0.239
1.959LysTyr: 1.959 ± 0.476
0.0LysXaa: 0.0 ± 0.0
Leu
6.389LeuAla: 6.389 ± 0.918
0.767LeuCys: 0.767 ± 0.226
4.77LeuAsp: 4.77 ± 0.579
4.77LeuGlu: 4.77 ± 0.7
2.811LeuPhe: 2.811 ± 0.519
4.004LeuGly: 4.004 ± 0.952
0.937LeuHis: 0.937 ± 0.294
3.748LeuIle: 3.748 ± 0.657
4.174LeuLys: 4.174 ± 0.677
5.026LeuLeu: 5.026 ± 0.611
2.47LeuMet: 2.47 ± 0.46
4.004LeuAsn: 4.004 ± 0.671
3.322LeuPro: 3.322 ± 0.634
3.578LeuGln: 3.578 ± 0.625
4.856LeuArg: 4.856 ± 0.775
5.452LeuSer: 5.452 ± 0.62
5.282LeuThr: 5.282 ± 0.523
4.004LeuVal: 4.004 ± 0.4
0.852LeuTrp: 0.852 ± 0.356
1.448LeuTyr: 1.448 ± 0.369
0.0LeuXaa: 0.0 ± 0.0
Met
2.811MetAla: 2.811 ± 0.525
0.085MetCys: 0.085 ± 0.083
1.959MetAsp: 1.959 ± 0.375
1.448MetGlu: 1.448 ± 0.344
0.681MetPhe: 0.681 ± 0.234
1.959MetGly: 1.959 ± 0.375
0.341MetHis: 0.341 ± 0.164
1.704MetIle: 1.704 ± 0.377
2.044MetLys: 2.044 ± 0.31
2.385MetLeu: 2.385 ± 0.426
1.193MetMet: 1.193 ± 0.361
1.107MetAsn: 1.107 ± 0.298
1.448MetPro: 1.448 ± 0.382
0.937MetGln: 0.937 ± 0.286
2.3MetArg: 2.3 ± 0.468
2.385MetSer: 2.385 ± 0.397
1.619MetThr: 1.619 ± 0.382
1.107MetVal: 1.107 ± 0.346
0.085MetTrp: 0.085 ± 0.089
1.278MetTyr: 1.278 ± 0.351
0.0MetXaa: 0.0 ± 0.0
Asn
4.43AsnAla: 4.43 ± 0.618
0.256AsnCys: 0.256 ± 0.16
3.493AsnAsp: 3.493 ± 0.649
2.726AsnGlu: 2.726 ± 0.464
0.426AsnPhe: 0.426 ± 0.225
5.452AsnGly: 5.452 ± 0.875
1.193AsnHis: 1.193 ± 0.367
2.811AsnIle: 2.811 ± 0.653
2.982AsnLys: 2.982 ± 0.427
3.663AsnLeu: 3.663 ± 0.544
1.278AsnMet: 1.278 ± 0.302
2.385AsnAsn: 2.385 ± 0.597
3.237AsnPro: 3.237 ± 0.72
3.493AsnGln: 3.493 ± 0.668
2.215AsnArg: 2.215 ± 0.495
2.896AsnSer: 2.896 ± 0.476
1.533AsnThr: 1.533 ± 0.437
3.663AsnVal: 3.663 ± 0.675
0.341AsnTrp: 0.341 ± 0.138
1.619AsnTyr: 1.619 ± 0.398
0.0AsnXaa: 0.0 ± 0.0
Pro
2.982ProAla: 2.982 ± 0.536
0.596ProCys: 0.596 ± 0.211
3.322ProAsp: 3.322 ± 0.537
4.6ProGlu: 4.6 ± 0.809
2.044ProPhe: 2.044 ± 0.455
2.3ProGly: 2.3 ± 0.451
0.596ProHis: 0.596 ± 0.223
2.215ProIle: 2.215 ± 0.417
2.982ProLys: 2.982 ± 0.489
1.959ProLeu: 1.959 ± 0.337
0.937ProMet: 0.937 ± 0.282
1.874ProAsn: 1.874 ± 0.319
0.937ProPro: 0.937 ± 0.274
1.789ProGln: 1.789 ± 0.41
1.278ProArg: 1.278 ± 0.367
3.152ProSer: 3.152 ± 0.448
1.448ProThr: 1.448 ± 0.292
2.896ProVal: 2.896 ± 0.469
0.341ProTrp: 0.341 ± 0.145
1.533ProTyr: 1.533 ± 0.38
0.0ProXaa: 0.0 ± 0.0
Gln
6.048GlnAla: 6.048 ± 1.056
0.511GlnCys: 0.511 ± 0.222
1.363GlnAsp: 1.363 ± 0.325
2.47GlnGlu: 2.47 ± 0.41
1.448GlnPhe: 1.448 ± 0.303
2.215GlnGly: 2.215 ± 0.45
0.596GlnHis: 0.596 ± 0.239
2.982GlnIle: 2.982 ± 0.493
2.896GlnLys: 2.896 ± 0.453
3.748GlnLeu: 3.748 ± 0.452
1.874GlnMet: 1.874 ± 0.437
2.556GlnAsn: 2.556 ± 0.587
1.874GlnPro: 1.874 ± 0.333
4.089GlnGln: 4.089 ± 0.927
3.493GlnArg: 3.493 ± 0.491
2.811GlnSer: 2.811 ± 0.588
1.704GlnThr: 1.704 ± 0.403
2.3GlnVal: 2.3 ± 0.46
0.852GlnTrp: 0.852 ± 0.256
2.044GlnTyr: 2.044 ± 0.505
0.0GlnXaa: 0.0 ± 0.0
Arg
3.919ArgAla: 3.919 ± 0.647
0.852ArgCys: 0.852 ± 0.274
3.663ArgAsp: 3.663 ± 0.523
5.282ArgGlu: 5.282 ± 1.182
1.619ArgPhe: 1.619 ± 0.345
4.004ArgGly: 4.004 ± 0.578
0.937ArgHis: 0.937 ± 0.243
4.6ArgIle: 4.6 ± 0.793
4.259ArgLys: 4.259 ± 0.741
5.196ArgLeu: 5.196 ± 0.683
1.789ArgMet: 1.789 ± 0.415
3.237ArgAsn: 3.237 ± 0.444
1.448ArgPro: 1.448 ± 0.318
2.47ArgGln: 2.47 ± 0.65
3.748ArgArg: 3.748 ± 0.644
3.493ArgSer: 3.493 ± 0.646
2.13ArgThr: 2.13 ± 0.473
3.919ArgVal: 3.919 ± 0.692
0.511ArgTrp: 0.511 ± 0.176
2.3ArgTyr: 2.3 ± 0.431
0.0ArgXaa: 0.0 ± 0.0
Ser
4.515SerAla: 4.515 ± 0.842
0.596SerCys: 0.596 ± 0.292
3.578SerAsp: 3.578 ± 0.518
4.174SerGlu: 4.174 ± 0.614
2.13SerPhe: 2.13 ± 0.371
5.707SerGly: 5.707 ± 0.659
1.022SerHis: 1.022 ± 0.274
4.174SerIle: 4.174 ± 0.699
3.152SerLys: 3.152 ± 0.503
4.259SerLeu: 4.259 ± 0.593
1.874SerMet: 1.874 ± 0.32
3.322SerAsn: 3.322 ± 0.567
2.726SerPro: 2.726 ± 0.437
3.067SerGln: 3.067 ± 0.454
3.322SerArg: 3.322 ± 0.488
3.919SerSer: 3.919 ± 0.876
2.641SerThr: 2.641 ± 0.656
3.237SerVal: 3.237 ± 0.578
0.681SerTrp: 0.681 ± 0.23
2.3SerTyr: 2.3 ± 0.419
0.0SerXaa: 0.0 ± 0.0
Thr
4.089ThrAla: 4.089 ± 0.48
0.681ThrCys: 0.681 ± 0.245
3.493ThrAsp: 3.493 ± 0.476
2.556ThrGlu: 2.556 ± 0.516
1.448ThrPhe: 1.448 ± 0.401
4.685ThrGly: 4.685 ± 0.726
1.107ThrHis: 1.107 ± 0.316
2.47ThrIle: 2.47 ± 0.492
2.556ThrLys: 2.556 ± 0.407
3.663ThrLeu: 3.663 ± 0.617
0.937ThrMet: 0.937 ± 0.318
2.215ThrAsn: 2.215 ± 0.451
2.982ThrPro: 2.982 ± 0.482
2.044ThrGln: 2.044 ± 0.378
2.215ThrArg: 2.215 ± 0.396
2.896ThrSer: 2.896 ± 0.57
3.407ThrThr: 3.407 ± 0.532
4.004ThrVal: 4.004 ± 0.712
1.022ThrTrp: 1.022 ± 0.242
2.3ThrTyr: 2.3 ± 0.431
0.0ThrXaa: 0.0 ± 0.0
Val
5.196ValAla: 5.196 ± 0.64
0.852ValCys: 0.852 ± 0.292
2.982ValAsp: 2.982 ± 0.385
4.259ValGlu: 4.259 ± 0.408
1.959ValPhe: 1.959 ± 0.491
4.43ValGly: 4.43 ± 0.596
0.767ValHis: 0.767 ± 0.243
3.748ValIle: 3.748 ± 0.456
3.493ValLys: 3.493 ± 0.482
4.344ValLeu: 4.344 ± 0.616
1.278ValMet: 1.278 ± 0.263
3.919ValAsn: 3.919 ± 0.44
2.556ValPro: 2.556 ± 0.334
2.044ValGln: 2.044 ± 0.368
2.47ValArg: 2.47 ± 0.402
3.493ValSer: 3.493 ± 0.502
4.43ValThr: 4.43 ± 0.651
3.748ValVal: 3.748 ± 0.576
1.193ValTrp: 1.193 ± 0.312
1.789ValTyr: 1.789 ± 0.484
0.0ValXaa: 0.0 ± 0.0
Trp
1.193TrpAla: 1.193 ± 0.378
0.341TrpCys: 0.341 ± 0.156
1.022TrpAsp: 1.022 ± 0.324
0.511TrpGlu: 0.511 ± 0.23
0.681TrpPhe: 0.681 ± 0.217
0.596TrpGly: 0.596 ± 0.209
0.341TrpHis: 0.341 ± 0.208
0.937TrpIle: 0.937 ± 0.332
1.107TrpLys: 1.107 ± 0.352
1.363TrpLeu: 1.363 ± 0.334
0.596TrpMet: 0.596 ± 0.211
0.767TrpAsn: 0.767 ± 0.282
0.511TrpPro: 0.511 ± 0.239
0.681TrpGln: 0.681 ± 0.262
1.533TrpArg: 1.533 ± 0.37
0.681TrpSer: 0.681 ± 0.235
0.511TrpThr: 0.511 ± 0.183
0.852TrpVal: 0.852 ± 0.299
0.511TrpTrp: 0.511 ± 0.202
0.341TrpTyr: 0.341 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.385TyrAla: 2.385 ± 0.47
0.596TyrCys: 0.596 ± 0.23
2.726TyrAsp: 2.726 ± 0.505
2.044TyrGlu: 2.044 ± 0.423
1.619TyrPhe: 1.619 ± 0.288
2.726TyrGly: 2.726 ± 0.403
0.767TyrHis: 0.767 ± 0.215
1.959TyrIle: 1.959 ± 0.451
1.874TyrLys: 1.874 ± 0.362
3.067TyrLeu: 3.067 ± 0.483
0.511TyrMet: 0.511 ± 0.239
1.278TyrAsn: 1.278 ± 0.316
1.278TyrPro: 1.278 ± 0.354
1.959TyrGln: 1.959 ± 0.366
2.896TyrArg: 2.896 ± 0.596
1.789TyrSer: 1.789 ± 0.414
1.533TyrThr: 1.533 ± 0.341
1.874TyrVal: 1.874 ± 0.37
0.511TyrTrp: 0.511 ± 0.248
0.767TyrTyr: 0.767 ± 0.289
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11740 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski