Amino acid dipepetide frequency for Cyanophage NATL1A-7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.092AlaAla: 9.092 ± 0.874
0.376AlaCys: 0.376 ± 0.168
6.312AlaAsp: 6.312 ± 1.228
5.109AlaGlu: 5.109 ± 0.594
2.555AlaPhe: 2.555 ± 0.364
7.063AlaGly: 7.063 ± 0.875
1.277AlaHis: 1.277 ± 0.303
3.907AlaIle: 3.907 ± 0.503
6.612AlaLys: 6.612 ± 1.002
6.537AlaLeu: 6.537 ± 0.515
2.029AlaMet: 2.029 ± 0.376
4.283AlaAsn: 4.283 ± 0.517
3.381AlaPro: 3.381 ± 0.599
3.231AlaGln: 3.231 ± 0.584
4.283AlaArg: 4.283 ± 0.62
5.786AlaSer: 5.786 ± 0.728
5.786AlaThr: 5.786 ± 0.753
4.508AlaVal: 4.508 ± 0.635
1.052AlaTrp: 1.052 ± 0.243
3.456AlaTyr: 3.456 ± 0.588
0.0AlaXaa: 0.0 ± 0.0
Cys
0.376CysAla: 0.376 ± 0.187
0.0CysCys: 0.0 ± 0.0
0.526CysAsp: 0.526 ± 0.184
0.451CysGlu: 0.451 ± 0.183
0.15CysPhe: 0.15 ± 0.097
0.827CysGly: 0.827 ± 0.233
0.075CysHis: 0.075 ± 0.07
0.301CysIle: 0.301 ± 0.137
0.451CysLys: 0.451 ± 0.181
0.751CysLeu: 0.751 ± 0.297
0.075CysMet: 0.075 ± 0.072
0.526CysAsn: 0.526 ± 0.183
0.15CysPro: 0.15 ± 0.111
0.225CysGln: 0.225 ± 0.14
0.376CysArg: 0.376 ± 0.138
0.827CysSer: 0.827 ± 0.286
1.127CysThr: 1.127 ± 0.322
0.225CysVal: 0.225 ± 0.137
0.0CysTrp: 0.0 ± 0.0
0.376CysTyr: 0.376 ± 0.177
0.0CysXaa: 0.0 ± 0.0
Asp
6.762AspAla: 6.762 ± 1.086
0.526AspCys: 0.526 ± 0.188
5.26AspAsp: 5.26 ± 0.758
3.832AspGlu: 3.832 ± 0.536
2.329AspPhe: 2.329 ± 0.377
4.959AspGly: 4.959 ± 0.606
1.052AspHis: 1.052 ± 0.3
4.433AspIle: 4.433 ± 0.575
3.381AspLys: 3.381 ± 0.459
5.034AspLeu: 5.034 ± 0.486
1.578AspMet: 1.578 ± 0.327
3.381AspAsn: 3.381 ± 0.403
2.93AspPro: 2.93 ± 0.476
2.705AspGln: 2.705 ± 0.424
2.555AspArg: 2.555 ± 0.463
4.959AspSer: 4.959 ± 0.919
3.907AspThr: 3.907 ± 0.551
3.982AspVal: 3.982 ± 0.525
1.202AspTrp: 1.202 ± 0.215
3.456AspTyr: 3.456 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
5.936GluAla: 5.936 ± 0.616
0.376GluCys: 0.376 ± 0.171
4.734GluAsp: 4.734 ± 0.562
4.734GluGlu: 4.734 ± 0.746
2.104GluPhe: 2.104 ± 0.449
4.283GluGly: 4.283 ± 0.592
1.202GluHis: 1.202 ± 0.394
2.78GluIle: 2.78 ± 0.403
3.757GluLys: 3.757 ± 0.569
6.988GluLeu: 6.988 ± 0.718
1.578GluMet: 1.578 ± 0.432
2.104GluAsn: 2.104 ± 0.417
2.104GluPro: 2.104 ± 0.332
3.381GluGln: 3.381 ± 0.58
2.104GluArg: 2.104 ± 0.466
3.005GluSer: 3.005 ± 0.511
4.659GluThr: 4.659 ± 0.791
3.982GluVal: 3.982 ± 0.424
0.902GluTrp: 0.902 ± 0.279
3.081GluTyr: 3.081 ± 0.574
0.0GluXaa: 0.0 ± 0.0
Phe
1.728PheAla: 1.728 ± 0.484
0.376PheCys: 0.376 ± 0.176
1.728PheAsp: 1.728 ± 0.367
1.653PheGlu: 1.653 ± 0.333
1.428PhePhe: 1.428 ± 0.339
2.555PheGly: 2.555 ± 0.535
1.052PheHis: 1.052 ± 0.322
2.104PheIle: 2.104 ± 0.398
2.404PheLys: 2.404 ± 0.442
2.404PheLeu: 2.404 ± 0.469
1.503PheMet: 1.503 ± 0.385
2.254PheAsn: 2.254 ± 0.49
0.526PhePro: 0.526 ± 0.163
1.352PheGln: 1.352 ± 0.376
2.254PheArg: 2.254 ± 0.463
2.48PheSer: 2.48 ± 0.456
2.555PheThr: 2.555 ± 0.429
1.954PheVal: 1.954 ± 0.432
0.301PheTrp: 0.301 ± 0.12
1.277PheTyr: 1.277 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
6.837GlyAla: 6.837 ± 0.903
0.526GlyCys: 0.526 ± 0.237
4.659GlyAsp: 4.659 ± 0.582
3.531GlyGlu: 3.531 ± 0.52
2.48GlyPhe: 2.48 ± 0.427
3.982GlyGly: 3.982 ± 0.559
1.202GlyHis: 1.202 ± 0.31
4.283GlyIle: 4.283 ± 0.545
4.057GlyLys: 4.057 ± 0.638
4.809GlyLeu: 4.809 ± 0.522
1.428GlyMet: 1.428 ± 0.412
4.508GlyAsn: 4.508 ± 0.539
1.503GlyPro: 1.503 ± 0.273
2.855GlyGln: 2.855 ± 0.448
3.156GlyArg: 3.156 ± 0.581
6.236GlySer: 6.236 ± 1.361
4.884GlyThr: 4.884 ± 0.816
5.335GlyVal: 5.335 ± 0.52
0.902GlyTrp: 0.902 ± 0.28
2.855GlyTyr: 2.855 ± 0.513
0.0GlyXaa: 0.0 ± 0.0
His
1.127HisAla: 1.127 ± 0.298
0.15HisCys: 0.15 ± 0.118
1.428HisAsp: 1.428 ± 0.453
0.902HisGlu: 0.902 ± 0.279
0.676HisPhe: 0.676 ± 0.313
1.352HisGly: 1.352 ± 0.33
0.225HisHis: 0.225 ± 0.109
1.352HisIle: 1.352 ± 0.522
0.676HisLys: 0.676 ± 0.284
1.954HisLeu: 1.954 ± 0.323
0.451HisMet: 0.451 ± 0.153
1.052HisAsn: 1.052 ± 0.321
1.202HisPro: 1.202 ± 0.268
0.751HisGln: 0.751 ± 0.28
0.601HisArg: 0.601 ± 0.304
1.277HisSer: 1.277 ± 0.288
1.202HisThr: 1.202 ± 0.26
0.751HisVal: 0.751 ± 0.258
0.075HisTrp: 0.075 ± 0.073
0.751HisTyr: 0.751 ± 0.275
0.0HisXaa: 0.0 ± 0.0
Ile
5.635IleAla: 5.635 ± 0.847
0.601IleCys: 0.601 ± 0.24
4.133IleAsp: 4.133 ± 0.613
3.682IleGlu: 3.682 ± 0.521
1.653IlePhe: 1.653 ± 0.279
3.531IleGly: 3.531 ± 0.499
1.503IleHis: 1.503 ± 0.335
2.48IleIle: 2.48 ± 0.411
5.034IleLys: 5.034 ± 0.627
4.057IleLeu: 4.057 ± 0.466
0.977IleMet: 0.977 ± 0.331
2.855IleAsn: 2.855 ± 0.41
3.005IlePro: 3.005 ± 0.421
2.179IleGln: 2.179 ± 0.325
2.179IleArg: 2.179 ± 0.369
4.283IleSer: 4.283 ± 0.607
3.531IleThr: 3.531 ± 0.587
2.029IleVal: 2.029 ± 0.446
0.676IleTrp: 0.676 ± 0.295
2.029IleTyr: 2.029 ± 0.39
0.0IleXaa: 0.0 ± 0.0
Lys
6.462LysAla: 6.462 ± 0.883
0.15LysCys: 0.15 ± 0.114
3.381LysAsp: 3.381 ± 0.525
5.335LysGlu: 5.335 ± 0.704
2.254LysPhe: 2.254 ± 0.424
3.306LysGly: 3.306 ± 0.534
0.827LysHis: 0.827 ± 0.289
4.508LysIle: 4.508 ± 0.73
5.936LysLys: 5.936 ± 1.391
7.213LysLeu: 7.213 ± 0.907
1.202LysMet: 1.202 ± 0.32
3.531LysAsn: 3.531 ± 0.761
3.005LysPro: 3.005 ± 0.54
3.156LysGln: 3.156 ± 0.468
3.456LysArg: 3.456 ± 0.639
4.057LysSer: 4.057 ± 0.625
4.433LysThr: 4.433 ± 0.583
2.93LysVal: 2.93 ± 0.48
1.052LysTrp: 1.052 ± 0.232
2.254LysTyr: 2.254 ± 0.351
0.0LysXaa: 0.0 ± 0.0
Leu
5.635LeuAla: 5.635 ± 0.654
0.751LeuCys: 0.751 ± 0.301
5.335LeuAsp: 5.335 ± 0.615
5.861LeuGlu: 5.861 ± 0.807
2.404LeuPhe: 2.404 ± 0.499
5.861LeuGly: 5.861 ± 0.651
1.878LeuHis: 1.878 ± 0.39
3.682LeuIle: 3.682 ± 0.614
6.161LeuLys: 6.161 ± 0.81
4.583LeuLeu: 4.583 ± 0.541
1.578LeuMet: 1.578 ± 0.278
4.283LeuAsn: 4.283 ± 0.697
3.381LeuPro: 3.381 ± 0.627
3.907LeuGln: 3.907 ± 0.631
3.156LeuArg: 3.156 ± 0.491
5.335LeuSer: 5.335 ± 0.533
5.335LeuThr: 5.335 ± 0.732
3.982LeuVal: 3.982 ± 0.66
0.601LeuTrp: 0.601 ± 0.249
2.329LeuTyr: 2.329 ± 0.419
0.0LeuXaa: 0.0 ± 0.0
Met
2.555MetAla: 2.555 ± 0.303
0.451MetCys: 0.451 ± 0.175
1.503MetAsp: 1.503 ± 0.331
1.428MetGlu: 1.428 ± 0.275
0.376MetPhe: 0.376 ± 0.165
2.179MetGly: 2.179 ± 0.398
0.376MetHis: 0.376 ± 0.301
0.827MetIle: 0.827 ± 0.277
2.104MetLys: 2.104 ± 0.446
1.728MetLeu: 1.728 ± 0.409
0.751MetMet: 0.751 ± 0.27
1.428MetAsn: 1.428 ± 0.34
0.451MetPro: 0.451 ± 0.186
0.977MetGln: 0.977 ± 0.292
1.277MetArg: 1.277 ± 0.313
1.428MetSer: 1.428 ± 0.303
1.428MetThr: 1.428 ± 0.252
1.428MetVal: 1.428 ± 0.324
0.15MetTrp: 0.15 ± 0.09
0.827MetTyr: 0.827 ± 0.228
0.0MetXaa: 0.0 ± 0.0
Asn
4.283AsnAla: 4.283 ± 0.552
0.301AsnCys: 0.301 ± 0.166
3.607AsnAsp: 3.607 ± 0.722
3.005AsnGlu: 3.005 ± 0.408
2.104AsnPhe: 2.104 ± 0.474
4.208AsnGly: 4.208 ± 0.444
1.052AsnHis: 1.052 ± 0.25
3.907AsnIle: 3.907 ± 0.587
3.005AsnLys: 3.005 ± 0.448
4.358AsnLeu: 4.358 ± 0.427
1.202AsnMet: 1.202 ± 0.28
3.381AsnAsn: 3.381 ± 0.673
2.404AsnPro: 2.404 ± 0.373
2.329AsnGln: 2.329 ± 0.491
2.329AsnArg: 2.329 ± 0.532
4.809AsnSer: 4.809 ± 0.657
3.832AsnThr: 3.832 ± 0.59
3.005AsnVal: 3.005 ± 0.515
0.977AsnTrp: 0.977 ± 0.341
1.878AsnTyr: 1.878 ± 0.326
0.0AsnXaa: 0.0 ± 0.0
Pro
2.48ProAla: 2.48 ± 0.398
0.15ProCys: 0.15 ± 0.118
3.456ProAsp: 3.456 ± 0.49
2.78ProGlu: 2.78 ± 0.515
1.503ProPhe: 1.503 ± 0.388
3.156ProGly: 3.156 ± 0.536
0.301ProHis: 0.301 ± 0.13
2.329ProIle: 2.329 ± 0.398
2.555ProLys: 2.555 ± 0.53
2.705ProLeu: 2.705 ± 0.47
0.751ProMet: 0.751 ± 0.244
2.404ProAsn: 2.404 ± 0.385
1.878ProPro: 1.878 ± 0.769
1.653ProGln: 1.653 ± 0.354
0.827ProArg: 0.827 ± 0.245
2.855ProSer: 2.855 ± 0.505
2.78ProThr: 2.78 ± 0.402
2.78ProVal: 2.78 ± 0.516
0.827ProTrp: 0.827 ± 0.324
1.352ProTyr: 1.352 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
3.907GlnAla: 3.907 ± 0.532
0.376GlnCys: 0.376 ± 0.167
1.954GlnAsp: 1.954 ± 0.425
2.78GlnGlu: 2.78 ± 0.57
2.029GlnPhe: 2.029 ± 0.358
2.029GlnGly: 2.029 ± 0.452
0.676GlnHis: 0.676 ± 0.224
2.705GlnIle: 2.705 ± 0.484
1.954GlnLys: 1.954 ± 0.308
3.832GlnLeu: 3.832 ± 0.477
1.127GlnMet: 1.127 ± 0.316
1.728GlnAsn: 1.728 ± 0.386
1.503GlnPro: 1.503 ± 0.413
1.277GlnGln: 1.277 ± 0.335
1.578GlnArg: 1.578 ± 0.316
3.081GlnSer: 3.081 ± 0.478
2.63GlnThr: 2.63 ± 0.336
3.607GlnVal: 3.607 ± 0.579
0.376GlnTrp: 0.376 ± 0.143
1.202GlnTyr: 1.202 ± 0.247
0.0GlnXaa: 0.0 ± 0.0
Arg
4.133ArgAla: 4.133 ± 0.607
0.301ArgCys: 0.301 ± 0.144
2.705ArgAsp: 2.705 ± 0.421
2.78ArgGlu: 2.78 ± 0.486
1.578ArgPhe: 1.578 ± 0.387
2.254ArgGly: 2.254 ± 0.524
0.676ArgHis: 0.676 ± 0.188
1.803ArgIle: 1.803 ± 0.375
3.531ArgLys: 3.531 ± 0.543
3.607ArgLeu: 3.607 ± 0.543
1.202ArgMet: 1.202 ± 0.276
3.381ArgAsn: 3.381 ± 0.549
1.578ArgPro: 1.578 ± 0.306
1.954ArgGln: 1.954 ± 0.335
1.503ArgArg: 1.503 ± 0.367
2.329ArgSer: 2.329 ± 0.456
2.404ArgThr: 2.404 ± 0.462
1.728ArgVal: 1.728 ± 0.395
0.751ArgTrp: 0.751 ± 0.21
1.428ArgTyr: 1.428 ± 0.244
0.0ArgXaa: 0.0 ± 0.0
Ser
5.56SerAla: 5.56 ± 0.726
0.601SerCys: 0.601 ± 0.253
5.335SerAsp: 5.335 ± 0.702
4.433SerGlu: 4.433 ± 0.482
2.179SerPhe: 2.179 ± 0.357
5.335SerGly: 5.335 ± 0.872
1.202SerHis: 1.202 ± 0.381
4.057SerIle: 4.057 ± 0.477
4.508SerLys: 4.508 ± 0.739
3.982SerLeu: 3.982 ± 0.541
1.578SerMet: 1.578 ± 0.284
3.682SerAsn: 3.682 ± 0.548
3.982SerPro: 3.982 ± 0.441
2.329SerGln: 2.329 ± 0.336
3.005SerArg: 3.005 ± 0.493
5.184SerSer: 5.184 ± 1.006
5.034SerThr: 5.034 ± 0.77
4.358SerVal: 4.358 ± 0.693
1.202SerTrp: 1.202 ± 0.325
2.705SerTyr: 2.705 ± 0.415
0.0SerXaa: 0.0 ± 0.0
Thr
6.762ThrAla: 6.762 ± 0.888
0.827ThrCys: 0.827 ± 0.297
4.133ThrAsp: 4.133 ± 0.594
3.081ThrGlu: 3.081 ± 0.455
2.555ThrPhe: 2.555 ± 0.495
6.161ThrGly: 6.161 ± 0.864
0.902ThrHis: 0.902 ± 0.233
5.109ThrIle: 5.109 ± 1.041
4.734ThrLys: 4.734 ± 0.632
4.583ThrLeu: 4.583 ± 0.544
1.277ThrMet: 1.277 ± 0.322
3.757ThrAsn: 3.757 ± 0.493
3.682ThrPro: 3.682 ± 0.465
2.104ThrGln: 2.104 ± 0.398
2.254ThrArg: 2.254 ± 0.548
4.659ThrSer: 4.659 ± 0.61
7.664ThrThr: 7.664 ± 1.071
3.682ThrVal: 3.682 ± 0.62
0.751ThrTrp: 0.751 ± 0.216
3.231ThrTyr: 3.231 ± 0.545
0.0ThrXaa: 0.0 ± 0.0
Val
4.057ValAla: 4.057 ± 0.779
0.601ValCys: 0.601 ± 0.218
3.607ValAsp: 3.607 ± 0.715
4.433ValGlu: 4.433 ± 0.66
1.954ValPhe: 1.954 ± 0.404
3.757ValGly: 3.757 ± 0.487
0.751ValHis: 0.751 ± 0.235
3.005ValIle: 3.005 ± 0.521
4.133ValLys: 4.133 ± 0.469
3.081ValLeu: 3.081 ± 0.488
1.503ValMet: 1.503 ± 0.339
4.208ValAsn: 4.208 ± 0.612
1.202ValPro: 1.202 ± 0.257
2.104ValGln: 2.104 ± 0.332
2.404ValArg: 2.404 ± 0.43
4.057ValSer: 4.057 ± 0.711
5.184ValThr: 5.184 ± 0.855
3.381ValVal: 3.381 ± 0.746
1.052ValTrp: 1.052 ± 0.217
1.653ValTyr: 1.653 ± 0.317
0.0ValXaa: 0.0 ± 0.0
Trp
0.601TrpAla: 0.601 ± 0.245
0.15TrpCys: 0.15 ± 0.114
1.127TrpAsp: 1.127 ± 0.253
1.352TrpGlu: 1.352 ± 0.355
0.676TrpPhe: 0.676 ± 0.187
0.451TrpGly: 0.451 ± 0.149
0.376TrpHis: 0.376 ± 0.139
0.451TrpIle: 0.451 ± 0.204
0.902TrpLys: 0.902 ± 0.263
1.352TrpLeu: 1.352 ± 0.337
0.676TrpMet: 0.676 ± 0.219
0.751TrpAsn: 0.751 ± 0.132
0.301TrpPro: 0.301 ± 0.153
0.301TrpGln: 0.301 ± 0.149
0.751TrpArg: 0.751 ± 0.244
0.676TrpSer: 0.676 ± 0.225
0.902TrpThr: 0.902 ± 0.216
0.827TrpVal: 0.827 ± 0.267
0.0TrpTrp: 0.0 ± 0.0
0.451TrpTyr: 0.451 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.705TyrAla: 2.705 ± 0.391
0.225TyrCys: 0.225 ± 0.121
3.306TyrAsp: 3.306 ± 0.556
2.705TyrGlu: 2.705 ± 0.505
0.902TyrPhe: 0.902 ± 0.322
2.78TyrGly: 2.78 ± 0.522
1.352TyrHis: 1.352 ± 0.381
2.029TyrIle: 2.029 ± 0.372
2.555TyrLys: 2.555 ± 0.567
2.48TyrLeu: 2.48 ± 0.43
1.127TyrMet: 1.127 ± 0.241
2.48TyrAsn: 2.48 ± 0.523
1.352TyrPro: 1.352 ± 0.286
1.578TyrGln: 1.578 ± 0.426
1.578TyrArg: 1.578 ± 0.419
2.93TyrSer: 2.93 ± 0.482
2.63TyrThr: 2.63 ± 0.478
1.653TyrVal: 1.653 ± 0.404
0.225TyrTrp: 0.225 ± 0.121
2.029TyrTyr: 2.029 ± 0.344
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13310 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski