Amino acid dipeptide frequency for Morganella phage IME1369_02

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
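
The calculation described above can be illustrated with a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent residue pairs within each protein, that pairs never span two proteins, and that frequencies are normalized by the total number of pairs in the proteome; the exact procedure used by Proteome-pI may differ. The names `dipeptide_permille` and `classify` and the toy sequences are hypothetical and serve only as an illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes 'X' (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency if all 400 standard dipeptides were equally likely: 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are pooled as 'X'.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; no pair spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=EXPECTED_PERMILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"   # would be marked in red
    if freq < expected:
        return "under"  # would be marked in blue
    return "expected"


# Toy input (not real phage proteins): pairs are MA, AA, AK and AA, AL.
toy = ["MAAK", "AAL"]
freqs = dipeptide_permille(toy)
print(freqs["AA"])        # AlaAla: 2 of 5 pairs
print(classify(10.469))   # AlaAla value from the table
print(classify(1.089))    # AlaCys value from the table
```

With the table values, AlaAla (10.469‰) lies above the 2.5‰ expectation and AlaCys (1.089‰) lies below it.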

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.469 ± 1.247
AlaCys: 1.089 ± 0.339
AlaAsp: 5.025 ± 0.6
AlaGlu: 8.04 ± 0.751
AlaPhe: 3.518 ± 0.477
AlaGly: 8.459 ± 0.902
AlaHis: 1.089 ± 0.239
AlaIle: 5.025 ± 0.596
AlaLys: 4.439 ± 0.651
AlaLeu: 7.37 ± 0.797
AlaMet: 2.68 ± 0.429
AlaAsn: 2.429 ± 0.438
AlaPro: 2.596 ± 0.449
AlaGln: 3.685 ± 0.72
AlaArg: 4.439 ± 0.693
AlaSer: 4.858 ± 1.06
AlaThr: 3.518 ± 0.602
AlaVal: 5.528 ± 0.671
AlaTrp: 1.843 ± 0.279
AlaTyr: 2.513 ± 0.421
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.005 ± 0.374
CysCys: 0.586 ± 0.226
CysAsp: 1.089 ± 0.268
CysGlu: 1.005 ± 0.322
CysPhe: 0.419 ± 0.206
CysGly: 1.089 ± 0.316
CysHis: 0.335 ± 0.163
CysIle: 0.838 ± 0.263
CysLys: 0.838 ± 0.322
CysLeu: 0.67 ± 0.234
CysMet: 0.335 ± 0.162
CysAsn: 0.503 ± 0.252
CysPro: 0.419 ± 0.18
CysGln: 0.754 ± 0.248
CysArg: 0.838 ± 0.28
CysSer: 0.921 ± 0.271
CysThr: 0.419 ± 0.218
CysVal: 0.335 ± 0.138
CysTrp: 0.335 ± 0.16
CysTyr: 0.503 ± 0.183
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.774 ± 0.582
AspCys: 0.586 ± 0.242
AspAsp: 4.104 ± 0.811
AspGlu: 4.941 ± 0.626
AspPhe: 1.508 ± 0.297
AspGly: 5.193 ± 0.916
AspHis: 0.921 ± 0.328
AspIle: 3.601 ± 0.596
AspLys: 3.099 ± 0.494
AspLeu: 4.69 ± 0.567
AspMet: 2.345 ± 0.557
AspAsn: 3.099 ± 0.499
AspPro: 2.764 ± 0.524
AspGln: 1.843 ± 0.429
AspArg: 2.513 ± 0.497
AspSer: 2.931 ± 0.373
AspThr: 3.099 ± 0.425
AspVal: 4.104 ± 0.611
AspTrp: 0.754 ± 0.264
AspTyr: 1.591 ± 0.397
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.695 ± 0.655
GluCys: 1.256 ± 0.369
GluAsp: 3.015 ± 0.507
GluGlu: 3.266 ± 0.617
GluPhe: 3.099 ± 0.463
GluGly: 2.596 ± 0.477
GluHis: 2.01 ± 0.452
GluIle: 3.601 ± 0.615
GluLys: 4.104 ± 0.648
GluLeu: 8.291 ± 0.936
GluMet: 2.68 ± 0.498
GluAsn: 3.35 ± 0.527
GluPro: 1.926 ± 0.401
GluGln: 4.355 ± 0.804
GluArg: 4.606 ± 0.64
GluSer: 3.183 ± 0.582
GluThr: 3.35 ± 0.569
GluVal: 4.104 ± 0.613
GluTrp: 1.089 ± 0.354
GluTyr: 1.759 ± 0.341
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.178 ± 0.448
PheCys: 0.335 ± 0.162
PheAsp: 2.513 ± 0.43
PheGlu: 2.01 ± 0.486
PhePhe: 1.005 ± 0.298
PheGly: 2.68 ± 0.494
PheHis: 0.419 ± 0.179
PheIle: 2.094 ± 0.35
PheLys: 2.429 ± 0.481
PheLeu: 2.68 ± 0.546
PheMet: 1.089 ± 0.332
PheAsn: 2.01 ± 0.352
PhePro: 1.675 ± 0.428
PheGln: 0.921 ± 0.273
PheArg: 2.094 ± 0.394
PheSer: 2.01 ± 0.437
PheThr: 2.429 ± 0.444
PheVal: 1.759 ± 0.329
PheTrp: 0.503 ± 0.197
PheTyr: 1.759 ± 0.409
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.863 ± 1.039
GlyCys: 0.838 ± 0.286
GlyAsp: 4.355 ± 0.682
GlyGlu: 5.863 ± 0.603
GlyPhe: 3.266 ± 0.461
GlyGly: 6.365 ± 0.814
GlyHis: 0.586 ± 0.25
GlyIle: 4.606 ± 0.644
GlyLys: 4.69 ± 0.685
GlyLeu: 4.606 ± 0.506
GlyMet: 2.01 ± 0.374
GlyAsn: 3.769 ± 0.77
GlyPro: 0.838 ± 0.302
GlyGln: 2.01 ± 0.412
GlyArg: 3.853 ± 0.696
GlySer: 4.271 ± 0.732
GlyThr: 5.36 ± 0.923
GlyVal: 5.025 ± 0.583
GlyTrp: 0.754 ± 0.284
GlyTyr: 3.183 ± 0.491
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.34 ± 0.316
HisCys: 0.419 ± 0.176
HisAsp: 2.094 ± 0.414
HisGlu: 1.173 ± 0.28
HisPhe: 0.67 ± 0.232
HisGly: 1.424 ± 0.395
HisHis: 1.005 ± 0.468
HisIle: 1.843 ± 0.413
HisLys: 1.089 ± 0.329
HisLeu: 1.675 ± 0.404
HisMet: 0.419 ± 0.182
HisAsn: 0.586 ± 0.176
HisPro: 0.754 ± 0.28
HisGln: 1.089 ± 0.319
HisArg: 1.34 ± 0.309
HisSer: 1.424 ± 0.367
HisThr: 0.67 ± 0.189
HisVal: 0.921 ± 0.293
HisTrp: 0.503 ± 0.225
HisTyr: 1.089 ± 0.304
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.355 ± 0.628
IleCys: 0.754 ± 0.244
IleAsp: 4.941 ± 0.711
IleGlu: 4.271 ± 0.658
IlePhe: 1.843 ± 0.382
IleGly: 3.35 ± 0.511
IleHis: 1.508 ± 0.332
IleIle: 3.183 ± 0.732
IleLys: 3.434 ± 0.533
IleLeu: 3.685 ± 0.68
IleMet: 0.838 ± 0.246
IleAsn: 3.015 ± 0.46
IlePro: 2.68 ± 0.481
IleGln: 2.094 ± 0.407
IleArg: 3.936 ± 0.565
IleSer: 4.104 ± 0.677
IleThr: 4.271 ± 0.493
IleVal: 3.183 ± 0.471
IleTrp: 0.586 ± 0.248
IleTyr: 1.005 ± 0.24
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.695 ± 0.623
LysCys: 0.168 ± 0.13
LysAsp: 2.596 ± 0.424
LysGlu: 4.02 ± 0.614
LysPhe: 1.508 ± 0.319
LysGly: 3.266 ± 0.515
LysHis: 1.591 ± 0.41
LysIle: 2.931 ± 0.438
LysLys: 3.601 ± 0.795
LysLeu: 4.439 ± 0.582
LysMet: 1.926 ± 0.375
LysAsn: 2.764 ± 0.677
LysPro: 2.68 ± 0.504
LysGln: 1.926 ± 0.388
LysArg: 2.68 ± 0.531
LysSer: 4.69 ± 0.631
LysThr: 4.02 ± 0.467
LysVal: 3.015 ± 0.521
LysTrp: 1.256 ± 0.289
LysTyr: 2.429 ± 0.393
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.035 ± 0.789
LeuCys: 1.173 ± 0.379
LeuAsp: 4.523 ± 0.625
LeuGlu: 3.685 ± 0.573
LeuPhe: 3.099 ± 0.482
LeuGly: 3.601 ± 0.545
LeuHis: 1.508 ± 0.344
LeuIle: 3.35 ± 0.411
LeuLys: 5.444 ± 0.769
LeuLeu: 6.281 ± 0.831
LeuMet: 2.345 ± 0.417
LeuAsn: 4.523 ± 0.446
LeuPro: 3.853 ± 0.541
LeuGln: 2.513 ± 0.487
LeuArg: 5.946 ± 0.802
LeuSer: 6.951 ± 0.868
LeuThr: 4.941 ± 0.631
LeuVal: 4.439 ± 0.578
LeuTrp: 0.419 ± 0.207
LeuTyr: 2.68 ± 0.462
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.345 ± 0.544
MetCys: 0.251 ± 0.153
MetAsp: 0.838 ± 0.245
MetGlu: 1.508 ± 0.338
MetPhe: 0.838 ± 0.384
MetGly: 1.759 ± 0.437
MetHis: 0.503 ± 0.198
MetIle: 1.34 ± 0.361
MetLys: 1.926 ± 0.396
MetLeu: 2.848 ± 0.566
MetMet: 0.838 ± 0.297
MetAsn: 1.089 ± 0.291
MetPro: 1.591 ± 0.383
MetGln: 1.005 ± 0.279
MetArg: 1.508 ± 0.373
MetSer: 3.266 ± 0.483
MetThr: 2.764 ± 0.374
MetVal: 1.759 ± 0.369
MetTrp: 0.419 ± 0.188
MetTyr: 0.168 ± 0.1
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.936 ± 0.6
AsnCys: 0.67 ± 0.234
AsnAsp: 2.596 ± 0.35
AsnGlu: 2.513 ± 0.456
AsnPhe: 1.424 ± 0.311
AsnGly: 3.936 ± 0.572
AsnHis: 1.424 ± 0.369
AsnIle: 2.764 ± 0.415
AsnLys: 2.513 ± 0.499
AsnLeu: 2.764 ± 0.419
AsnMet: 0.921 ± 0.35
AsnAsn: 1.926 ± 0.325
AsnPro: 2.764 ± 0.412
AsnGln: 2.094 ± 0.528
AsnArg: 2.094 ± 0.463
AsnSer: 2.345 ± 0.466
AsnThr: 2.345 ± 0.5
AsnVal: 1.675 ± 0.325
AsnTrp: 1.089 ± 0.264
AsnTyr: 1.089 ± 0.258
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.601 ± 0.655
ProCys: 0.586 ± 0.216
ProAsp: 3.518 ± 0.542
ProGlu: 3.769 ± 0.648
ProPhe: 1.256 ± 0.411
ProGly: 2.848 ± 0.472
ProHis: 1.089 ± 0.338
ProIle: 1.173 ± 0.281
ProLys: 2.01 ± 0.431
ProLeu: 2.931 ± 0.466
ProMet: 1.089 ± 0.256
ProAsn: 1.005 ± 0.333
ProPro: 2.178 ± 0.565
ProGln: 1.843 ± 0.468
ProArg: 2.094 ± 0.394
ProSer: 2.68 ± 0.453
ProThr: 1.508 ± 0.299
ProVal: 4.104 ± 0.536
ProTrp: 1.089 ± 0.306
ProTyr: 1.424 ± 0.319
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.518 ± 0.604
GlnCys: 0.503 ± 0.203
GlnAsp: 1.926 ± 0.379
GlnGlu: 2.178 ± 0.473
GlnPhe: 1.256 ± 0.29
GlnGly: 2.931 ± 0.389
GlnHis: 0.921 ± 0.247
GlnIle: 2.261 ± 0.398
GlnLys: 2.596 ± 0.728
GlnLeu: 2.68 ± 0.639
GlnMet: 1.591 ± 0.344
GlnAsn: 1.675 ± 0.327
GlnPro: 1.926 ± 0.395
GlnGln: 1.843 ± 0.495
GlnArg: 2.848 ± 0.538
GlnSer: 3.015 ± 0.802
GlnThr: 1.759 ± 0.314
GlnVal: 2.764 ± 0.428
GlnTrp: 1.173 ± 0.359
GlnTyr: 1.508 ± 0.317
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.779 ± 0.783
ArgCys: 0.838 ± 0.31
ArgAsp: 3.183 ± 0.437
ArgGlu: 4.104 ± 0.644
ArgPhe: 2.68 ± 0.424
ArgGly: 3.601 ± 0.559
ArgHis: 1.675 ± 0.291
ArgIle: 4.271 ± 0.692
ArgLys: 3.183 ± 0.526
ArgLeu: 4.941 ± 0.449
ArgMet: 1.173 ± 0.307
ArgAsn: 2.178 ± 0.409
ArgPro: 1.843 ± 0.35
ArgGln: 2.429 ± 0.531
ArgArg: 3.685 ± 0.65
ArgSer: 3.518 ± 0.686
ArgThr: 2.513 ± 0.474
ArgVal: 4.271 ± 0.807
ArgTrp: 0.503 ± 0.185
ArgTyr: 2.596 ± 0.394
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.365 ± 1.11
SerCys: 0.586 ± 0.263
SerAsp: 3.015 ± 0.625
SerGlu: 4.271 ± 0.422
SerPhe: 1.843 ± 0.398
SerGly: 6.7 ± 0.745
SerHis: 1.591 ± 0.363
SerIle: 3.099 ± 0.542
SerLys: 3.518 ± 0.539
SerLeu: 5.109 ± 0.788
SerMet: 2.094 ± 0.387
SerAsn: 2.178 ± 0.333
SerPro: 3.35 ± 0.583
SerGln: 2.764 ± 0.517
SerArg: 4.02 ± 0.619
SerSer: 4.439 ± 0.548
SerThr: 2.764 ± 0.523
SerVal: 5.611 ± 0.722
SerTrp: 1.591 ± 0.37
SerTyr: 1.759 ± 0.331
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.606 ± 0.492
ThrCys: 0.67 ± 0.209
ThrAsp: 3.183 ± 0.46
ThrGlu: 5.109 ± 0.764
ThrPhe: 1.926 ± 0.35
ThrGly: 5.695 ± 0.627
ThrHis: 1.089 ± 0.342
ThrIle: 3.601 ± 0.522
ThrLys: 2.345 ± 0.447
ThrLeu: 4.02 ± 0.589
ThrMet: 1.256 ± 0.322
ThrAsn: 2.178 ± 0.446
ThrPro: 3.015 ± 0.518
ThrGln: 2.345 ± 0.452
ThrArg: 2.513 ± 0.405
ThrSer: 3.266 ± 0.642
ThrThr: 3.769 ± 0.609
ThrVal: 4.271 ± 0.5
ThrTrp: 0.921 ± 0.26
ThrTyr: 1.843 ± 0.43
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.276 ± 0.78
ValCys: 0.754 ± 0.238
ValAsp: 3.685 ± 0.588
ValGlu: 2.848 ± 0.415
ValPhe: 2.01 ± 0.302
ValGly: 2.68 ± 0.466
ValHis: 1.005 ± 0.263
ValIle: 4.606 ± 0.604
ValLys: 3.601 ± 0.616
ValLeu: 5.695 ± 0.647
ValMet: 1.843 ± 0.45
ValAsn: 2.848 ± 0.505
ValPro: 3.099 ± 0.367
ValGln: 2.429 ± 0.415
ValArg: 3.853 ± 0.749
ValSer: 4.858 ± 0.674
ValThr: 4.69 ± 0.723
ValVal: 3.936 ± 0.524
ValTrp: 1.256 ± 0.309
ValTyr: 2.764 ± 0.453
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.005 ± 0.336
TrpCys: 0.586 ± 0.202
TrpAsp: 0.67 ± 0.242
TrpGlu: 1.173 ± 0.357
TrpPhe: 0.503 ± 0.183
TrpGly: 1.508 ± 0.397
TrpHis: 0.251 ± 0.14
TrpIle: 0.754 ± 0.226
TrpLys: 0.838 ± 0.273
TrpLeu: 1.256 ± 0.335
TrpMet: 0.67 ± 0.284
TrpAsn: 0.838 ± 0.244
TrpPro: 0.838 ± 0.26
TrpGln: 1.005 ± 0.287
TrpArg: 1.34 ± 0.315
TrpSer: 1.256 ± 0.325
TrpThr: 1.089 ± 0.371
TrpVal: 1.173 ± 0.303
TrpTrp: 0.419 ± 0.21
TrpTyr: 0.335 ± 0.162
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.769 ± 0.555
TyrCys: 0.586 ± 0.202
TyrAsp: 1.591 ± 0.325
TyrGlu: 1.591 ± 0.362
TyrPhe: 0.921 ± 0.281
TyrGly: 2.429 ± 0.403
TyrHis: 1.005 ± 0.267
TyrIle: 2.094 ± 0.393
TyrLys: 1.591 ± 0.406
TyrLeu: 1.675 ± 0.443
TyrMet: 0.335 ± 0.148
TyrAsn: 0.921 ± 0.219
TyrPro: 1.005 ± 0.316
TyrGln: 1.926 ± 0.409
TyrArg: 2.764 ± 0.489
TyrSer: 2.68 ± 0.763
TyrThr: 2.345 ± 0.449
TyrVal: 1.759 ± 0.354
TyrTrp: 1.005 ± 0.295
TyrTyr: 1.005 ± 0.325
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (11941 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
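
As an illustration of protein-level bootstrapping, the following Python sketch resamples whole proteins with replacement 100 times and reports the spread of the resulting dipeptide frequency. It assumes that the reported error is the standard deviation across resamples; the page does not state which statistic is used. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy input (not real phage proteins).
toy = ["MAAK", "AAL", "MKLAA", "GAAV"]
err = bootstrap_error(toy, "AA")
print(f"AlaAla: {pair_permille(toy, 'AA'):.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides are always counted within real proteins.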

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski