Amino acid dipeptide frequency for Providencia phage Redjac

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
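The counting described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be given in one-letter codes, and frequencies are normalised here by the total number of overlapping pairs, which may differ from the normalisation used for the table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red in the table
    if freq_per_mille < expected:
        return "under"  # would be marked blue in the table
    return "expected"
```

With the table values, `classify(9.176)` (AlaAla) gives `"over"` and `classify(0.706)` (AlaCys) gives `"under"`.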

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
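As a small illustration of the unit conversion, the helpers below (hypothetical names, not part of Proteome-pI) turn a per mille value into a fraction or a percentage, which makes table entries directly comparable with the 0.25% uniform expectation.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# Example with a value from the table: AlaAla is listed as 9.176 per mille.
ala_ala_fraction = per_mille_to_fraction(9.176)  # about 0.009176
ala_ala_percent = per_mille_to_percent(9.176)    # about 0.9176 %
```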

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.176AlaAla: 9.176 ± 1.496
0.706AlaCys: 0.706 ± 0.241
5.435AlaAsp: 5.435 ± 0.627
6.565AlaGlu: 6.565 ± 0.652
3.318AlaPhe: 3.318 ± 0.456
6.565AlaGly: 6.565 ± 0.937
0.847AlaHis: 0.847 ± 0.294
6.423AlaIle: 6.423 ± 0.913
5.576AlaLys: 5.576 ± 1.07
7.341AlaLeu: 7.341 ± 1.004
2.753AlaMet: 2.753 ± 0.502
3.176AlaAsn: 3.176 ± 0.446
3.741AlaPro: 3.741 ± 0.428
5.082AlaGln: 5.082 ± 1.006
5.153AlaArg: 5.153 ± 0.805
6.141AlaSer: 6.141 ± 0.715
6.141AlaThr: 6.141 ± 0.681
5.788AlaVal: 5.788 ± 0.802
1.271AlaTrp: 1.271 ± 0.297
2.682AlaTyr: 2.682 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
0.565CysAla: 0.565 ± 0.216
0.071CysCys: 0.071 ± 0.065
0.776CysAsp: 0.776 ± 0.354
0.282CysGlu: 0.282 ± 0.138
0.424CysPhe: 0.424 ± 0.195
0.424CysGly: 0.424 ± 0.201
0.212CysHis: 0.212 ± 0.101
0.424CysIle: 0.424 ± 0.151
0.282CysLys: 0.282 ± 0.128
0.282CysLeu: 0.282 ± 0.148
0.141CysMet: 0.141 ± 0.087
0.282CysAsn: 0.282 ± 0.133
0.706CysPro: 0.706 ± 0.309
0.141CysGln: 0.141 ± 0.103
0.565CysArg: 0.565 ± 0.228
0.494CysSer: 0.494 ± 0.177
0.494CysThr: 0.494 ± 0.21
0.565CysVal: 0.565 ± 0.188
0.071CysTrp: 0.071 ± 0.075
0.282CysTyr: 0.282 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
5.647AspAla: 5.647 ± 0.602
0.282AspCys: 0.282 ± 0.136
5.859AspAsp: 5.859 ± 1.258
4.87AspGlu: 4.87 ± 0.797
2.188AspPhe: 2.188 ± 0.394
5.576AspGly: 5.576 ± 0.833
1.412AspHis: 1.412 ± 0.309
3.247AspIle: 3.247 ± 0.667
4.094AspLys: 4.094 ± 0.605
4.447AspLeu: 4.447 ± 0.545
2.259AspMet: 2.259 ± 0.273
1.694AspAsn: 1.694 ± 0.432
4.094AspPro: 4.094 ± 0.73
1.694AspGln: 1.694 ± 0.339
2.823AspArg: 2.823 ± 0.488
2.612AspSer: 2.612 ± 0.513
4.094AspThr: 4.094 ± 0.563
4.87AspVal: 4.87 ± 0.633
1.553AspTrp: 1.553 ± 0.375
2.471AspTyr: 2.471 ± 0.361
0.0AspXaa: 0.0 ± 0.0
Glu
6.706GluAla: 6.706 ± 0.766
0.635GluCys: 0.635 ± 0.222
4.376GluAsp: 4.376 ± 0.663
4.941GluGlu: 4.941 ± 0.613
2.118GluPhe: 2.118 ± 0.453
4.023GluGly: 4.023 ± 0.62
1.059GluHis: 1.059 ± 0.351
4.447GluIle: 4.447 ± 0.517
2.965GluLys: 2.965 ± 0.373
4.376GluLeu: 4.376 ± 0.605
2.682GluMet: 2.682 ± 0.422
2.612GluAsn: 2.612 ± 0.383
2.682GluPro: 2.682 ± 0.552
2.118GluGln: 2.118 ± 0.428
3.741GluArg: 3.741 ± 0.472
4.8GluSer: 4.8 ± 0.439
3.106GluThr: 3.106 ± 0.45
5.576GluVal: 5.576 ± 0.705
1.553GluTrp: 1.553 ± 0.345
1.906GluTyr: 1.906 ± 0.307
0.0GluXaa: 0.0 ± 0.0
Phe
3.388PheAla: 3.388 ± 0.482
0.424PheCys: 0.424 ± 0.154
3.671PheAsp: 3.671 ± 0.57
3.035PheGlu: 3.035 ± 0.538
0.706PhePhe: 0.706 ± 0.191
2.965PheGly: 2.965 ± 0.438
0.635PheHis: 0.635 ± 0.187
1.271PheIle: 1.271 ± 0.326
2.259PheLys: 2.259 ± 0.471
1.765PheLeu: 1.765 ± 0.398
1.2PheMet: 1.2 ± 0.282
1.906PheAsn: 1.906 ± 0.321
1.271PhePro: 1.271 ± 0.326
0.776PheGln: 0.776 ± 0.19
2.188PheArg: 2.188 ± 0.419
2.4PheSer: 2.4 ± 0.448
2.118PheThr: 2.118 ± 0.44
1.835PheVal: 1.835 ± 0.3
0.282PheTrp: 0.282 ± 0.157
0.988PheTyr: 0.988 ± 0.206
0.0PheXaa: 0.0 ± 0.0
Gly
6.706GlyAla: 6.706 ± 0.903
0.847GlyCys: 0.847 ± 0.267
4.376GlyAsp: 4.376 ± 0.665
4.941GlyGlu: 4.941 ± 0.603
3.106GlyPhe: 3.106 ± 0.644
6.0GlyGly: 6.0 ± 0.969
1.2GlyHis: 1.2 ± 0.325
3.671GlyIle: 3.671 ± 0.544
4.094GlyLys: 4.094 ± 0.562
4.87GlyLeu: 4.87 ± 0.728
2.259GlyMet: 2.259 ± 0.343
2.471GlyAsn: 2.471 ± 0.403
1.835GlyPro: 1.835 ± 0.467
2.753GlyGln: 2.753 ± 0.653
4.376GlyArg: 4.376 ± 0.564
4.165GlySer: 4.165 ± 0.508
4.659GlyThr: 4.659 ± 0.689
5.929GlyVal: 5.929 ± 0.74
1.906GlyTrp: 1.906 ± 0.343
2.965GlyTyr: 2.965 ± 0.542
0.0GlyXaa: 0.0 ± 0.0
His
1.341HisAla: 1.341 ± 0.363
0.141HisCys: 0.141 ± 0.106
1.2HisAsp: 1.2 ± 0.326
1.059HisGlu: 1.059 ± 0.247
0.494HisPhe: 0.494 ± 0.209
1.059HisGly: 1.059 ± 0.249
0.353HisHis: 0.353 ± 0.178
1.341HisIle: 1.341 ± 0.293
0.918HisLys: 0.918 ± 0.288
1.271HisLeu: 1.271 ± 0.245
0.282HisMet: 0.282 ± 0.221
0.635HisAsn: 0.635 ± 0.176
0.494HisPro: 0.494 ± 0.154
0.494HisGln: 0.494 ± 0.253
1.271HisArg: 1.271 ± 0.303
0.565HisSer: 0.565 ± 0.23
0.988HisThr: 0.988 ± 0.378
1.2HisVal: 1.2 ± 0.39
0.353HisTrp: 0.353 ± 0.183
1.129HisTyr: 1.129 ± 0.312
0.0HisXaa: 0.0 ± 0.0
Ile
6.0IleAla: 6.0 ± 0.643
0.635IleCys: 0.635 ± 0.224
4.447IleAsp: 4.447 ± 0.382
4.235IleGlu: 4.235 ± 0.472
1.623IlePhe: 1.623 ± 0.377
4.729IleGly: 4.729 ± 0.572
1.341IleHis: 1.341 ± 0.286
3.529IleIle: 3.529 ± 0.594
3.035IleLys: 3.035 ± 0.437
2.965IleLeu: 2.965 ± 0.32
1.2IleMet: 1.2 ± 0.326
2.823IleAsn: 2.823 ± 0.333
2.541IlePro: 2.541 ± 0.417
1.765IleGln: 1.765 ± 0.316
3.035IleArg: 3.035 ± 0.46
3.6IleSer: 3.6 ± 0.512
3.176IleThr: 3.176 ± 0.419
3.882IleVal: 3.882 ± 0.559
0.565IleTrp: 0.565 ± 0.189
1.553IleTyr: 1.553 ± 0.327
0.0IleXaa: 0.0 ± 0.0
Lys
4.518LysAla: 4.518 ± 0.677
0.494LysCys: 0.494 ± 0.204
3.459LysAsp: 3.459 ± 0.53
3.812LysGlu: 3.812 ± 0.567
1.623LysPhe: 1.623 ± 0.385
3.741LysGly: 3.741 ± 0.509
1.482LysHis: 1.482 ± 0.274
3.741LysIle: 3.741 ± 0.572
3.459LysLys: 3.459 ± 0.542
3.882LysLeu: 3.882 ± 0.602
1.906LysMet: 1.906 ± 0.381
2.4LysAsn: 2.4 ± 0.456
2.823LysPro: 2.823 ± 0.516
2.188LysGln: 2.188 ± 0.661
3.953LysArg: 3.953 ± 0.638
3.247LysSer: 3.247 ± 0.541
2.894LysThr: 2.894 ± 0.468
2.4LysVal: 2.4 ± 0.353
1.129LysTrp: 1.129 ± 0.398
2.188LysTyr: 2.188 ± 0.401
0.0LysXaa: 0.0 ± 0.0
Leu
8.47LeuAla: 8.47 ± 1.312
0.635LeuCys: 0.635 ± 0.166
4.659LeuAsp: 4.659 ± 0.644
4.518LeuGlu: 4.518 ± 0.767
2.612LeuPhe: 2.612 ± 0.492
4.659LeuGly: 4.659 ± 0.774
0.988LeuHis: 0.988 ± 0.287
3.106LeuIle: 3.106 ± 0.439
4.8LeuLys: 4.8 ± 0.672
4.306LeuLeu: 4.306 ± 0.633
1.906LeuMet: 1.906 ± 0.475
3.671LeuAsn: 3.671 ± 0.529
4.094LeuPro: 4.094 ± 0.784
2.894LeuGln: 2.894 ± 0.554
3.671LeuArg: 3.671 ± 0.52
4.941LeuSer: 4.941 ± 0.518
4.87LeuThr: 4.87 ± 0.543
4.235LeuVal: 4.235 ± 0.461
0.988LeuTrp: 0.988 ± 0.238
1.623LeuTyr: 1.623 ± 0.338
0.0LeuXaa: 0.0 ± 0.0
Met
3.459MetAla: 3.459 ± 0.408
0.212MetCys: 0.212 ± 0.117
1.694MetAsp: 1.694 ± 0.328
1.341MetGlu: 1.341 ± 0.374
0.847MetPhe: 0.847 ± 0.255
1.906MetGly: 1.906 ± 0.384
0.706MetHis: 0.706 ± 0.23
1.482MetIle: 1.482 ± 0.362
1.765MetLys: 1.765 ± 0.423
2.259MetLeu: 2.259 ± 0.325
0.635MetMet: 0.635 ± 0.186
1.553MetAsn: 1.553 ± 0.421
1.976MetPro: 1.976 ± 0.436
1.341MetGln: 1.341 ± 0.275
2.188MetArg: 2.188 ± 0.434
2.329MetSer: 2.329 ± 0.563
1.553MetThr: 1.553 ± 0.307
1.765MetVal: 1.765 ± 0.245
0.353MetTrp: 0.353 ± 0.152
1.129MetTyr: 1.129 ± 0.313
0.0MetXaa: 0.0 ± 0.0
Asn
4.376AsnAla: 4.376 ± 0.57
0.141AsnCys: 0.141 ± 0.101
1.341AsnAsp: 1.341 ± 0.305
2.682AsnGlu: 2.682 ± 0.42
1.2AsnPhe: 1.2 ± 0.255
4.376AsnGly: 4.376 ± 0.535
0.494AsnHis: 0.494 ± 0.206
1.976AsnIle: 1.976 ± 0.319
1.906AsnLys: 1.906 ± 0.348
3.388AsnLeu: 3.388 ± 0.477
0.776AsnMet: 0.776 ± 0.24
1.906AsnAsn: 1.906 ± 0.529
2.823AsnPro: 2.823 ± 0.484
1.765AsnGln: 1.765 ± 0.339
2.329AsnArg: 2.329 ± 0.389
2.471AsnSer: 2.471 ± 0.396
2.894AsnThr: 2.894 ± 0.483
2.118AsnVal: 2.118 ± 0.326
0.776AsnTrp: 0.776 ± 0.224
1.765AsnTyr: 1.765 ± 0.49
0.0AsnXaa: 0.0 ± 0.0
Pro
3.812ProAla: 3.812 ± 0.449
0.282ProCys: 0.282 ± 0.208
4.306ProAsp: 4.306 ± 0.75
3.812ProGlu: 3.812 ± 0.467
1.835ProPhe: 1.835 ± 0.346
3.388ProGly: 3.388 ± 0.522
1.129ProHis: 1.129 ± 0.295
2.965ProIle: 2.965 ± 0.454
2.682ProLys: 2.682 ± 0.485
2.965ProLeu: 2.965 ± 0.429
1.694ProMet: 1.694 ± 0.508
2.047ProAsn: 2.047 ± 0.38
1.906ProPro: 1.906 ± 0.584
1.412ProGln: 1.412 ± 0.457
1.482ProArg: 1.482 ± 0.337
3.035ProSer: 3.035 ± 0.563
3.247ProThr: 3.247 ± 0.577
2.682ProVal: 2.682 ± 0.629
0.918ProTrp: 0.918 ± 0.284
2.329ProTyr: 2.329 ± 0.526
0.0ProXaa: 0.0 ± 0.0
Gln
3.812GlnAla: 3.812 ± 0.822
0.141GlnCys: 0.141 ± 0.099
1.623GlnAsp: 1.623 ± 0.29
1.835GlnGlu: 1.835 ± 0.393
1.482GlnPhe: 1.482 ± 0.368
2.823GlnGly: 2.823 ± 0.395
0.847GlnHis: 0.847 ± 0.228
2.259GlnIle: 2.259 ± 0.45
2.4GlnLys: 2.4 ± 0.647
3.176GlnLeu: 3.176 ± 0.484
1.694GlnMet: 1.694 ± 0.509
1.2GlnAsn: 1.2 ± 0.234
1.835GlnPro: 1.835 ± 0.465
1.906GlnGln: 1.906 ± 0.505
2.118GlnArg: 2.118 ± 0.449
2.259GlnSer: 2.259 ± 0.566
2.4GlnThr: 2.4 ± 0.455
3.388GlnVal: 3.388 ± 0.871
0.424GlnTrp: 0.424 ± 0.158
1.412GlnTyr: 1.412 ± 0.258
0.0GlnXaa: 0.0 ± 0.0
Arg
4.87ArgAla: 4.87 ± 0.732
0.353ArgCys: 0.353 ± 0.128
3.459ArgAsp: 3.459 ± 0.463
2.682ArgGlu: 2.682 ± 0.503
2.4ArgPhe: 2.4 ± 0.311
2.823ArgGly: 2.823 ± 0.536
0.776ArgHis: 0.776 ± 0.236
3.035ArgIle: 3.035 ± 0.329
3.882ArgLys: 3.882 ± 0.57
4.306ArgLeu: 4.306 ± 0.541
1.976ArgMet: 1.976 ± 0.377
2.259ArgAsn: 2.259 ± 0.419
2.541ArgPro: 2.541 ± 0.425
1.906ArgGln: 1.906 ± 0.47
2.753ArgArg: 2.753 ± 0.478
3.176ArgSer: 3.176 ± 0.445
3.106ArgThr: 3.106 ± 0.423
4.376ArgVal: 4.376 ± 0.597
0.847ArgTrp: 0.847 ± 0.244
1.765ArgTyr: 1.765 ± 0.418
0.0ArgXaa: 0.0 ± 0.0
Ser
5.012SerAla: 5.012 ± 0.679
0.494SerCys: 0.494 ± 0.154
3.812SerAsp: 3.812 ± 0.627
2.823SerGlu: 2.823 ± 0.373
2.329SerPhe: 2.329 ± 0.348
5.012SerGly: 5.012 ± 0.403
0.918SerHis: 0.918 ± 0.263
3.671SerIle: 3.671 ± 0.422
3.318SerLys: 3.318 ± 0.611
5.082SerLeu: 5.082 ± 0.503
2.188SerMet: 2.188 ± 0.413
2.753SerAsn: 2.753 ± 0.641
3.741SerPro: 3.741 ± 0.524
2.965SerGln: 2.965 ± 0.727
2.753SerArg: 2.753 ± 0.554
3.882SerSer: 3.882 ± 0.56
2.965SerThr: 2.965 ± 0.502
4.87SerVal: 4.87 ± 0.547
1.2SerTrp: 1.2 ± 0.376
2.612SerTyr: 2.612 ± 0.496
0.0SerXaa: 0.0 ± 0.0
Thr
4.729ThrAla: 4.729 ± 0.703
0.141ThrCys: 0.141 ± 0.104
4.165ThrAsp: 4.165 ± 0.598
3.247ThrGlu: 3.247 ± 0.426
2.471ThrPhe: 2.471 ± 0.44
4.729ThrGly: 4.729 ± 0.52
0.353ThrHis: 0.353 ± 0.201
3.882ThrIle: 3.882 ± 0.656
3.247ThrLys: 3.247 ± 0.547
5.153ThrLeu: 5.153 ± 0.568
1.765ThrMet: 1.765 ± 0.357
2.682ThrAsn: 2.682 ± 0.375
3.671ThrPro: 3.671 ± 0.508
2.259ThrGln: 2.259 ± 0.401
2.188ThrArg: 2.188 ± 0.325
3.953ThrSer: 3.953 ± 0.548
4.376ThrThr: 4.376 ± 0.799
5.929ThrVal: 5.929 ± 0.906
0.918ThrTrp: 0.918 ± 0.311
1.623ThrTyr: 1.623 ± 0.383
0.0ThrXaa: 0.0 ± 0.0
Val
7.129ValAla: 7.129 ± 0.857
0.494ValCys: 0.494 ± 0.179
3.953ValAsp: 3.953 ± 0.424
5.294ValGlu: 5.294 ± 0.53
2.259ValPhe: 2.259 ± 0.286
4.729ValGly: 4.729 ± 0.59
0.988ValHis: 0.988 ± 0.291
4.094ValIle: 4.094 ± 0.536
2.823ValLys: 2.823 ± 0.412
5.647ValLeu: 5.647 ± 0.647
1.694ValMet: 1.694 ± 0.385
3.459ValAsn: 3.459 ± 0.493
2.682ValPro: 2.682 ± 0.36
3.176ValGln: 3.176 ± 0.411
3.953ValArg: 3.953 ± 0.725
4.376ValSer: 4.376 ± 0.581
5.365ValThr: 5.365 ± 0.617
5.012ValVal: 5.012 ± 0.779
1.129ValTrp: 1.129 ± 0.272
1.976ValTyr: 1.976 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
1.835TrpAla: 1.835 ± 0.472
0.141TrpCys: 0.141 ± 0.104
1.412TrpAsp: 1.412 ± 0.34
1.482TrpGlu: 1.482 ± 0.433
0.847TrpPhe: 0.847 ± 0.198
1.271TrpGly: 1.271 ± 0.22
0.353TrpHis: 0.353 ± 0.206
0.635TrpIle: 0.635 ± 0.182
0.706TrpLys: 0.706 ± 0.312
1.482TrpLeu: 1.482 ± 0.362
0.212TrpMet: 0.212 ± 0.133
0.776TrpAsn: 0.776 ± 0.263
0.706TrpPro: 0.706 ± 0.255
1.059TrpGln: 1.059 ± 0.265
0.635TrpArg: 0.635 ± 0.219
1.129TrpSer: 1.129 ± 0.255
0.494TrpThr: 0.494 ± 0.199
1.129TrpVal: 1.129 ± 0.373
0.212TrpTrp: 0.212 ± 0.13
0.635TrpTyr: 0.635 ± 0.22
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.329TyrAla: 2.329 ± 0.363
0.212TyrCys: 0.212 ± 0.185
1.765TyrAsp: 1.765 ± 0.298
2.965TyrGlu: 2.965 ± 0.514
1.2TyrPhe: 1.2 ± 0.324
2.4TyrGly: 2.4 ± 0.57
0.494TyrHis: 0.494 ± 0.2
1.482TyrIle: 1.482 ± 0.37
1.129TyrLys: 1.129 ± 0.247
2.682TyrLeu: 2.682 ± 0.392
1.2TyrMet: 1.2 ± 0.268
1.2TyrAsn: 1.2 ± 0.233
1.765TyrPro: 1.765 ± 0.428
1.341TyrGln: 1.341 ± 0.351
2.118TyrArg: 2.118 ± 0.377
2.823TyrSer: 2.823 ± 0.477
2.541TyrThr: 2.541 ± 0.294
2.682TyrVal: 2.682 ± 0.366
0.706TyrTrp: 0.706 ± 0.284
1.412TyrTyr: 1.412 ± 0.367
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (14168 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
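A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration: the page states only that 100 bootstrap rounds were run at the protein level, so resampling whole proteins with replacement and reporting the sample standard deviation as the ± value are guesses, and the names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over all overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency. Using the standard deviation of the
    estimates as the error is an assumption, not confirmed by the source.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.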

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski