Amino acid dipepetide frequency for Pectobacterium phage Q19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.4AlaAla: 10.4 ± 1.112
1.065AlaCys: 1.065 ± 0.281
6.715AlaAsp: 6.715 ± 0.81
4.094AlaGlu: 4.094 ± 0.62
2.866AlaPhe: 2.866 ± 0.472
6.305AlaGly: 6.305 ± 0.575
1.474AlaHis: 1.474 ± 0.386
4.995AlaIle: 4.995 ± 0.648
6.142AlaLys: 6.142 ± 0.658
7.124AlaLeu: 7.124 ± 1.084
2.866AlaMet: 2.866 ± 0.568
4.422AlaAsn: 4.422 ± 0.525
2.948AlaPro: 2.948 ± 0.437
4.012AlaGln: 4.012 ± 0.621
4.831AlaArg: 4.831 ± 0.73
5.405AlaSer: 5.405 ± 0.821
5.323AlaThr: 5.323 ± 0.618
5.486AlaVal: 5.486 ± 0.536
1.146AlaTrp: 1.146 ± 0.386
2.62AlaTyr: 2.62 ± 0.557
0.0AlaXaa: 0.0 ± 0.0
Cys
0.901CysAla: 0.901 ± 0.263
0.164CysCys: 0.164 ± 0.139
0.573CysAsp: 0.573 ± 0.28
0.819CysGlu: 0.819 ± 0.234
0.573CysPhe: 0.573 ± 0.257
0.328CysGly: 0.328 ± 0.158
0.246CysHis: 0.246 ± 0.139
0.409CysIle: 0.409 ± 0.196
0.409CysLys: 0.409 ± 0.215
0.573CysLeu: 0.573 ± 0.201
0.246CysMet: 0.246 ± 0.129
0.328CysAsn: 0.328 ± 0.179
0.655CysPro: 0.655 ± 0.227
0.246CysGln: 0.246 ± 0.144
0.573CysArg: 0.573 ± 0.377
0.409CysSer: 0.409 ± 0.193
0.082CysThr: 0.082 ± 0.085
0.983CysVal: 0.983 ± 0.315
0.246CysTrp: 0.246 ± 0.149
0.328CysTyr: 0.328 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.568AspAla: 5.568 ± 0.439
0.573AspCys: 0.573 ± 0.306
4.749AspAsp: 4.749 ± 0.708
4.176AspGlu: 4.176 ± 0.495
2.948AspPhe: 2.948 ± 0.507
6.469AspGly: 6.469 ± 0.68
1.065AspHis: 1.065 ± 0.272
3.112AspIle: 3.112 ± 0.358
3.849AspLys: 3.849 ± 0.549
4.749AspLeu: 4.749 ± 0.923
1.638AspMet: 1.638 ± 0.357
2.211AspAsn: 2.211 ± 0.368
2.702AspPro: 2.702 ± 0.522
2.129AspGln: 2.129 ± 0.445
2.948AspArg: 2.948 ± 0.415
3.521AspSer: 3.521 ± 0.551
3.603AspThr: 3.603 ± 0.584
4.668AspVal: 4.668 ± 0.537
1.31AspTrp: 1.31 ± 0.344
2.375AspTyr: 2.375 ± 0.423
0.0AspXaa: 0.0 ± 0.0
Glu
6.797GluAla: 6.797 ± 0.808
0.573GluCys: 0.573 ± 0.267
4.094GluAsp: 4.094 ± 0.453
4.668GluGlu: 4.668 ± 0.842
2.293GluPhe: 2.293 ± 0.458
5.241GluGly: 5.241 ± 0.854
1.146GluHis: 1.146 ± 0.371
2.538GluIle: 2.538 ± 0.347
2.293GluLys: 2.293 ± 0.434
5.077GluLeu: 5.077 ± 0.532
2.129GluMet: 2.129 ± 0.42
2.538GluAsn: 2.538 ± 0.363
1.474GluPro: 1.474 ± 0.388
3.603GluGln: 3.603 ± 0.658
3.439GluArg: 3.439 ± 0.512
3.931GluSer: 3.931 ± 0.466
3.275GluThr: 3.275 ± 0.508
4.176GluVal: 4.176 ± 0.534
1.065GluTrp: 1.065 ± 0.257
2.62GluTyr: 2.62 ± 0.378
0.0GluXaa: 0.0 ± 0.0
Phe
2.457PheAla: 2.457 ± 0.484
0.246PheCys: 0.246 ± 0.169
2.948PheAsp: 2.948 ± 0.441
1.065PheGlu: 1.065 ± 0.352
1.31PhePhe: 1.31 ± 0.342
3.275PheGly: 3.275 ± 0.432
0.655PheHis: 0.655 ± 0.224
1.638PheIle: 1.638 ± 0.402
2.047PheLys: 2.047 ± 0.356
3.685PheLeu: 3.685 ± 0.5
1.146PheMet: 1.146 ± 0.332
2.047PheAsn: 2.047 ± 0.336
1.72PhePro: 1.72 ± 0.446
1.228PheGln: 1.228 ± 0.329
1.802PheArg: 1.802 ± 0.365
3.03PheSer: 3.03 ± 0.431
3.275PheThr: 3.275 ± 0.683
2.375PheVal: 2.375 ± 0.467
0.573PheTrp: 0.573 ± 0.223
0.983PheTyr: 0.983 ± 0.294
0.0PheXaa: 0.0 ± 0.0
Gly
6.469GlyAla: 6.469 ± 0.602
0.819GlyCys: 0.819 ± 0.344
5.814GlyAsp: 5.814 ± 0.641
4.422GlyGlu: 4.422 ± 0.738
3.03GlyPhe: 3.03 ± 0.43
6.469GlyGly: 6.469 ± 1.038
1.31GlyHis: 1.31 ± 0.347
4.586GlyIle: 4.586 ± 0.696
5.978GlyLys: 5.978 ± 0.689
6.305GlyLeu: 6.305 ± 0.754
2.375GlyMet: 2.375 ± 0.422
3.03GlyAsn: 3.03 ± 0.477
1.228GlyPro: 1.228 ± 0.363
3.439GlyGln: 3.439 ± 0.573
4.012GlyArg: 4.012 ± 0.463
4.831GlySer: 4.831 ± 0.779
3.931GlyThr: 3.931 ± 0.637
5.486GlyVal: 5.486 ± 0.713
1.638GlyTrp: 1.638 ± 0.412
2.784GlyTyr: 2.784 ± 0.44
0.0GlyXaa: 0.0 ± 0.0
His
1.474HisAla: 1.474 ± 0.313
0.164HisCys: 0.164 ± 0.099
1.31HisAsp: 1.31 ± 0.366
1.228HisGlu: 1.228 ± 0.401
0.819HisPhe: 0.819 ± 0.234
1.474HisGly: 1.474 ± 0.339
0.655HisHis: 0.655 ± 0.27
1.31HisIle: 1.31 ± 0.338
1.31HisLys: 1.31 ± 0.269
1.965HisLeu: 1.965 ± 0.415
0.573HisMet: 0.573 ± 0.2
0.328HisAsn: 0.328 ± 0.151
0.246HisPro: 0.246 ± 0.151
0.573HisGln: 0.573 ± 0.211
0.983HisArg: 0.983 ± 0.385
1.228HisSer: 1.228 ± 0.354
1.31HisThr: 1.31 ± 0.32
1.065HisVal: 1.065 ± 0.231
0.409HisTrp: 0.409 ± 0.161
0.409HisTyr: 0.409 ± 0.176
0.0HisXaa: 0.0 ± 0.0
Ile
4.668IleAla: 4.668 ± 0.587
0.737IleCys: 0.737 ± 0.243
3.194IleAsp: 3.194 ± 0.623
2.866IleGlu: 2.866 ± 0.449
1.228IlePhe: 1.228 ± 0.228
3.439IleGly: 3.439 ± 0.574
1.146IleHis: 1.146 ± 0.221
2.538IleIle: 2.538 ± 0.555
3.439IleLys: 3.439 ± 0.488
3.439IleLeu: 3.439 ± 0.471
1.638IleMet: 1.638 ± 0.41
2.375IleAsn: 2.375 ± 0.541
3.275IlePro: 3.275 ± 0.433
1.146IleGln: 1.146 ± 0.297
3.521IleArg: 3.521 ± 0.545
3.194IleSer: 3.194 ± 0.606
3.194IleThr: 3.194 ± 0.546
3.439IleVal: 3.439 ± 0.581
0.491IleTrp: 0.491 ± 0.209
1.392IleTyr: 1.392 ± 0.344
0.0IleXaa: 0.0 ± 0.0
Lys
7.452LysAla: 7.452 ± 0.866
0.409LysCys: 0.409 ± 0.194
3.194LysAsp: 3.194 ± 0.552
3.521LysGlu: 3.521 ± 0.674
2.866LysPhe: 2.866 ± 0.49
5.568LysGly: 5.568 ± 0.793
1.31LysHis: 1.31 ± 0.365
2.293LysIle: 2.293 ± 0.416
3.194LysLys: 3.194 ± 0.706
4.749LysLeu: 4.749 ± 0.778
1.392LysMet: 1.392 ± 0.315
2.293LysAsn: 2.293 ± 0.33
2.293LysPro: 2.293 ± 0.411
2.702LysGln: 2.702 ± 0.542
3.439LysArg: 3.439 ± 0.725
2.948LysSer: 2.948 ± 0.497
3.439LysThr: 3.439 ± 0.527
6.469LysVal: 6.469 ± 0.73
0.409LysTrp: 0.409 ± 0.18
1.965LysTyr: 1.965 ± 0.285
0.0LysXaa: 0.0 ± 0.0
Leu
6.96LeuAla: 6.96 ± 0.756
0.246LeuCys: 0.246 ± 0.135
4.34LeuAsp: 4.34 ± 0.702
6.715LeuGlu: 6.715 ± 0.803
1.802LeuPhe: 1.802 ± 0.368
5.732LeuGly: 5.732 ± 0.723
1.638LeuHis: 1.638 ± 0.394
3.357LeuIle: 3.357 ± 0.462
6.797LeuLys: 6.797 ± 0.807
5.241LeuLeu: 5.241 ± 0.781
2.948LeuMet: 2.948 ± 0.503
4.34LeuAsn: 4.34 ± 0.598
3.194LeuPro: 3.194 ± 0.506
3.521LeuGln: 3.521 ± 0.606
4.422LeuArg: 4.422 ± 0.685
5.732LeuSer: 5.732 ± 0.718
5.896LeuThr: 5.896 ± 0.814
6.223LeuVal: 6.223 ± 0.588
1.065LeuTrp: 1.065 ± 0.305
2.211LeuTyr: 2.211 ± 0.433
0.0LeuXaa: 0.0 ± 0.0
Met
2.948MetAla: 2.948 ± 0.433
0.246MetCys: 0.246 ± 0.163
2.047MetAsp: 2.047 ± 0.483
1.883MetGlu: 1.883 ± 0.419
1.065MetPhe: 1.065 ± 0.229
1.883MetGly: 1.883 ± 0.408
0.164MetHis: 0.164 ± 0.146
1.638MetIle: 1.638 ± 0.389
1.392MetLys: 1.392 ± 0.29
2.62MetLeu: 2.62 ± 0.362
0.737MetMet: 0.737 ± 0.281
1.065MetAsn: 1.065 ± 0.297
1.228MetPro: 1.228 ± 0.305
1.474MetGln: 1.474 ± 0.365
1.065MetArg: 1.065 ± 0.279
1.474MetSer: 1.474 ± 0.343
1.638MetThr: 1.638 ± 0.331
2.129MetVal: 2.129 ± 0.349
0.082MetTrp: 0.082 ± 0.081
0.409MetTyr: 0.409 ± 0.135
0.0MetXaa: 0.0 ± 0.0
Asn
4.586AsnAla: 4.586 ± 0.577
0.328AsnCys: 0.328 ± 0.154
3.439AsnAsp: 3.439 ± 0.446
2.293AsnGlu: 2.293 ± 0.53
1.392AsnPhe: 1.392 ± 0.315
4.176AsnGly: 4.176 ± 0.699
1.065AsnHis: 1.065 ± 0.408
2.784AsnIle: 2.784 ± 0.532
2.293AsnLys: 2.293 ± 0.527
3.439AsnLeu: 3.439 ± 0.57
1.146AsnMet: 1.146 ± 0.194
1.474AsnAsn: 1.474 ± 0.316
3.112AsnPro: 3.112 ± 0.529
1.638AsnGln: 1.638 ± 0.37
2.211AsnArg: 2.211 ± 0.496
2.948AsnSer: 2.948 ± 0.69
1.965AsnThr: 1.965 ± 0.413
2.866AsnVal: 2.866 ± 0.445
0.409AsnTrp: 0.409 ± 0.139
1.965AsnTyr: 1.965 ± 0.434
0.0AsnXaa: 0.0 ± 0.0
Pro
2.866ProAla: 2.866 ± 0.429
0.409ProCys: 0.409 ± 0.198
2.129ProAsp: 2.129 ± 0.335
4.094ProGlu: 4.094 ± 0.628
1.228ProPhe: 1.228 ± 0.316
1.883ProGly: 1.883 ± 0.369
0.573ProHis: 0.573 ± 0.177
1.065ProIle: 1.065 ± 0.296
2.457ProLys: 2.457 ± 0.521
2.538ProLeu: 2.538 ± 0.44
0.983ProMet: 0.983 ± 0.328
2.293ProAsn: 2.293 ± 0.521
0.983ProPro: 0.983 ± 0.232
1.556ProGln: 1.556 ± 0.372
1.31ProArg: 1.31 ± 0.321
2.293ProSer: 2.293 ± 0.326
2.866ProThr: 2.866 ± 0.49
2.538ProVal: 2.538 ± 0.397
0.901ProTrp: 0.901 ± 0.247
2.129ProTyr: 2.129 ± 0.511
0.0ProXaa: 0.0 ± 0.0
Gln
4.504GlnAla: 4.504 ± 0.711
0.082GlnCys: 0.082 ± 0.083
2.129GlnAsp: 2.129 ± 0.428
2.211GlnGlu: 2.211 ± 0.52
2.538GlnPhe: 2.538 ± 0.379
2.538GlnGly: 2.538 ± 0.49
0.573GlnHis: 0.573 ± 0.236
2.129GlnIle: 2.129 ± 0.361
2.129GlnLys: 2.129 ± 0.482
4.34GlnLeu: 4.34 ± 0.515
0.901GlnMet: 0.901 ± 0.263
1.802GlnAsn: 1.802 ± 0.383
1.638GlnPro: 1.638 ± 0.353
2.211GlnGln: 2.211 ± 0.571
2.457GlnArg: 2.457 ± 0.592
2.866GlnSer: 2.866 ± 0.499
1.474GlnThr: 1.474 ± 0.373
2.702GlnVal: 2.702 ± 0.494
0.655GlnTrp: 0.655 ± 0.207
1.065GlnTyr: 1.065 ± 0.375
0.0GlnXaa: 0.0 ± 0.0
Arg
3.521ArgAla: 3.521 ± 0.319
0.901ArgCys: 0.901 ± 0.245
3.357ArgAsp: 3.357 ± 0.552
4.012ArgGlu: 4.012 ± 0.58
1.638ArgPhe: 1.638 ± 0.444
3.521ArgGly: 3.521 ± 0.419
0.737ArgHis: 0.737 ± 0.335
2.62ArgIle: 2.62 ± 0.493
3.685ArgLys: 3.685 ± 0.626
5.241ArgLeu: 5.241 ± 0.618
1.065ArgMet: 1.065 ± 0.236
2.866ArgAsn: 2.866 ± 0.448
2.211ArgPro: 2.211 ± 0.472
2.129ArgGln: 2.129 ± 0.47
2.702ArgArg: 2.702 ± 0.458
3.931ArgSer: 3.931 ± 0.517
3.194ArgThr: 3.194 ± 0.431
3.603ArgVal: 3.603 ± 0.582
0.655ArgTrp: 0.655 ± 0.317
1.474ArgTyr: 1.474 ± 0.302
0.0ArgXaa: 0.0 ± 0.0
Ser
4.749SerAla: 4.749 ± 0.547
0.573SerCys: 0.573 ± 0.243
4.012SerAsp: 4.012 ± 0.629
3.521SerGlu: 3.521 ± 0.626
2.62SerPhe: 2.62 ± 0.357
5.65SerGly: 5.65 ± 0.759
1.065SerHis: 1.065 ± 0.328
3.849SerIle: 3.849 ± 0.642
3.194SerLys: 3.194 ± 0.638
4.831SerLeu: 4.831 ± 0.699
0.819SerMet: 0.819 ± 0.253
2.866SerAsn: 2.866 ± 0.381
2.211SerPro: 2.211 ± 0.363
3.194SerGln: 3.194 ± 0.547
3.767SerArg: 3.767 ± 0.495
3.685SerSer: 3.685 ± 0.627
3.521SerThr: 3.521 ± 0.581
4.012SerVal: 4.012 ± 0.393
0.819SerTrp: 0.819 ± 0.244
2.866SerTyr: 2.866 ± 0.615
0.0SerXaa: 0.0 ± 0.0
Thr
4.504ThrAla: 4.504 ± 0.721
0.655ThrCys: 0.655 ± 0.263
2.866ThrAsp: 2.866 ± 0.373
3.603ThrGlu: 3.603 ± 0.408
2.457ThrPhe: 2.457 ± 0.546
5.732ThrGly: 5.732 ± 0.752
1.146ThrHis: 1.146 ± 0.222
4.094ThrIle: 4.094 ± 0.67
4.422ThrLys: 4.422 ± 0.626
6.142ThrLeu: 6.142 ± 0.758
1.474ThrMet: 1.474 ± 0.339
3.194ThrAsn: 3.194 ± 0.575
2.948ThrPro: 2.948 ± 0.436
1.883ThrGln: 1.883 ± 0.406
2.293ThrArg: 2.293 ± 0.462
3.521ThrSer: 3.521 ± 0.475
3.685ThrThr: 3.685 ± 0.586
3.439ThrVal: 3.439 ± 0.613
0.655ThrTrp: 0.655 ± 0.202
1.638ThrTyr: 1.638 ± 0.317
0.0ThrXaa: 0.0 ± 0.0
Val
5.732ValAla: 5.732 ± 0.566
0.491ValCys: 0.491 ± 0.223
3.521ValAsp: 3.521 ± 0.479
5.077ValGlu: 5.077 ± 0.607
3.357ValPhe: 3.357 ± 0.672
5.077ValGly: 5.077 ± 0.667
1.883ValHis: 1.883 ± 0.376
3.194ValIle: 3.194 ± 0.513
4.094ValLys: 4.094 ± 0.527
5.486ValLeu: 5.486 ± 0.554
1.556ValMet: 1.556 ± 0.25
3.767ValAsn: 3.767 ± 0.574
1.802ValPro: 1.802 ± 0.326
2.866ValGln: 2.866 ± 0.35
4.34ValArg: 4.34 ± 0.572
4.668ValSer: 4.668 ± 0.554
5.323ValThr: 5.323 ± 0.704
5.405ValVal: 5.405 ± 0.806
1.065ValTrp: 1.065 ± 0.314
2.457ValTyr: 2.457 ± 0.517
0.0ValXaa: 0.0 ± 0.0
Trp
0.819TrpAla: 0.819 ± 0.236
0.164TrpCys: 0.164 ± 0.119
0.819TrpAsp: 0.819 ± 0.286
0.901TrpGlu: 0.901 ± 0.282
0.328TrpPhe: 0.328 ± 0.171
0.983TrpGly: 0.983 ± 0.267
0.328TrpHis: 0.328 ± 0.195
0.328TrpIle: 0.328 ± 0.158
0.901TrpLys: 0.901 ± 0.253
1.802TrpLeu: 1.802 ± 0.418
0.573TrpMet: 0.573 ± 0.184
0.573TrpAsn: 0.573 ± 0.221
0.164TrpPro: 0.164 ± 0.115
0.328TrpGln: 0.328 ± 0.126
0.983TrpArg: 0.983 ± 0.233
1.146TrpSer: 1.146 ± 0.368
0.983TrpThr: 0.983 ± 0.231
1.556TrpVal: 1.556 ± 0.625
0.246TrpTrp: 0.246 ± 0.118
0.409TrpTyr: 0.409 ± 0.262
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.866TyrAla: 2.866 ± 0.493
0.246TyrCys: 0.246 ± 0.128
3.112TyrAsp: 3.112 ± 0.632
1.802TyrGlu: 1.802 ± 0.372
1.146TyrPhe: 1.146 ± 0.359
2.211TyrGly: 2.211 ± 0.446
0.655TyrHis: 0.655 ± 0.278
1.883TyrIle: 1.883 ± 0.431
1.965TyrLys: 1.965 ± 0.418
3.03TyrLeu: 3.03 ± 0.518
0.901TyrMet: 0.901 ± 0.232
1.72TyrAsn: 1.72 ± 0.351
1.065TyrPro: 1.065 ± 0.358
1.065TyrGln: 1.065 ± 0.486
1.965TyrArg: 1.965 ± 0.368
1.065TyrSer: 1.065 ± 0.291
2.375TyrThr: 2.375 ± 0.386
2.538TyrVal: 2.538 ± 0.383
0.491TyrTrp: 0.491 ± 0.187
0.819TyrTyr: 0.819 ± 0.28
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12213 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski