Amino acid dipepetide frequency for Salmonella phage SETP3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.067AlaAla: 11.067 ± 1.701
1.137AlaCys: 1.137 ± 0.32
6.443AlaAsp: 6.443 ± 0.788
6.064AlaGlu: 6.064 ± 0.685
3.714AlaPhe: 3.714 ± 0.518
7.201AlaGly: 7.201 ± 0.788
1.895AlaHis: 1.895 ± 0.422
4.776AlaIle: 4.776 ± 0.905
6.064AlaLys: 6.064 ± 0.935
7.277AlaLeu: 7.277 ± 0.716
1.668AlaMet: 1.668 ± 0.314
3.032AlaAsn: 3.032 ± 0.497
3.563AlaPro: 3.563 ± 0.55
4.169AlaGln: 4.169 ± 1.214
4.245AlaArg: 4.245 ± 0.533
5.761AlaSer: 5.761 ± 0.696
5.761AlaThr: 5.761 ± 0.76
7.277AlaVal: 7.277 ± 0.855
1.213AlaTrp: 1.213 ± 0.31
3.032AlaTyr: 3.032 ± 0.464
0.0AlaXaa: 0.0 ± 0.0
Cys
0.91CysAla: 0.91 ± 0.249
0.152CysCys: 0.152 ± 0.124
0.758CysAsp: 0.758 ± 0.208
1.137CysGlu: 1.137 ± 0.346
0.227CysPhe: 0.227 ± 0.139
0.606CysGly: 0.606 ± 0.207
0.152CysHis: 0.152 ± 0.112
0.303CysIle: 0.303 ± 0.164
1.061CysLys: 1.061 ± 0.26
0.985CysLeu: 0.985 ± 0.313
0.455CysMet: 0.455 ± 0.232
0.606CysAsn: 0.606 ± 0.238
0.152CysPro: 0.152 ± 0.101
0.379CysGln: 0.379 ± 0.225
0.758CysArg: 0.758 ± 0.262
0.455CysSer: 0.455 ± 0.247
0.455CysThr: 0.455 ± 0.139
0.531CysVal: 0.531 ± 0.217
0.227CysTrp: 0.227 ± 0.116
0.227CysTyr: 0.227 ± 0.139
0.0CysXaa: 0.0 ± 0.0
Asp
6.443AspAla: 6.443 ± 0.714
0.758AspCys: 0.758 ± 0.225
3.26AspAsp: 3.26 ± 0.469
3.942AspGlu: 3.942 ± 0.565
2.502AspPhe: 2.502 ± 0.382
6.292AspGly: 6.292 ± 0.739
0.682AspHis: 0.682 ± 0.257
3.335AspIle: 3.335 ± 0.376
3.032AspLys: 3.032 ± 0.431
4.548AspLeu: 4.548 ± 0.653
1.516AspMet: 1.516 ± 0.308
2.577AspAsn: 2.577 ± 0.55
1.743AspPro: 1.743 ± 0.372
0.606AspGln: 0.606 ± 0.193
2.881AspArg: 2.881 ± 0.388
3.79AspSer: 3.79 ± 0.527
3.942AspThr: 3.942 ± 0.448
3.639AspVal: 3.639 ± 0.507
1.061AspTrp: 1.061 ± 0.29
2.122AspTyr: 2.122 ± 0.429
0.0AspXaa: 0.0 ± 0.0
Glu
6.14GluAla: 6.14 ± 0.754
0.227GluCys: 0.227 ± 0.127
3.26GluAsp: 3.26 ± 0.624
4.472GluGlu: 4.472 ± 0.868
2.956GluPhe: 2.956 ± 0.576
5.079GluGly: 5.079 ± 0.807
0.834GluHis: 0.834 ± 0.258
3.639GluIle: 3.639 ± 0.45
4.624GluLys: 4.624 ± 0.668
6.443GluLeu: 6.443 ± 0.793
2.805GluMet: 2.805 ± 0.536
2.729GluAsn: 2.729 ± 0.425
1.971GluPro: 1.971 ± 0.577
3.108GluGln: 3.108 ± 0.604
4.245GluArg: 4.245 ± 0.578
3.563GluSer: 3.563 ± 0.588
4.093GluThr: 4.093 ± 0.57
4.776GluVal: 4.776 ± 0.687
0.91GluTrp: 0.91 ± 0.2
1.44GluTyr: 1.44 ± 0.372
0.0GluXaa: 0.0 ± 0.0
Phe
2.577PheAla: 2.577 ± 0.431
0.531PheCys: 0.531 ± 0.186
2.805PheAsp: 2.805 ± 0.483
2.426PheGlu: 2.426 ± 0.42
0.531PhePhe: 0.531 ± 0.184
3.108PheGly: 3.108 ± 0.362
0.531PheHis: 0.531 ± 0.211
2.577PheIle: 2.577 ± 0.517
1.819PheLys: 1.819 ± 0.436
2.502PheLeu: 2.502 ± 0.379
0.455PheMet: 0.455 ± 0.175
1.516PheAsn: 1.516 ± 0.34
1.743PhePro: 1.743 ± 0.433
1.516PheGln: 1.516 ± 0.364
2.198PheArg: 2.198 ± 0.349
1.743PheSer: 1.743 ± 0.436
3.26PheThr: 3.26 ± 0.508
2.502PheVal: 2.502 ± 0.496
0.834PheTrp: 0.834 ± 0.26
0.91PheTyr: 0.91 ± 0.289
0.0PheXaa: 0.0 ± 0.0
Gly
7.201GlyAla: 7.201 ± 0.851
0.91GlyCys: 0.91 ± 0.249
4.169GlyAsp: 4.169 ± 0.672
5.609GlyGlu: 5.609 ± 0.735
3.032GlyPhe: 3.032 ± 0.524
6.519GlyGly: 6.519 ± 0.859
1.516GlyHis: 1.516 ± 0.512
3.487GlyIle: 3.487 ± 0.488
5.155GlyLys: 5.155 ± 0.543
5.458GlyLeu: 5.458 ± 0.557
2.35GlyMet: 2.35 ± 0.56
3.335GlyAsn: 3.335 ± 0.531
1.895GlyPro: 1.895 ± 0.384
3.032GlyGln: 3.032 ± 0.535
4.624GlyArg: 4.624 ± 0.532
4.776GlySer: 4.776 ± 0.974
4.472GlyThr: 4.472 ± 0.57
5.685GlyVal: 5.685 ± 0.639
1.44GlyTrp: 1.44 ± 0.33
3.184GlyTyr: 3.184 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
1.213HisAla: 1.213 ± 0.283
0.379HisCys: 0.379 ± 0.191
0.758HisAsp: 0.758 ± 0.192
0.758HisGlu: 0.758 ± 0.237
0.531HisPhe: 0.531 ± 0.233
0.985HisGly: 0.985 ± 0.323
0.834HisHis: 0.834 ± 0.33
1.137HisIle: 1.137 ± 0.34
0.985HisLys: 0.985 ± 0.239
1.289HisLeu: 1.289 ± 0.343
0.531HisMet: 0.531 ± 0.196
0.606HisAsn: 0.606 ± 0.168
1.061HisPro: 1.061 ± 0.278
0.985HisGln: 0.985 ± 0.278
0.985HisArg: 0.985 ± 0.294
0.985HisSer: 0.985 ± 0.245
0.91HisThr: 0.91 ± 0.307
0.606HisVal: 0.606 ± 0.201
0.076HisTrp: 0.076 ± 0.073
0.91HisTyr: 0.91 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
4.169IleAla: 4.169 ± 1.032
0.682IleCys: 0.682 ± 0.252
4.169IleAsp: 4.169 ± 0.622
2.502IleGlu: 2.502 ± 0.389
1.213IlePhe: 1.213 ± 0.305
3.335IleGly: 3.335 ± 0.43
0.606IleHis: 0.606 ± 0.215
2.502IleIle: 2.502 ± 0.557
3.108IleLys: 3.108 ± 0.571
3.563IleLeu: 3.563 ± 0.454
1.213IleMet: 1.213 ± 0.342
2.426IleAsn: 2.426 ± 0.486
2.729IlePro: 2.729 ± 0.451
1.819IleGln: 1.819 ± 0.375
2.881IleArg: 2.881 ± 0.361
3.108IleSer: 3.108 ± 0.62
4.472IleThr: 4.472 ± 0.599
3.714IleVal: 3.714 ± 0.436
0.758IleTrp: 0.758 ± 0.21
1.364IleTyr: 1.364 ± 0.361
0.0IleXaa: 0.0 ± 0.0
Lys
5.382LysAla: 5.382 ± 0.834
0.682LysCys: 0.682 ± 0.274
3.79LysAsp: 3.79 ± 0.479
4.548LysGlu: 4.548 ± 0.716
2.35LysPhe: 2.35 ± 0.342
4.245LysGly: 4.245 ± 0.503
1.364LysHis: 1.364 ± 0.331
1.44LysIle: 1.44 ± 0.308
3.108LysLys: 3.108 ± 0.544
5.837LysLeu: 5.837 ± 0.717
2.577LysMet: 2.577 ± 0.597
2.426LysAsn: 2.426 ± 0.386
2.653LysPro: 2.653 ± 0.653
2.274LysGln: 2.274 ± 0.452
3.639LysArg: 3.639 ± 0.609
3.032LysSer: 3.032 ± 0.58
4.169LysThr: 4.169 ± 0.495
3.487LysVal: 3.487 ± 0.635
0.758LysTrp: 0.758 ± 0.219
2.729LysTyr: 2.729 ± 0.486
0.0LysXaa: 0.0 ± 0.0
Leu
7.505LeuAla: 7.505 ± 0.845
0.985LeuCys: 0.985 ± 0.253
3.866LeuAsp: 3.866 ± 0.506
5.382LeuGlu: 5.382 ± 0.754
1.819LeuPhe: 1.819 ± 0.446
5.155LeuGly: 5.155 ± 0.59
1.137LeuHis: 1.137 ± 0.283
4.851LeuIle: 4.851 ± 0.593
5.079LeuLys: 5.079 ± 0.674
6.216LeuLeu: 6.216 ± 0.711
2.274LeuMet: 2.274 ± 0.369
4.245LeuAsn: 4.245 ± 0.482
3.866LeuPro: 3.866 ± 0.567
2.35LeuGln: 2.35 ± 0.39
5.761LeuArg: 5.761 ± 0.663
4.776LeuSer: 4.776 ± 0.561
4.927LeuThr: 4.927 ± 0.496
5.23LeuVal: 5.23 ± 0.624
1.364LeuTrp: 1.364 ± 0.332
2.198LeuTyr: 2.198 ± 0.304
0.0LeuXaa: 0.0 ± 0.0
Met
2.426MetAla: 2.426 ± 0.359
0.227MetCys: 0.227 ± 0.123
1.213MetAsp: 1.213 ± 0.361
0.985MetGlu: 0.985 ± 0.236
0.91MetPhe: 0.91 ± 0.236
2.047MetGly: 2.047 ± 0.352
0.303MetHis: 0.303 ± 0.148
1.061MetIle: 1.061 ± 0.273
1.516MetLys: 1.516 ± 0.415
2.502MetLeu: 2.502 ± 0.49
0.834MetMet: 0.834 ± 0.264
1.213MetAsn: 1.213 ± 0.351
1.44MetPro: 1.44 ± 0.336
1.061MetGln: 1.061 ± 0.255
1.743MetArg: 1.743 ± 0.374
2.047MetSer: 2.047 ± 0.384
2.122MetThr: 2.122 ± 0.328
1.592MetVal: 1.592 ± 0.31
0.379MetTrp: 0.379 ± 0.174
0.834MetTyr: 0.834 ± 0.279
0.0MetXaa: 0.0 ± 0.0
Asn
4.018AsnAla: 4.018 ± 0.627
0.379AsnCys: 0.379 ± 0.159
2.577AsnAsp: 2.577 ± 0.44
2.577AsnGlu: 2.577 ± 0.421
1.516AsnPhe: 1.516 ± 0.302
4.624AsnGly: 4.624 ± 0.683
0.682AsnHis: 0.682 ± 0.222
3.032AsnIle: 3.032 ± 0.44
2.426AsnLys: 2.426 ± 0.446
3.714AsnLeu: 3.714 ± 0.438
0.606AsnMet: 0.606 ± 0.214
2.426AsnAsn: 2.426 ± 0.503
1.668AsnPro: 1.668 ± 0.438
1.137AsnGln: 1.137 ± 0.243
2.653AsnArg: 2.653 ± 0.385
2.198AsnSer: 2.198 ± 0.365
2.122AsnThr: 2.122 ± 0.432
3.563AsnVal: 3.563 ± 0.394
0.834AsnTrp: 0.834 ± 0.225
1.364AsnTyr: 1.364 ± 0.344
0.0AsnXaa: 0.0 ± 0.0
Pro
2.956ProAla: 2.956 ± 0.515
0.379ProCys: 0.379 ± 0.186
3.184ProAsp: 3.184 ± 0.46
2.881ProGlu: 2.881 ± 0.507
2.047ProPhe: 2.047 ± 0.384
3.108ProGly: 3.108 ± 0.548
0.531ProHis: 0.531 ± 0.179
1.213ProIle: 1.213 ± 0.375
2.729ProLys: 2.729 ± 0.419
3.108ProLeu: 3.108 ± 0.492
0.834ProMet: 0.834 ± 0.302
1.44ProAsn: 1.44 ± 0.444
1.44ProPro: 1.44 ± 0.329
1.213ProGln: 1.213 ± 0.255
1.971ProArg: 1.971 ± 0.431
2.274ProSer: 2.274 ± 0.485
1.364ProThr: 1.364 ± 0.321
3.942ProVal: 3.942 ± 0.435
0.455ProTrp: 0.455 ± 0.251
1.516ProTyr: 1.516 ± 0.335
0.0ProXaa: 0.0 ± 0.0
Gln
4.624GlnAla: 4.624 ± 0.598
0.379GlnCys: 0.379 ± 0.171
1.668GlnAsp: 1.668 ± 0.341
2.35GlnGlu: 2.35 ± 0.45
1.213GlnPhe: 1.213 ± 0.297
2.047GlnGly: 2.047 ± 0.487
0.455GlnHis: 0.455 ± 0.187
1.895GlnIle: 1.895 ± 0.484
1.971GlnLys: 1.971 ± 0.425
2.577GlnLeu: 2.577 ± 0.372
1.137GlnMet: 1.137 ± 0.323
1.895GlnAsn: 1.895 ± 0.362
2.122GlnPro: 2.122 ± 0.374
2.729GlnGln: 2.729 ± 0.79
1.895GlnArg: 1.895 ± 0.331
1.743GlnSer: 1.743 ± 0.382
2.122GlnThr: 2.122 ± 0.352
2.577GlnVal: 2.577 ± 0.422
0.531GlnTrp: 0.531 ± 0.202
1.364GlnTyr: 1.364 ± 0.249
0.0GlnXaa: 0.0 ± 0.0
Arg
4.851ArgAla: 4.851 ± 0.451
0.455ArgCys: 0.455 ± 0.151
3.563ArgAsp: 3.563 ± 0.451
4.472ArgGlu: 4.472 ± 0.61
2.047ArgPhe: 2.047 ± 0.404
4.169ArgGly: 4.169 ± 0.5
1.137ArgHis: 1.137 ± 0.261
3.26ArgIle: 3.26 ± 0.474
3.866ArgLys: 3.866 ± 0.582
3.942ArgLeu: 3.942 ± 0.451
2.198ArgMet: 2.198 ± 0.409
3.184ArgAsn: 3.184 ± 0.435
1.743ArgPro: 1.743 ± 0.347
2.729ArgGln: 2.729 ± 0.448
4.397ArgArg: 4.397 ± 0.618
2.198ArgSer: 2.198 ± 0.383
2.881ArgThr: 2.881 ± 0.521
4.7ArgVal: 4.7 ± 0.503
0.985ArgTrp: 0.985 ± 0.255
1.516ArgTyr: 1.516 ± 0.404
0.0ArgXaa: 0.0 ± 0.0
Ser
7.277SerAla: 7.277 ± 1.093
0.227SerCys: 0.227 ± 0.153
2.805SerAsp: 2.805 ± 0.47
3.563SerGlu: 3.563 ± 0.603
2.502SerPhe: 2.502 ± 0.449
5.761SerGly: 5.761 ± 0.71
0.834SerHis: 0.834 ± 0.221
2.577SerIle: 2.577 ± 0.432
2.956SerLys: 2.956 ± 0.471
5.079SerLeu: 5.079 ± 0.605
1.289SerMet: 1.289 ± 0.291
2.881SerAsn: 2.881 ± 0.397
1.516SerPro: 1.516 ± 0.438
2.047SerGln: 2.047 ± 0.34
3.108SerArg: 3.108 ± 0.458
3.639SerSer: 3.639 ± 0.635
4.169SerThr: 4.169 ± 0.635
4.776SerVal: 4.776 ± 0.693
0.758SerTrp: 0.758 ± 0.196
1.895SerTyr: 1.895 ± 0.418
0.0SerXaa: 0.0 ± 0.0
Thr
5.988ThrAla: 5.988 ± 0.71
0.455ThrCys: 0.455 ± 0.161
3.942ThrAsp: 3.942 ± 0.49
3.942ThrGlu: 3.942 ± 0.49
2.805ThrPhe: 2.805 ± 0.456
5.988ThrGly: 5.988 ± 0.697
1.061ThrHis: 1.061 ± 0.288
2.729ThrIle: 2.729 ± 0.446
3.032ThrLys: 3.032 ± 0.43
4.776ThrLeu: 4.776 ± 0.593
1.213ThrMet: 1.213 ± 0.323
2.047ThrAsn: 2.047 ± 0.382
3.639ThrPro: 3.639 ± 0.563
1.895ThrGln: 1.895 ± 0.363
2.881ThrArg: 2.881 ± 0.413
5.23ThrSer: 5.23 ± 0.576
3.335ThrThr: 3.335 ± 0.366
4.7ThrVal: 4.7 ± 0.717
1.213ThrTrp: 1.213 ± 0.248
2.502ThrTyr: 2.502 ± 0.478
0.0ThrXaa: 0.0 ± 0.0
Val
6.595ValAla: 6.595 ± 0.689
0.91ValCys: 0.91 ± 0.27
3.563ValAsp: 3.563 ± 0.424
6.519ValGlu: 6.519 ± 0.65
1.819ValPhe: 1.819 ± 0.434
4.018ValGly: 4.018 ± 0.529
0.834ValHis: 0.834 ± 0.193
4.093ValIle: 4.093 ± 0.514
4.624ValLys: 4.624 ± 0.72
4.776ValLeu: 4.776 ± 0.665
1.213ValMet: 1.213 ± 0.374
3.411ValAsn: 3.411 ± 0.508
2.122ValPro: 2.122 ± 0.48
2.274ValGln: 2.274 ± 0.336
3.866ValArg: 3.866 ± 0.595
5.913ValSer: 5.913 ± 0.967
5.685ValThr: 5.685 ± 0.74
4.776ValVal: 4.776 ± 0.746
0.985ValTrp: 0.985 ± 0.249
3.108ValTyr: 3.108 ± 0.386
0.0ValXaa: 0.0 ± 0.0
Trp
1.061TrpAla: 1.061 ± 0.376
0.227TrpCys: 0.227 ± 0.152
0.758TrpAsp: 0.758 ± 0.245
0.606TrpGlu: 0.606 ± 0.197
0.834TrpPhe: 0.834 ± 0.308
0.91TrpGly: 0.91 ± 0.224
0.303TrpHis: 0.303 ± 0.177
0.834TrpIle: 0.834 ± 0.244
0.682TrpLys: 0.682 ± 0.266
2.198TrpLeu: 2.198 ± 0.349
0.379TrpMet: 0.379 ± 0.182
0.606TrpAsn: 0.606 ± 0.243
0.455TrpPro: 0.455 ± 0.214
0.682TrpGln: 0.682 ± 0.211
1.44TrpArg: 1.44 ± 0.392
0.531TrpSer: 0.531 ± 0.162
0.758TrpThr: 0.758 ± 0.236
1.364TrpVal: 1.364 ± 0.249
0.379TrpTrp: 0.379 ± 0.195
0.227TrpTyr: 0.227 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.956TyrAla: 2.956 ± 0.453
0.455TyrCys: 0.455 ± 0.197
2.122TyrAsp: 2.122 ± 0.473
2.577TyrGlu: 2.577 ± 0.473
1.516TyrPhe: 1.516 ± 0.362
2.729TyrGly: 2.729 ± 0.453
0.834TyrHis: 0.834 ± 0.223
1.592TyrIle: 1.592 ± 0.321
2.653TyrLys: 2.653 ± 0.451
2.198TyrLeu: 2.198 ± 0.471
0.834TyrMet: 0.834 ± 0.2
1.516TyrAsn: 1.516 ± 0.308
1.061TyrPro: 1.061 ± 0.366
1.289TyrGln: 1.289 ± 0.328
2.122TyrArg: 2.122 ± 0.482
1.895TyrSer: 1.895 ± 0.421
2.35TyrThr: 2.35 ± 0.339
1.668TyrVal: 1.668 ± 0.373
0.076TyrTrp: 0.076 ± 0.082
1.289TyrTyr: 1.289 ± 0.327
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13193 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski