Amino acid dipepetide frequency for Salmonella phage 66FD

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.516AlaAla: 13.516 ± 1.761
0.405AlaCys: 0.405 ± 0.203
5.827AlaAsp: 5.827 ± 0.796
6.556AlaGlu: 6.556 ± 1.311
4.047AlaPhe: 4.047 ± 0.302
7.85AlaGly: 7.85 ± 1.151
0.971AlaHis: 0.971 ± 0.248
3.642AlaIle: 3.642 ± 0.616
3.966AlaLys: 3.966 ± 0.678
7.85AlaLeu: 7.85 ± 0.773
3.48AlaMet: 3.48 ± 0.538
3.885AlaAsn: 3.885 ± 0.543
3.804AlaPro: 3.804 ± 0.528
6.475AlaGln: 6.475 ± 1.123
6.313AlaArg: 6.313 ± 0.926
6.879AlaSer: 6.879 ± 0.555
4.128AlaThr: 4.128 ± 0.596
8.093AlaVal: 8.093 ± 0.913
1.214AlaTrp: 1.214 ± 0.285
3.48AlaTyr: 3.48 ± 0.631
0.0AlaXaa: 0.0 ± 0.0
Cys
0.809CysAla: 0.809 ± 0.254
0.162CysCys: 0.162 ± 0.113
0.567CysAsp: 0.567 ± 0.196
0.567CysGlu: 0.567 ± 0.205
0.567CysPhe: 0.567 ± 0.202
1.052CysGly: 1.052 ± 0.388
0.243CysHis: 0.243 ± 0.155
0.89CysIle: 0.89 ± 0.263
0.324CysLys: 0.324 ± 0.176
0.728CysLeu: 0.728 ± 0.261
0.162CysMet: 0.162 ± 0.122
0.324CysAsn: 0.324 ± 0.155
0.728CysPro: 0.728 ± 0.26
0.162CysGln: 0.162 ± 0.138
0.89CysArg: 0.89 ± 0.266
0.324CysSer: 0.324 ± 0.169
0.243CysThr: 0.243 ± 0.178
0.486CysVal: 0.486 ± 0.206
0.243CysTrp: 0.243 ± 0.158
0.081CysTyr: 0.081 ± 0.07
0.0CysXaa: 0.0 ± 0.0
Asp
6.151AspAla: 6.151 ± 0.817
0.809AspCys: 0.809 ± 0.266
4.047AspAsp: 4.047 ± 0.595
3.966AspGlu: 3.966 ± 0.674
1.861AspPhe: 1.861 ± 0.283
5.422AspGly: 5.422 ± 0.664
0.89AspHis: 0.89 ± 0.289
3.561AspIle: 3.561 ± 0.494
3.804AspLys: 3.804 ± 0.491
5.018AspLeu: 5.018 ± 0.492
2.509AspMet: 2.509 ± 0.451
2.185AspAsn: 2.185 ± 0.37
2.509AspPro: 2.509 ± 0.633
1.861AspGln: 1.861 ± 0.39
2.266AspArg: 2.266 ± 0.378
3.804AspSer: 3.804 ± 0.334
2.59AspThr: 2.59 ± 0.516
4.937AspVal: 4.937 ± 0.76
1.295AspTrp: 1.295 ± 0.337
2.428AspTyr: 2.428 ± 0.459
0.0AspXaa: 0.0 ± 0.0
Glu
4.856GluAla: 4.856 ± 0.617
0.405GluCys: 0.405 ± 0.158
3.237GluAsp: 3.237 ± 0.532
3.561GluGlu: 3.561 ± 0.56
2.509GluPhe: 2.509 ± 0.399
3.156GluGly: 3.156 ± 0.516
0.971GluHis: 0.971 ± 0.292
2.833GluIle: 2.833 ± 0.513
3.237GluLys: 3.237 ± 0.532
5.746GluLeu: 5.746 ± 0.713
1.052GluMet: 1.052 ± 0.306
2.023GluAsn: 2.023 ± 0.346
1.861GluPro: 1.861 ± 0.404
4.694GluGln: 4.694 ± 0.742
4.128GluArg: 4.128 ± 0.811
3.561GluSer: 3.561 ± 0.561
3.156GluThr: 3.156 ± 0.452
4.37GluVal: 4.37 ± 0.747
1.052GluTrp: 1.052 ± 0.281
1.861GluTyr: 1.861 ± 0.245
0.0GluXaa: 0.0 ± 0.0
Phe
2.509PheAla: 2.509 ± 0.569
0.324PheCys: 0.324 ± 0.16
1.942PheAsp: 1.942 ± 0.4
2.428PheGlu: 2.428 ± 0.415
1.376PhePhe: 1.376 ± 0.337
2.752PheGly: 2.752 ± 0.462
0.567PheHis: 0.567 ± 0.246
2.752PheIle: 2.752 ± 0.406
2.023PheLys: 2.023 ± 0.411
1.861PheLeu: 1.861 ± 0.402
0.324PheMet: 0.324 ± 0.139
1.7PheAsn: 1.7 ± 0.335
1.457PhePro: 1.457 ± 0.293
0.971PheGln: 0.971 ± 0.356
2.023PheArg: 2.023 ± 0.406
2.914PheSer: 2.914 ± 0.439
1.942PheThr: 1.942 ± 0.398
1.7PheVal: 1.7 ± 0.381
0.647PheTrp: 0.647 ± 0.203
1.376PheTyr: 1.376 ± 0.426
0.0PheXaa: 0.0 ± 0.0
Gly
6.394GlyAla: 6.394 ± 0.825
0.89GlyCys: 0.89 ± 0.277
4.937GlyAsp: 4.937 ± 0.637
4.613GlyGlu: 4.613 ± 0.469
3.075GlyPhe: 3.075 ± 0.592
5.827GlyGly: 5.827 ± 0.799
1.619GlyHis: 1.619 ± 0.435
5.18GlyIle: 5.18 ± 0.611
5.261GlyLys: 5.261 ± 0.609
5.908GlyLeu: 5.908 ± 0.794
2.671GlyMet: 2.671 ± 0.443
3.642GlyAsn: 3.642 ± 0.544
2.023GlyPro: 2.023 ± 0.41
3.156GlyGln: 3.156 ± 0.616
4.047GlyArg: 4.047 ± 0.757
5.342GlySer: 5.342 ± 0.77
5.099GlyThr: 5.099 ± 0.74
5.746GlyVal: 5.746 ± 0.767
1.133GlyTrp: 1.133 ± 0.283
2.104GlyTyr: 2.104 ± 0.333
0.0GlyXaa: 0.0 ± 0.0
His
1.538HisAla: 1.538 ± 0.514
0.405HisCys: 0.405 ± 0.181
0.971HisAsp: 0.971 ± 0.248
1.052HisGlu: 1.052 ± 0.331
0.405HisPhe: 0.405 ± 0.243
1.538HisGly: 1.538 ± 0.411
0.486HisHis: 0.486 ± 0.176
0.971HisIle: 0.971 ± 0.235
0.486HisLys: 0.486 ± 0.211
1.861HisLeu: 1.861 ± 0.348
0.567HisMet: 0.567 ± 0.188
0.728HisAsn: 0.728 ± 0.271
0.728HisPro: 0.728 ± 0.225
0.405HisGln: 0.405 ± 0.237
0.728HisArg: 0.728 ± 0.29
1.052HisSer: 1.052 ± 0.376
0.728HisThr: 0.728 ± 0.213
0.971HisVal: 0.971 ± 0.28
0.567HisTrp: 0.567 ± 0.189
0.486HisTyr: 0.486 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
5.584IleAla: 5.584 ± 0.612
0.486IleCys: 0.486 ± 0.227
4.37IleAsp: 4.37 ± 0.598
2.833IleGlu: 2.833 ± 0.441
1.781IlePhe: 1.781 ± 0.482
4.694IleGly: 4.694 ± 0.657
0.809IleHis: 0.809 ± 0.228
2.833IleIle: 2.833 ± 0.837
2.671IleLys: 2.671 ± 0.645
3.48IleLeu: 3.48 ± 0.718
0.647IleMet: 0.647 ± 0.213
3.399IleAsn: 3.399 ± 0.438
2.509IlePro: 2.509 ± 0.324
2.428IleGln: 2.428 ± 0.583
2.994IleArg: 2.994 ± 0.459
3.966IleSer: 3.966 ± 0.759
3.723IleThr: 3.723 ± 0.64
2.994IleVal: 2.994 ± 0.687
0.89IleTrp: 0.89 ± 0.243
1.295IleTyr: 1.295 ± 0.291
0.0IleXaa: 0.0 ± 0.0
Lys
4.937LysAla: 4.937 ± 0.601
0.728LysCys: 0.728 ± 0.303
2.104LysAsp: 2.104 ± 0.359
2.509LysGlu: 2.509 ± 0.453
1.295LysPhe: 1.295 ± 0.269
2.994LysGly: 2.994 ± 0.529
0.728LysHis: 0.728 ± 0.214
1.538LysIle: 1.538 ± 0.252
2.347LysLys: 2.347 ± 0.456
3.966LysLeu: 3.966 ± 0.603
1.7LysMet: 1.7 ± 0.463
2.266LysAsn: 2.266 ± 0.643
3.318LysPro: 3.318 ± 0.725
1.861LysGln: 1.861 ± 0.397
3.804LysArg: 3.804 ± 0.689
3.804LysSer: 3.804 ± 0.678
2.671LysThr: 2.671 ± 0.435
3.318LysVal: 3.318 ± 0.615
0.89LysTrp: 0.89 ± 0.298
1.619LysTyr: 1.619 ± 0.455
0.0LysXaa: 0.0 ± 0.0
Leu
8.579LeuAla: 8.579 ± 1.131
0.809LeuCys: 0.809 ± 0.255
4.775LeuAsp: 4.775 ± 0.569
4.37LeuGlu: 4.37 ± 0.583
2.914LeuPhe: 2.914 ± 0.528
4.694LeuGly: 4.694 ± 0.977
1.214LeuHis: 1.214 ± 0.32
4.289LeuIle: 4.289 ± 0.766
3.966LeuLys: 3.966 ± 0.648
5.746LeuLeu: 5.746 ± 0.763
1.538LeuMet: 1.538 ± 0.304
3.399LeuAsn: 3.399 ± 0.608
2.914LeuPro: 2.914 ± 0.528
3.318LeuGln: 3.318 ± 0.446
5.261LeuArg: 5.261 ± 0.584
6.232LeuSer: 6.232 ± 0.812
6.313LeuThr: 6.313 ± 0.77
4.208LeuVal: 4.208 ± 0.52
1.133LeuTrp: 1.133 ± 0.34
1.7LeuTyr: 1.7 ± 0.373
0.0LeuXaa: 0.0 ± 0.0
Met
3.642MetAla: 3.642 ± 0.528
0.324MetCys: 0.324 ± 0.154
1.457MetAsp: 1.457 ± 0.368
1.052MetGlu: 1.052 ± 0.322
0.405MetPhe: 0.405 ± 0.206
2.023MetGly: 2.023 ± 0.489
0.0MetHis: 0.0 ± 0.0
1.538MetIle: 1.538 ± 0.438
1.214MetLys: 1.214 ± 0.427
1.781MetLeu: 1.781 ± 0.404
1.052MetMet: 1.052 ± 0.295
1.7MetAsn: 1.7 ± 0.421
1.457MetPro: 1.457 ± 0.37
0.89MetGln: 0.89 ± 0.283
1.295MetArg: 1.295 ± 0.647
2.185MetSer: 2.185 ± 0.387
1.538MetThr: 1.538 ± 0.369
1.781MetVal: 1.781 ± 0.446
0.162MetTrp: 0.162 ± 0.096
0.89MetTyr: 0.89 ± 0.318
0.0MetXaa: 0.0 ± 0.0
Asn
4.128AsnAla: 4.128 ± 0.505
0.162AsnCys: 0.162 ± 0.11
2.752AsnAsp: 2.752 ± 0.508
2.752AsnGlu: 2.752 ± 0.575
0.809AsnPhe: 0.809 ± 0.232
4.694AsnGly: 4.694 ± 0.69
0.971AsnHis: 0.971 ± 0.308
2.59AsnIle: 2.59 ± 0.537
1.7AsnLys: 1.7 ± 0.413
3.804AsnLeu: 3.804 ± 0.678
0.647AsnMet: 0.647 ± 0.27
2.347AsnAsn: 2.347 ± 0.634
2.104AsnPro: 2.104 ± 0.482
2.509AsnGln: 2.509 ± 0.483
2.104AsnArg: 2.104 ± 0.436
3.075AsnSer: 3.075 ± 0.474
2.671AsnThr: 2.671 ± 0.374
2.347AsnVal: 2.347 ± 0.538
0.89AsnTrp: 0.89 ± 0.342
2.266AsnTyr: 2.266 ± 0.532
0.0AsnXaa: 0.0 ± 0.0
Pro
4.37ProAla: 4.37 ± 0.867
0.486ProCys: 0.486 ± 0.205
3.885ProAsp: 3.885 ± 0.593
3.48ProGlu: 3.48 ± 0.567
1.214ProPhe: 1.214 ± 0.285
3.642ProGly: 3.642 ± 0.355
0.567ProHis: 0.567 ± 0.205
2.185ProIle: 2.185 ± 0.457
0.89ProLys: 0.89 ± 0.259
2.752ProLeu: 2.752 ± 0.373
0.809ProMet: 0.809 ± 0.261
1.214ProAsn: 1.214 ± 0.377
1.619ProPro: 1.619 ± 0.324
2.347ProGln: 2.347 ± 0.417
1.781ProArg: 1.781 ± 0.392
2.428ProSer: 2.428 ± 0.453
2.266ProThr: 2.266 ± 0.415
3.642ProVal: 3.642 ± 0.542
0.647ProTrp: 0.647 ± 0.238
1.295ProTyr: 1.295 ± 0.357
0.0ProXaa: 0.0 ± 0.0
Gln
6.232GlnAla: 6.232 ± 1.183
0.486GlnCys: 0.486 ± 0.235
2.185GlnAsp: 2.185 ± 0.35
2.509GlnGlu: 2.509 ± 0.568
1.7GlnPhe: 1.7 ± 0.408
2.994GlnGly: 2.994 ± 0.441
1.052GlnHis: 1.052 ± 0.309
2.185GlnIle: 2.185 ± 0.585
1.942GlnLys: 1.942 ± 0.418
4.289GlnLeu: 4.289 ± 0.76
1.376GlnMet: 1.376 ± 0.406
1.457GlnAsn: 1.457 ± 0.29
1.781GlnPro: 1.781 ± 0.346
3.723GlnGln: 3.723 ± 0.793
3.966GlnArg: 3.966 ± 0.543
1.861GlnSer: 1.861 ± 0.34
3.075GlnThr: 3.075 ± 0.739
2.752GlnVal: 2.752 ± 0.633
0.728GlnTrp: 0.728 ± 0.332
1.7GlnTyr: 1.7 ± 0.367
0.0GlnXaa: 0.0 ± 0.0
Arg
5.908ArgAla: 5.908 ± 0.909
0.728ArgCys: 0.728 ± 0.265
4.451ArgAsp: 4.451 ± 0.544
3.48ArgGlu: 3.48 ± 0.494
1.942ArgPhe: 1.942 ± 0.277
3.318ArgGly: 3.318 ± 0.386
1.052ArgHis: 1.052 ± 0.295
3.804ArgIle: 3.804 ± 0.419
3.642ArgLys: 3.642 ± 0.414
4.694ArgLeu: 4.694 ± 0.575
1.376ArgMet: 1.376 ± 0.321
3.399ArgAsn: 3.399 ± 0.355
1.376ArgPro: 1.376 ± 0.454
3.399ArgGln: 3.399 ± 0.516
4.289ArgArg: 4.289 ± 0.678
3.156ArgSer: 3.156 ± 0.621
2.994ArgThr: 2.994 ± 0.578
3.237ArgVal: 3.237 ± 0.598
1.214ArgTrp: 1.214 ± 0.299
1.942ArgTyr: 1.942 ± 0.42
0.0ArgXaa: 0.0 ± 0.0
Ser
7.284SerAla: 7.284 ± 0.819
0.809SerCys: 0.809 ± 0.259
3.804SerAsp: 3.804 ± 0.497
4.208SerGlu: 4.208 ± 0.473
2.104SerPhe: 2.104 ± 0.432
6.717SerGly: 6.717 ± 0.863
1.295SerHis: 1.295 ± 0.354
3.318SerIle: 3.318 ± 0.519
3.399SerLys: 3.399 ± 0.566
4.128SerLeu: 4.128 ± 0.622
1.861SerMet: 1.861 ± 0.417
3.156SerAsn: 3.156 ± 0.433
3.156SerPro: 3.156 ± 0.478
2.752SerGln: 2.752 ± 0.581
3.723SerArg: 3.723 ± 0.455
3.48SerSer: 3.48 ± 0.597
3.561SerThr: 3.561 ± 0.574
5.018SerVal: 5.018 ± 0.689
0.567SerTrp: 0.567 ± 0.178
1.942SerTyr: 1.942 ± 0.519
0.0SerXaa: 0.0 ± 0.0
Thr
5.827ThrAla: 5.827 ± 0.824
0.324ThrCys: 0.324 ± 0.182
3.156ThrAsp: 3.156 ± 0.456
2.185ThrGlu: 2.185 ± 0.437
1.781ThrPhe: 1.781 ± 0.323
6.475ThrGly: 6.475 ± 0.845
1.052ThrHis: 1.052 ± 0.344
3.237ThrIle: 3.237 ± 0.51
2.023ThrLys: 2.023 ± 0.356
4.37ThrLeu: 4.37 ± 0.589
1.7ThrMet: 1.7 ± 0.346
2.428ThrAsn: 2.428 ± 0.503
3.318ThrPro: 3.318 ± 0.473
2.671ThrGln: 2.671 ± 0.505
2.833ThrArg: 2.833 ± 0.63
4.047ThrSer: 4.047 ± 0.74
2.914ThrThr: 2.914 ± 0.517
3.804ThrVal: 3.804 ± 0.795
1.214ThrTrp: 1.214 ± 0.31
1.781ThrTyr: 1.781 ± 0.357
0.0ThrXaa: 0.0 ± 0.0
Val
5.908ValAla: 5.908 ± 0.753
0.486ValCys: 0.486 ± 0.177
4.856ValAsp: 4.856 ± 0.575
3.642ValGlu: 3.642 ± 0.427
2.023ValPhe: 2.023 ± 0.358
3.885ValGly: 3.885 ± 0.515
0.971ValHis: 0.971 ± 0.268
4.694ValIle: 4.694 ± 0.744
3.399ValLys: 3.399 ± 0.469
5.342ValLeu: 5.342 ± 0.723
1.7ValMet: 1.7 ± 0.362
4.451ValAsn: 4.451 ± 0.612
2.428ValPro: 2.428 ± 0.401
2.509ValGln: 2.509 ± 0.487
4.047ValArg: 4.047 ± 0.585
4.775ValSer: 4.775 ± 0.562
4.208ValThr: 4.208 ± 0.624
4.694ValVal: 4.694 ± 0.804
0.89ValTrp: 0.89 ± 0.381
1.942ValTyr: 1.942 ± 0.391
0.0ValXaa: 0.0 ± 0.0
Trp
1.133TrpAla: 1.133 ± 0.316
0.162TrpCys: 0.162 ± 0.104
0.89TrpAsp: 0.89 ± 0.297
0.971TrpGlu: 0.971 ± 0.333
0.486TrpPhe: 0.486 ± 0.172
1.619TrpGly: 1.619 ± 0.382
0.324TrpHis: 0.324 ± 0.139
0.809TrpIle: 0.809 ± 0.228
0.809TrpLys: 0.809 ± 0.269
1.619TrpLeu: 1.619 ± 0.393
0.486TrpMet: 0.486 ± 0.194
0.486TrpAsn: 0.486 ± 0.17
0.728TrpPro: 0.728 ± 0.277
0.405TrpGln: 0.405 ± 0.196
0.809TrpArg: 0.809 ± 0.291
1.7TrpSer: 1.7 ± 0.418
1.052TrpThr: 1.052 ± 0.338
1.052TrpVal: 1.052 ± 0.319
0.243TrpTrp: 0.243 ± 0.134
0.405TrpTyr: 0.405 ± 0.147
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.075TyrAla: 3.075 ± 0.407
0.162TyrCys: 0.162 ± 0.117
1.7TyrAsp: 1.7 ± 0.36
1.538TyrGlu: 1.538 ± 0.301
1.295TyrPhe: 1.295 ± 0.432
3.318TyrGly: 3.318 ± 0.507
0.89TyrHis: 0.89 ± 0.365
1.538TyrIle: 1.538 ± 0.344
1.295TyrLys: 1.295 ± 0.316
2.509TyrLeu: 2.509 ± 0.48
0.647TyrMet: 0.647 ± 0.263
1.133TyrAsn: 1.133 ± 0.305
1.7TyrPro: 1.7 ± 0.476
1.457TyrGln: 1.457 ± 0.351
2.104TyrArg: 2.104 ± 0.334
1.7TyrSer: 1.7 ± 0.365
2.266TyrThr: 2.266 ± 0.467
1.7TyrVal: 1.7 ± 0.373
0.567TyrTrp: 0.567 ± 0.186
1.133TyrTyr: 1.133 ± 0.354
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (12357 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski