Amino acid dipepetide frequency for Salmonella phage SPN9CC

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.175AlaAla: 9.175 ± 1.271
1.488AlaCys: 1.488 ± 0.405
6.778AlaAsp: 6.778 ± 0.864
6.117AlaGlu: 6.117 ± 0.823
2.893AlaPhe: 2.893 ± 0.518
7.026AlaGly: 7.026 ± 0.976
1.075AlaHis: 1.075 ± 0.311
6.365AlaIle: 6.365 ± 0.858
5.621AlaLys: 5.621 ± 0.644
6.282AlaLeu: 6.282 ± 0.942
3.224AlaMet: 3.224 ± 0.508
5.455AlaAsn: 5.455 ± 0.893
2.48AlaPro: 2.48 ± 0.403
3.72AlaGln: 3.72 ± 0.667
5.373AlaArg: 5.373 ± 0.719
5.621AlaSer: 5.621 ± 0.68
4.959AlaThr: 4.959 ± 0.958
5.29AlaVal: 5.29 ± 0.659
1.488AlaTrp: 1.488 ± 0.329
2.149AlaTyr: 2.149 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
0.579CysAla: 0.579 ± 0.304
0.248CysCys: 0.248 ± 0.167
0.331CysAsp: 0.331 ± 0.167
0.579CysGlu: 0.579 ± 0.225
0.496CysPhe: 0.496 ± 0.232
1.24CysGly: 1.24 ± 0.366
0.496CysHis: 0.496 ± 0.222
0.827CysIle: 0.827 ± 0.304
0.579CysLys: 0.579 ± 0.202
0.744CysLeu: 0.744 ± 0.295
0.165CysMet: 0.165 ± 0.127
0.579CysAsn: 0.579 ± 0.206
0.248CysPro: 0.248 ± 0.138
0.413CysGln: 0.413 ± 0.168
1.157CysArg: 1.157 ± 0.326
0.661CysSer: 0.661 ± 0.243
0.496CysThr: 0.496 ± 0.231
1.323CysVal: 1.323 ± 0.388
0.165CysTrp: 0.165 ± 0.124
0.413CysTyr: 0.413 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
6.943AspAla: 6.943 ± 0.816
0.413AspCys: 0.413 ± 0.181
3.554AspAsp: 3.554 ± 0.597
3.885AspGlu: 3.885 ± 0.671
2.149AspPhe: 2.149 ± 0.418
5.373AspGly: 5.373 ± 0.751
0.909AspHis: 0.909 ± 0.341
4.216AspIle: 4.216 ± 0.537
3.058AspLys: 3.058 ± 0.455
4.959AspLeu: 4.959 ± 0.705
1.488AspMet: 1.488 ± 0.347
1.901AspAsn: 1.901 ± 0.445
1.736AspPro: 1.736 ± 0.436
1.653AspGln: 1.653 ± 0.32
1.984AspArg: 1.984 ± 0.447
3.306AspSer: 3.306 ± 0.499
1.571AspThr: 1.571 ± 0.41
4.133AspVal: 4.133 ± 0.611
0.992AspTrp: 0.992 ± 0.365
2.728AspTyr: 2.728 ± 0.455
0.0AspXaa: 0.0 ± 0.0
Glu
5.455GluAla: 5.455 ± 0.711
1.24GluCys: 1.24 ± 0.306
2.728GluAsp: 2.728 ± 0.519
4.05GluGlu: 4.05 ± 0.782
2.314GluPhe: 2.314 ± 0.539
3.058GluGly: 3.058 ± 0.437
1.405GluHis: 1.405 ± 0.365
3.802GluIle: 3.802 ± 0.55
4.05GluLys: 4.05 ± 0.634
5.951GluLeu: 5.951 ± 0.759
1.901GluMet: 1.901 ± 0.433
2.314GluAsn: 2.314 ± 0.449
2.562GluPro: 2.562 ± 0.462
3.637GluGln: 3.637 ± 0.57
4.546GluArg: 4.546 ± 0.605
2.728GluSer: 2.728 ± 0.391
3.141GluThr: 3.141 ± 0.492
3.472GluVal: 3.472 ± 0.445
1.653GluTrp: 1.653 ± 0.375
2.314GluTyr: 2.314 ± 0.422
0.0GluXaa: 0.0 ± 0.0
Phe
2.48PheAla: 2.48 ± 0.475
0.496PheCys: 0.496 ± 0.182
1.653PheAsp: 1.653 ± 0.341
2.314PheGlu: 2.314 ± 0.422
1.488PhePhe: 1.488 ± 0.458
2.893PheGly: 2.893 ± 0.406
0.248PheHis: 0.248 ± 0.149
2.562PheIle: 2.562 ± 0.665
2.397PheLys: 2.397 ± 0.442
2.149PheLeu: 2.149 ± 0.486
1.488PheMet: 1.488 ± 0.354
1.901PheAsn: 1.901 ± 0.398
1.323PhePro: 1.323 ± 0.267
1.405PheGln: 1.405 ± 0.376
1.818PheArg: 1.818 ± 0.361
2.893PheSer: 2.893 ± 0.499
2.314PheThr: 2.314 ± 0.465
1.901PheVal: 1.901 ± 0.322
0.744PheTrp: 0.744 ± 0.198
1.488PheTyr: 1.488 ± 0.351
0.0PheXaa: 0.0 ± 0.0
Gly
5.621GlyAla: 5.621 ± 0.784
0.661GlyCys: 0.661 ± 0.242
4.05GlyAsp: 4.05 ± 0.546
3.472GlyGlu: 3.472 ± 0.559
2.728GlyPhe: 2.728 ± 0.517
4.712GlyGly: 4.712 ± 0.665
0.827GlyHis: 0.827 ± 0.255
5.455GlyIle: 5.455 ± 0.702
4.298GlyLys: 4.298 ± 0.641
5.042GlyLeu: 5.042 ± 0.63
3.306GlyMet: 3.306 ± 0.566
3.554GlyAsn: 3.554 ± 0.508
0.744GlyPro: 0.744 ± 0.214
4.133GlyGln: 4.133 ± 0.764
5.538GlyArg: 5.538 ± 0.701
4.546GlySer: 4.546 ± 0.838
3.554GlyThr: 3.554 ± 0.639
5.786GlyVal: 5.786 ± 0.553
1.901GlyTrp: 1.901 ± 0.4
1.571GlyTyr: 1.571 ± 0.323
0.0GlyXaa: 0.0 ± 0.0
His
1.075HisAla: 1.075 ± 0.29
0.331HisCys: 0.331 ± 0.165
0.827HisAsp: 0.827 ± 0.362
1.157HisGlu: 1.157 ± 0.385
0.661HisPhe: 0.661 ± 0.237
1.405HisGly: 1.405 ± 0.398
0.496HisHis: 0.496 ± 0.248
0.496HisIle: 0.496 ± 0.187
0.827HisLys: 0.827 ± 0.27
1.901HisLeu: 1.901 ± 0.418
0.331HisMet: 0.331 ± 0.18
0.413HisAsn: 0.413 ± 0.179
0.909HisPro: 0.909 ± 0.257
0.744HisGln: 0.744 ± 0.24
1.24HisArg: 1.24 ± 0.35
0.909HisSer: 0.909 ± 0.262
0.496HisThr: 0.496 ± 0.202
1.075HisVal: 1.075 ± 0.287
0.331HisTrp: 0.331 ± 0.157
0.579HisTyr: 0.579 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
6.613IleAla: 6.613 ± 0.577
0.661IleCys: 0.661 ± 0.198
3.472IleAsp: 3.472 ± 0.512
4.298IleGlu: 4.298 ± 0.59
1.984IlePhe: 1.984 ± 0.561
4.794IleGly: 4.794 ± 0.643
1.157IleHis: 1.157 ± 0.285
4.629IleIle: 4.629 ± 1.187
3.389IleLys: 3.389 ± 0.545
4.298IleLeu: 4.298 ± 0.798
1.488IleMet: 1.488 ± 0.328
3.058IleAsn: 3.058 ± 0.61
3.058IlePro: 3.058 ± 0.539
2.893IleGln: 2.893 ± 0.559
3.72IleArg: 3.72 ± 0.492
5.042IleSer: 5.042 ± 0.993
4.133IleThr: 4.133 ± 0.62
3.141IleVal: 3.141 ± 0.59
0.661IleTrp: 0.661 ± 0.256
2.562IleTyr: 2.562 ± 0.506
0.0IleXaa: 0.0 ± 0.0
Lys
4.712LysAla: 4.712 ± 0.662
0.579LysCys: 0.579 ± 0.227
3.306LysAsp: 3.306 ± 0.542
3.968LysGlu: 3.968 ± 0.657
1.984LysPhe: 1.984 ± 0.407
4.298LysGly: 4.298 ± 0.594
0.909LysHis: 0.909 ± 0.303
3.141LysIle: 3.141 ± 0.494
4.216LysLys: 4.216 ± 0.806
4.794LysLeu: 4.794 ± 0.471
2.149LysMet: 2.149 ± 0.425
2.314LysAsn: 2.314 ± 0.436
3.637LysPro: 3.637 ± 0.538
3.472LysGln: 3.472 ± 0.567
4.877LysArg: 4.877 ± 0.729
3.472LysSer: 3.472 ± 0.589
3.306LysThr: 3.306 ± 0.569
3.058LysVal: 3.058 ± 0.47
0.496LysTrp: 0.496 ± 0.181
2.149LysTyr: 2.149 ± 0.348
0.0LysXaa: 0.0 ± 0.0
Leu
7.605LeuAla: 7.605 ± 0.851
0.827LeuCys: 0.827 ± 0.269
3.472LeuAsp: 3.472 ± 0.599
4.629LeuGlu: 4.629 ± 0.474
2.645LeuPhe: 2.645 ± 0.531
4.712LeuGly: 4.712 ± 0.709
0.827LeuHis: 0.827 ± 0.216
5.538LeuIle: 5.538 ± 0.949
5.538LeuLys: 5.538 ± 0.663
6.199LeuLeu: 6.199 ± 0.767
2.149LeuMet: 2.149 ± 0.44
4.381LeuAsn: 4.381 ± 0.716
3.472LeuPro: 3.472 ± 0.523
3.141LeuGln: 3.141 ± 0.587
4.546LeuArg: 4.546 ± 0.574
5.869LeuSer: 5.869 ± 0.67
5.207LeuThr: 5.207 ± 0.657
4.794LeuVal: 4.794 ± 0.671
1.075LeuTrp: 1.075 ± 0.33
2.645LeuTyr: 2.645 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
3.141MetAla: 3.141 ± 0.541
0.496MetCys: 0.496 ± 0.281
1.736MetAsp: 1.736 ± 0.369
1.323MetGlu: 1.323 ± 0.31
0.661MetPhe: 0.661 ± 0.292
1.818MetGly: 1.818 ± 0.35
0.331MetHis: 0.331 ± 0.172
1.24MetIle: 1.24 ± 0.405
2.48MetLys: 2.48 ± 0.5
2.149MetLeu: 2.149 ± 0.367
1.157MetMet: 1.157 ± 0.41
0.661MetAsn: 0.661 ± 0.214
1.157MetPro: 1.157 ± 0.34
1.818MetGln: 1.818 ± 0.469
2.893MetArg: 2.893 ± 0.586
2.232MetSer: 2.232 ± 0.403
2.066MetThr: 2.066 ± 0.373
1.075MetVal: 1.075 ± 0.31
0.165MetTrp: 0.165 ± 0.138
1.323MetTyr: 1.323 ± 0.311
0.0MetXaa: 0.0 ± 0.0
Asn
4.794AsnAla: 4.794 ± 0.728
0.248AsnCys: 0.248 ± 0.14
2.645AsnAsp: 2.645 ± 0.463
2.48AsnGlu: 2.48 ± 0.477
0.992AsnPhe: 0.992 ± 0.291
4.381AsnGly: 4.381 ± 0.578
1.157AsnHis: 1.157 ± 0.367
2.728AsnIle: 2.728 ± 0.569
2.728AsnLys: 2.728 ± 0.443
2.893AsnLeu: 2.893 ± 0.55
1.24AsnMet: 1.24 ± 0.289
2.232AsnAsn: 2.232 ± 0.388
2.81AsnPro: 2.81 ± 0.366
2.562AsnGln: 2.562 ± 0.594
2.066AsnArg: 2.066 ± 0.38
2.314AsnSer: 2.314 ± 0.384
2.645AsnThr: 2.645 ± 0.459
2.562AsnVal: 2.562 ± 0.605
0.579AsnTrp: 0.579 ± 0.201
1.323AsnTyr: 1.323 ± 0.318
0.0AsnXaa: 0.0 ± 0.0
Pro
3.058ProAla: 3.058 ± 0.425
0.083ProCys: 0.083 ± 0.09
2.976ProAsp: 2.976 ± 0.406
4.546ProGlu: 4.546 ± 0.663
1.736ProPhe: 1.736 ± 0.337
1.818ProGly: 1.818 ± 0.405
0.744ProHis: 0.744 ± 0.269
2.562ProIle: 2.562 ± 0.412
2.397ProLys: 2.397 ± 0.467
3.141ProLeu: 3.141 ± 0.548
0.992ProMet: 0.992 ± 0.31
1.571ProAsn: 1.571 ± 0.411
1.571ProPro: 1.571 ± 0.361
1.736ProGln: 1.736 ± 0.371
1.901ProArg: 1.901 ± 0.362
2.728ProSer: 2.728 ± 0.404
1.653ProThr: 1.653 ± 0.308
3.141ProVal: 3.141 ± 0.642
0.413ProTrp: 0.413 ± 0.168
1.24ProTyr: 1.24 ± 0.381
0.0ProXaa: 0.0 ± 0.0
Gln
4.298GlnAla: 4.298 ± 0.57
0.496GlnCys: 0.496 ± 0.185
2.562GlnAsp: 2.562 ± 0.503
2.314GlnGlu: 2.314 ± 0.426
2.066GlnPhe: 2.066 ± 0.409
3.306GlnGly: 3.306 ± 0.593
0.744GlnHis: 0.744 ± 0.267
3.472GlnIle: 3.472 ± 0.527
1.736GlnLys: 1.736 ± 0.386
4.712GlnLeu: 4.712 ± 0.636
1.405GlnMet: 1.405 ± 0.375
2.48GlnAsn: 2.48 ± 0.726
2.149GlnPro: 2.149 ± 0.379
4.216GlnGln: 4.216 ± 0.908
2.728GlnArg: 2.728 ± 0.555
2.645GlnSer: 2.645 ± 0.594
1.571GlnThr: 1.571 ± 0.388
2.314GlnVal: 2.314 ± 0.419
0.661GlnTrp: 0.661 ± 0.218
2.066GlnTyr: 2.066 ± 0.457
0.0GlnXaa: 0.0 ± 0.0
Arg
5.042ArgAla: 5.042 ± 0.668
0.579ArgCys: 0.579 ± 0.2
4.05ArgAsp: 4.05 ± 0.604
4.381ArgGlu: 4.381 ± 0.596
1.984ArgPhe: 1.984 ± 0.408
3.306ArgGly: 3.306 ± 0.404
1.405ArgHis: 1.405 ± 0.337
4.381ArgIle: 4.381 ± 0.656
4.298ArgLys: 4.298 ± 0.718
5.29ArgLeu: 5.29 ± 0.609
1.901ArgMet: 1.901 ± 0.333
3.72ArgAsn: 3.72 ± 0.499
2.149ArgPro: 2.149 ± 0.342
2.562ArgGln: 2.562 ± 0.523
4.216ArgArg: 4.216 ± 0.702
3.637ArgSer: 3.637 ± 0.461
1.984ArgThr: 1.984 ± 0.363
3.141ArgVal: 3.141 ± 0.484
0.827ArgTrp: 0.827 ± 0.276
2.149ArgTyr: 2.149 ± 0.415
0.0ArgXaa: 0.0 ± 0.0
Ser
6.365SerAla: 6.365 ± 0.985
0.827SerCys: 0.827 ± 0.414
3.306SerAsp: 3.306 ± 0.457
3.472SerGlu: 3.472 ± 0.66
2.893SerPhe: 2.893 ± 0.512
5.373SerGly: 5.373 ± 0.792
0.909SerHis: 0.909 ± 0.308
3.637SerIle: 3.637 ± 0.523
3.224SerLys: 3.224 ± 0.508
5.207SerLeu: 5.207 ± 0.664
1.736SerMet: 1.736 ± 0.314
2.232SerAsn: 2.232 ± 0.344
2.81SerPro: 2.81 ± 0.481
2.81SerGln: 2.81 ± 0.449
3.637SerArg: 3.637 ± 0.516
3.968SerSer: 3.968 ± 0.768
3.389SerThr: 3.389 ± 0.562
3.885SerVal: 3.885 ± 0.569
0.909SerTrp: 0.909 ± 0.274
1.984SerTyr: 1.984 ± 0.599
0.0SerXaa: 0.0 ± 0.0
Thr
5.455ThrAla: 5.455 ± 0.795
0.496ThrCys: 0.496 ± 0.172
2.81ThrAsp: 2.81 ± 0.572
2.976ThrGlu: 2.976 ± 0.54
1.901ThrPhe: 1.901 ± 0.553
4.464ThrGly: 4.464 ± 0.68
1.075ThrHis: 1.075 ± 0.208
3.141ThrIle: 3.141 ± 0.485
3.058ThrLys: 3.058 ± 0.455
4.133ThrLeu: 4.133 ± 0.682
0.827ThrMet: 0.827 ± 0.234
1.984ThrAsn: 1.984 ± 0.387
3.472ThrPro: 3.472 ± 0.523
2.48ThrGln: 2.48 ± 0.372
1.653ThrArg: 1.653 ± 0.358
3.389ThrSer: 3.389 ± 0.581
2.81ThrThr: 2.81 ± 0.564
2.976ThrVal: 2.976 ± 0.452
0.496ThrTrp: 0.496 ± 0.232
1.488ThrTyr: 1.488 ± 0.316
0.0ThrXaa: 0.0 ± 0.0
Val
5.951ValAla: 5.951 ± 0.593
0.992ValCys: 0.992 ± 0.333
3.224ValAsp: 3.224 ± 0.547
4.05ValGlu: 4.05 ± 0.525
2.314ValPhe: 2.314 ± 0.384
3.968ValGly: 3.968 ± 0.741
0.413ValHis: 0.413 ± 0.182
4.216ValIle: 4.216 ± 0.678
3.885ValLys: 3.885 ± 0.522
5.373ValLeu: 5.373 ± 0.615
1.323ValMet: 1.323 ± 0.366
3.141ValAsn: 3.141 ± 0.604
1.736ValPro: 1.736 ± 0.385
1.984ValGln: 1.984 ± 0.317
2.81ValArg: 2.81 ± 0.451
3.968ValSer: 3.968 ± 0.651
3.802ValThr: 3.802 ± 0.56
4.464ValVal: 4.464 ± 0.512
0.661ValTrp: 0.661 ± 0.265
2.314ValTyr: 2.314 ± 0.523
0.0ValXaa: 0.0 ± 0.0
Trp
1.157TrpAla: 1.157 ± 0.295
0.165TrpCys: 0.165 ± 0.111
0.909TrpAsp: 0.909 ± 0.268
0.496TrpGlu: 0.496 ± 0.228
0.413TrpPhe: 0.413 ± 0.183
1.157TrpGly: 1.157 ± 0.283
0.331TrpHis: 0.331 ± 0.164
0.579TrpIle: 0.579 ± 0.221
1.075TrpLys: 1.075 ± 0.344
1.818TrpLeu: 1.818 ± 0.469
0.827TrpMet: 0.827 ± 0.28
0.248TrpAsn: 0.248 ± 0.16
0.661TrpPro: 0.661 ± 0.229
0.827TrpGln: 0.827 ± 0.22
1.405TrpArg: 1.405 ± 0.364
0.661TrpSer: 0.661 ± 0.267
0.496TrpThr: 0.496 ± 0.191
0.909TrpVal: 0.909 ± 0.267
0.165TrpTrp: 0.165 ± 0.11
0.579TrpTyr: 0.579 ± 0.215
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.058TyrAla: 3.058 ± 0.539
0.413TyrCys: 0.413 ± 0.187
2.562TyrAsp: 2.562 ± 0.459
1.818TyrGlu: 1.818 ± 0.45
1.653TyrPhe: 1.653 ± 0.397
2.397TyrGly: 2.397 ± 0.468
0.744TyrHis: 0.744 ± 0.236
2.066TyrIle: 2.066 ± 0.507
1.984TyrLys: 1.984 ± 0.367
2.314TyrLeu: 2.314 ± 0.361
0.744TyrMet: 0.744 ± 0.264
1.24TyrAsn: 1.24 ± 0.332
1.323TyrPro: 1.323 ± 0.487
1.736TyrGln: 1.736 ± 0.347
2.893TyrArg: 2.893 ± 0.413
1.984TyrSer: 1.984 ± 0.44
1.488TyrThr: 1.488 ± 0.3
2.149TyrVal: 2.149 ± 0.355
0.496TyrTrp: 0.496 ± 0.203
1.157TyrTyr: 1.157 ± 0.335
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12099 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski