Amino acid dipeptide frequency for Staphylococcus phage tp310-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
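
As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences, expresses them in per mille, and compares each value with the uniform expectation of 2.5‰. It is not the database's own code: the function names, the toy sequences, the mapping of every non-standard letter to X (Xaa), and the normalisation by the total number of dipeptides are all assumptions made for the example, and the database may normalise or colour the table differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands in for Xaa (assumption)

# Expectation if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400    # 2.5 per mille = 0.25 %


def dipeptide_per_mille(sequences):
    """Return a dict with all 441 dipeptides and their frequency in per mille."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["MKKLAAAK", "MAAKXL"]  # hypothetical sequences, not from this proteome
    freqs = dipeptide_per_mille(toy)
    for pair in ("AA", "KK", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

To turn a table value into a fraction or a percentage, multiply by 10⁻³ or divide by 10 respectively; for example, 2.148‰ corresponds to 0.002148, or about 0.215%.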

For more information, see the sequence space article on Wikipedia.

Each entry is given as dipeptide: frequency ± error (‰). Rows are grouped by the first residue; within each row, the second residue follows the column order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.148 ± 0.812
AlaCys: 0.398 ± 0.201
AlaAsp: 1.989 ± 0.338
AlaGlu: 3.341 ± 0.643
AlaPhe: 2.148 ± 0.539
AlaGly: 4.137 ± 0.88
AlaHis: 0.477 ± 0.203
AlaIle: 4.057 ± 0.667
AlaLys: 4.296 ± 0.598
AlaLeu: 4.932 ± 0.853
AlaMet: 1.75 ± 0.393
AlaAsn: 3.659 ± 0.509
AlaPro: 1.034 ± 0.225
AlaGln: 2.784 ± 0.591
AlaArg: 3.023 ± 0.53
AlaSer: 3.182 ± 0.563
AlaThr: 3.659 ± 0.654
AlaVal: 2.864 ± 0.536
AlaTrp: 0.557 ± 0.218
AlaTyr: 1.671 ± 0.426
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.239 ± 0.134
CysCys: 0.08 ± 0.076
CysAsp: 0.159 ± 0.172
CysGlu: 0.159 ± 0.132
CysPhe: 0.398 ± 0.202
CysGly: 0.318 ± 0.213
CysHis: 0.239 ± 0.133
CysIle: 0.875 ± 0.292
CysLys: 0.239 ± 0.136
CysLeu: 0.398 ± 0.162
CysMet: 0.0 ± 0.0
CysAsn: 0.477 ± 0.198
CysPro: 0.08 ± 0.075
CysGln: 0.08 ± 0.09
CysArg: 0.08 ± 0.069
CysSer: 0.557 ± 0.243
CysThr: 0.318 ± 0.16
CysVal: 0.318 ± 0.149
CysTrp: 0.0 ± 0.0
CysTyr: 0.636 ± 0.273
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.943 ± 0.498
AspCys: 0.318 ± 0.156
AspAsp: 3.898 ± 0.813
AspGlu: 5.648 ± 0.856
AspPhe: 2.943 ± 0.551
AspGly: 3.5 ± 0.551
AspHis: 0.955 ± 0.25
AspIle: 4.773 ± 0.558
AspLys: 5.807 ± 0.783
AspLeu: 4.773 ± 0.647
AspMet: 1.273 ± 0.298
AspAsn: 3.818 ± 0.639
AspPro: 1.671 ± 0.447
AspGln: 0.716 ± 0.238
AspArg: 2.546 ± 0.593
AspSer: 3.739 ± 0.582
AspThr: 4.455 ± 0.629
AspVal: 3.818 ± 0.691
AspTrp: 0.477 ± 0.204
AspTyr: 3.102 ± 0.409
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.58 ± 0.744
GluCys: 0.318 ± 0.144
GluAsp: 3.023 ± 0.6
GluGlu: 6.443 ± 0.763
GluPhe: 3.023 ± 0.429
GluGly: 2.466 ± 0.416
GluHis: 1.193 ± 0.322
GluIle: 6.125 ± 1.002
GluLys: 7.0 ± 0.991
GluLeu: 7.318 ± 0.966
GluMet: 2.864 ± 0.515
GluAsn: 5.568 ± 0.619
GluPro: 1.75 ± 0.332
GluGln: 3.739 ± 0.603
GluArg: 3.977 ± 0.635
GluSer: 4.534 ± 0.583
GluThr: 3.659 ± 0.524
GluVal: 4.455 ± 0.596
GluTrp: 0.636 ± 0.231
GluTyr: 3.102 ± 0.609
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.671 ± 0.356
PheCys: 0.239 ± 0.162
PheAsp: 2.705 ± 0.468
PheGlu: 2.386 ± 0.524
PhePhe: 0.955 ± 0.226
PheGly: 2.386 ± 0.588
PheHis: 0.875 ± 0.286
PheIle: 3.659 ± 0.64
PheLys: 4.216 ± 0.552
PheLeu: 2.705 ± 0.665
PheMet: 1.193 ± 0.323
PheAsn: 4.375 ± 0.594
PhePro: 0.795 ± 0.241
PheGln: 1.273 ± 0.256
PheArg: 1.273 ± 0.289
PheSer: 2.068 ± 0.435
PheThr: 2.307 ± 0.457
PheVal: 2.068 ± 0.496
PheTrp: 0.239 ± 0.129
PheTyr: 1.591 ± 0.395
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.341 ± 0.759
GlyCys: 0.159 ± 0.116
GlyAsp: 2.705 ± 0.431
GlyGlu: 3.102 ± 0.47
GlyPhe: 2.625 ± 0.511
GlyGly: 3.5 ± 0.779
GlyHis: 1.432 ± 0.397
GlyIle: 4.296 ± 0.82
GlyLys: 4.932 ± 0.886
GlyLeu: 5.171 ± 0.883
GlyMet: 1.114 ± 0.359
GlyAsn: 4.057 ± 0.719
GlyPro: 0.955 ± 0.27
GlyGln: 2.466 ± 0.511
GlyArg: 2.705 ± 0.505
GlySer: 3.341 ± 0.615
GlyThr: 2.705 ± 0.504
GlyVal: 3.102 ± 0.6
GlyTrp: 1.114 ± 0.338
GlyTyr: 3.182 ± 0.54
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.875 ± 0.262
HisCys: 0.0 ± 0.0
HisAsp: 0.875 ± 0.283
HisGlu: 1.432 ± 0.346
HisPhe: 1.034 ± 0.253
HisGly: 0.716 ± 0.258
HisHis: 0.159 ± 0.108
HisIle: 1.75 ± 0.444
HisLys: 1.114 ± 0.282
HisLeu: 1.591 ± 0.355
HisMet: 0.318 ± 0.167
HisAsn: 1.114 ± 0.366
HisPro: 0.318 ± 0.179
HisGln: 0.716 ± 0.293
HisArg: 0.875 ± 0.207
HisSer: 1.193 ± 0.276
HisThr: 1.193 ± 0.337
HisVal: 1.114 ± 0.393
HisTrp: 0.239 ± 0.146
HisTyr: 0.795 ± 0.257
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.932 ± 0.548
IleCys: 0.239 ± 0.145
IleAsp: 4.614 ± 0.553
IleGlu: 6.364 ± 0.716
IlePhe: 2.546 ± 0.57
IleGly: 3.977 ± 0.671
IleHis: 1.511 ± 0.291
IleIle: 4.693 ± 0.668
IleLys: 8.989 ± 0.752
IleLeu: 5.012 ± 0.798
IleMet: 1.989 ± 0.371
IleAsn: 7.239 ± 1.196
IlePro: 1.989 ± 0.363
IleGln: 3.5 ± 0.579
IleArg: 2.705 ± 0.612
IleSer: 4.614 ± 0.466
IleThr: 5.091 ± 0.548
IleVal: 5.012 ± 0.589
IleTrp: 1.193 ± 0.445
IleTyr: 2.307 ± 0.368
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.727 ± 0.661
LysCys: 0.159 ± 0.118
LysAsp: 5.648 ± 0.483
LysGlu: 6.921 ± 0.848
LysPhe: 3.739 ± 0.662
LysGly: 5.409 ± 0.671
LysHis: 1.114 ± 0.285
LysIle: 5.887 ± 0.842
LysLys: 7.398 ± 0.699
LysLeu: 7.318 ± 0.933
LysMet: 2.546 ± 0.468
LysAsn: 4.455 ± 0.6
LysPro: 3.102 ± 0.606
LysGln: 4.932 ± 0.741
LysArg: 4.216 ± 0.629
LysSer: 5.171 ± 0.527
LysThr: 6.762 ± 0.939
LysVal: 5.807 ± 0.636
LysTrp: 1.352 ± 0.31
LysTyr: 4.534 ± 0.652
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.102 ± 0.569
LeuCys: 0.477 ± 0.202
LeuAsp: 5.727 ± 0.671
LeuGlu: 7.398 ± 0.792
LeuPhe: 3.102 ± 0.469
LeuGly: 4.375 ± 0.735
LeuHis: 1.193 ± 0.246
LeuIle: 5.33 ± 0.676
LeuLys: 8.273 ± 0.835
LeuLeu: 5.966 ± 0.843
LeuMet: 1.034 ± 0.327
LeuAsn: 6.046 ± 0.59
LeuPro: 2.546 ± 0.411
LeuGln: 3.102 ± 0.498
LeuArg: 3.739 ± 0.542
LeuSer: 5.648 ± 0.672
LeuThr: 4.534 ± 0.606
LeuVal: 4.455 ± 0.588
LeuTrp: 0.636 ± 0.213
LeuTyr: 2.705 ± 0.396
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.114 ± 0.328
MetCys: 0.08 ± 0.078
MetAsp: 1.352 ± 0.249
MetGlu: 2.068 ± 0.426
MetPhe: 1.193 ± 0.27
MetGly: 0.955 ± 0.347
MetHis: 0.159 ± 0.096
MetIle: 1.75 ± 0.301
MetLys: 1.83 ± 0.355
MetLeu: 1.909 ± 0.327
MetMet: 0.875 ± 0.278
MetAsn: 1.83 ± 0.431
MetPro: 1.432 ± 0.387
MetGln: 1.273 ± 0.359
MetArg: 1.591 ± 0.379
MetSer: 1.83 ± 0.333
MetThr: 1.511 ± 0.344
MetVal: 1.273 ± 0.255
MetTrp: 0.398 ± 0.158
MetTyr: 1.114 ± 0.294
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.659 ± 0.546
AsnCys: 0.159 ± 0.112
AsnAsp: 4.614 ± 0.627
AsnGlu: 5.33 ± 0.889
AsnPhe: 2.307 ± 0.465
AsnGly: 5.171 ± 0.69
AsnHis: 1.034 ± 0.326
AsnIle: 5.727 ± 0.667
AsnLys: 6.682 ± 0.993
AsnLeu: 5.409 ± 0.606
AsnMet: 1.83 ± 0.318
AsnAsn: 5.568 ± 0.641
AsnPro: 3.102 ± 0.358
AsnGln: 3.977 ± 0.528
AsnArg: 2.625 ± 0.505
AsnSer: 3.421 ± 0.472
AsnThr: 3.182 ± 0.377
AsnVal: 4.534 ± 0.72
AsnTrp: 0.955 ± 0.347
AsnTyr: 3.261 ± 0.518
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.875 ± 0.27
ProCys: 0.0 ± 0.0
ProAsp: 1.591 ± 0.444
ProGlu: 1.909 ± 0.456
ProPhe: 1.273 ± 0.305
ProGly: 0.875 ± 0.264
ProHis: 0.636 ± 0.231
ProIle: 2.625 ± 0.41
ProLys: 2.546 ± 0.593
ProLeu: 1.989 ± 0.494
ProMet: 1.193 ± 0.277
ProAsn: 2.068 ± 0.426
ProPro: 0.795 ± 0.219
ProGln: 0.716 ± 0.243
ProArg: 0.875 ± 0.243
ProSer: 1.511 ± 0.392
ProThr: 1.352 ± 0.401
ProVal: 2.068 ± 0.38
ProTrp: 0.159 ± 0.101
ProTyr: 1.273 ± 0.28
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.58 ± 0.532
GlnCys: 0.557 ± 0.219
GlnAsp: 2.864 ± 0.57
GlnGlu: 3.341 ± 0.62
GlnPhe: 0.955 ± 0.243
GlnGly: 1.75 ± 0.347
GlnHis: 0.795 ± 0.334
GlnIle: 3.341 ± 0.392
GlnLys: 3.023 ± 0.54
GlnLeu: 3.261 ± 0.463
GlnMet: 0.955 ± 0.287
GlnAsn: 3.261 ± 0.55
GlnPro: 0.557 ± 0.158
GlnGln: 2.307 ± 0.538
GlnArg: 2.307 ± 0.454
GlnSer: 2.546 ± 0.577
GlnThr: 2.307 ± 0.394
GlnVal: 2.227 ± 0.434
GlnTrp: 0.318 ± 0.141
GlnTyr: 1.909 ± 0.385
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.227 ± 0.476
ArgCys: 0.318 ± 0.195
ArgAsp: 3.261 ± 0.459
ArgGlu: 3.818 ± 0.683
ArgPhe: 1.83 ± 0.437
ArgGly: 1.83 ± 0.384
ArgHis: 0.955 ± 0.233
ArgIle: 4.057 ± 0.75
ArgLys: 2.943 ± 0.48
ArgLeu: 3.58 ± 0.458
ArgMet: 0.716 ± 0.261
ArgAsn: 3.023 ± 0.42
ArgPro: 0.636 ± 0.215
ArgGln: 1.591 ± 0.408
ArgArg: 2.227 ± 0.412
ArgSer: 2.068 ± 0.374
ArgThr: 2.864 ± 0.564
ArgVal: 2.386 ± 0.437
ArgTrp: 0.875 ± 0.226
ArgTyr: 2.466 ± 0.522
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.784 ± 0.681
SerCys: 0.398 ± 0.181
SerAsp: 4.534 ± 0.735
SerGlu: 4.773 ± 0.719
SerPhe: 2.625 ± 0.545
SerGly: 3.58 ± 0.657
SerHis: 1.432 ± 0.336
SerIle: 5.727 ± 0.68
SerLys: 5.966 ± 0.93
SerLeu: 3.977 ± 0.488
SerMet: 1.511 ± 0.358
SerAsn: 4.932 ± 0.621
SerPro: 0.636 ± 0.22
SerGln: 2.864 ± 0.527
SerArg: 2.705 ± 0.356
SerSer: 3.898 ± 0.647
SerThr: 3.261 ± 0.469
SerVal: 2.705 ± 0.554
SerTrp: 0.477 ± 0.176
SerTyr: 2.227 ± 0.62
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.818 ± 0.508
ThrCys: 0.398 ± 0.188
ThrAsp: 4.375 ± 0.637
ThrGlu: 3.58 ± 0.608
ThrPhe: 2.148 ± 0.43
ThrGly: 4.057 ± 0.645
ThrHis: 1.671 ± 0.417
ThrIle: 5.091 ± 0.493
ThrLys: 5.171 ± 0.682
ThrLeu: 4.375 ± 0.419
ThrMet: 1.114 ± 0.344
ThrAsn: 3.023 ± 0.471
ThrPro: 2.148 ± 0.475
ThrGln: 1.909 ± 0.324
ThrArg: 2.705 ± 0.542
ThrSer: 4.773 ± 0.842
ThrThr: 3.898 ± 0.616
ThrVal: 3.58 ± 0.484
ThrTrp: 0.875 ± 0.294
ThrTyr: 2.705 ± 0.635
ThrXaa: 0.08 ± 0.099

Val
ValAla: 2.943 ± 0.532
ValCys: 0.557 ± 0.221
ValAsp: 3.898 ± 0.606
ValGlu: 3.421 ± 0.511
ValPhe: 1.989 ± 0.366
ValGly: 3.818 ± 0.701
ValHis: 0.557 ± 0.199
ValIle: 4.296 ± 0.591
ValLys: 6.205 ± 0.766
ValLeu: 5.012 ± 0.733
ValMet: 1.591 ± 0.4
ValAsn: 4.137 ± 0.521
ValPro: 1.671 ± 0.351
ValGln: 1.83 ± 0.409
ValArg: 1.511 ± 0.469
ValSer: 3.818 ± 0.513
ValThr: 5.489 ± 0.802
ValVal: 3.659 ± 0.525
ValTrp: 0.318 ± 0.13
ValTyr: 1.432 ± 0.306
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.318 ± 0.174
TrpCys: 0.0 ± 0.0
TrpAsp: 0.636 ± 0.267
TrpGlu: 0.636 ± 0.166
TrpPhe: 0.636 ± 0.223
TrpGly: 0.716 ± 0.267
TrpHis: 0.0 ± 0.0
TrpIle: 1.114 ± 0.268
TrpLys: 0.795 ± 0.235
TrpLeu: 1.273 ± 0.358
TrpMet: 0.557 ± 0.188
TrpAsn: 1.193 ± 0.339
TrpPro: 0.239 ± 0.124
TrpGln: 0.477 ± 0.2
TrpArg: 0.477 ± 0.176
TrpSer: 0.636 ± 0.239
TrpThr: 0.557 ± 0.168
TrpVal: 0.716 ± 0.265
TrpTrp: 0.159 ± 0.119
TrpTyr: 0.239 ± 0.122
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.227 ± 0.412
TyrCys: 0.795 ± 0.27
TyrAsp: 2.705 ± 0.589
TyrGlu: 2.705 ± 0.405
TyrPhe: 1.511 ± 0.419
TyrGly: 2.466 ± 0.593
TyrHis: 1.034 ± 0.305
TyrIle: 3.341 ± 0.492
TyrLys: 4.693 ± 0.556
TyrLeu: 3.421 ± 0.564
TyrMet: 1.034 ± 0.216
TyrAsn: 2.705 ± 0.552
TyrPro: 0.875 ± 0.201
TyrGln: 1.909 ± 0.341
TyrArg: 1.511 ± 0.368
TyrSer: 2.546 ± 0.566
TyrThr: 2.386 ± 0.469
TyrVal: 1.989 ± 0.394
TyrTrp: 0.398 ± 0.15
TyrTyr: 1.193 ± 0.366
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.08 ± 0.099
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 59 proteins (12572 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
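
One way to picture a protein-level bootstrap is sketched below in Python. This is only an illustration under stated assumptions, not the procedure used by the database: it reads "×100" as 100 replicates, resamples whole proteins with replacement, and reports the sample standard deviation of the replicate frequencies as the error. The function names, the seed parameter, and the toy sequences are hypothetical.

```python
import random
from statistics import stdev


def frequency_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (overlapping windows), in per mille."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency over proteomes resampled protein by protein."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the original set, with replacement.
        resampled = rng.choices(sequences, k=len(sequences))
        estimates.append(frequency_per_mille(resampled, dipeptide))
    # Assumption: the reported error is the standard deviation of the replicates.
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLAAAK", "MAAKAL", "MKLLKK"]  # hypothetical sequences
    value = frequency_per_mille(toy, "AA")
    error = bootstrap_error(toy, "AA")
    print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within each protein together, so the spread reflects how much the estimate depends on which proteins happen to be in the set.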

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski