Amino acid dipepetide frequency for Staphylococcus phage phi 11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.72AlaAla: 1.72 ± 0.466
0.344AlaCys: 0.344 ± 0.144
3.01AlaAsp: 3.01 ± 0.392
3.698AlaGlu: 3.698 ± 0.569
2.752AlaPhe: 2.752 ± 0.535
3.268AlaGly: 3.268 ± 0.555
1.376AlaHis: 1.376 ± 0.31
4.472AlaIle: 4.472 ± 1.128
4.988AlaLys: 4.988 ± 0.661
4.3AlaLeu: 4.3 ± 0.704
1.892AlaMet: 1.892 ± 0.486
3.956AlaAsn: 3.956 ± 0.571
1.892AlaPro: 1.892 ± 0.439
2.236AlaGln: 2.236 ± 0.464
2.838AlaArg: 2.838 ± 0.612
3.784AlaSer: 3.784 ± 0.628
4.214AlaThr: 4.214 ± 0.613
3.44AlaVal: 3.44 ± 0.753
0.774AlaTrp: 0.774 ± 0.334
2.236AlaTyr: 2.236 ± 0.537
0.0AlaXaa: 0.0 ± 0.0
Cys
0.258CysAla: 0.258 ± 0.143
0.0CysCys: 0.0 ± 0.0
0.172CysAsp: 0.172 ± 0.114
0.344CysGlu: 0.344 ± 0.17
0.344CysPhe: 0.344 ± 0.161
0.172CysGly: 0.172 ± 0.115
0.0CysHis: 0.0 ± 0.0
0.344CysIle: 0.344 ± 0.177
0.516CysLys: 0.516 ± 0.204
0.344CysLeu: 0.344 ± 0.216
0.0CysMet: 0.0 ± 0.0
0.258CysAsn: 0.258 ± 0.153
0.344CysPro: 0.344 ± 0.179
0.172CysGln: 0.172 ± 0.111
0.516CysArg: 0.516 ± 0.23
0.344CysSer: 0.344 ± 0.213
0.344CysThr: 0.344 ± 0.155
0.172CysVal: 0.172 ± 0.125
0.086CysTrp: 0.086 ± 0.077
0.086CysTyr: 0.086 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
3.01AspAla: 3.01 ± 0.539
0.344AspCys: 0.344 ± 0.174
4.558AspAsp: 4.558 ± 0.982
5.762AspGlu: 5.762 ± 0.937
3.87AspPhe: 3.87 ± 0.628
3.612AspGly: 3.612 ± 0.581
0.258AspHis: 0.258 ± 0.222
4.73AspIle: 4.73 ± 0.749
5.676AspLys: 5.676 ± 0.846
5.332AspLeu: 5.332 ± 0.866
1.634AspMet: 1.634 ± 0.347
3.44AspAsn: 3.44 ± 0.692
1.29AspPro: 1.29 ± 0.286
1.118AspGln: 1.118 ± 0.303
2.15AspArg: 2.15 ± 0.504
3.956AspSer: 3.956 ± 0.571
3.698AspThr: 3.698 ± 0.736
4.644AspVal: 4.644 ± 0.568
0.688AspTrp: 0.688 ± 0.188
2.838AspTyr: 2.838 ± 0.634
0.0AspXaa: 0.0 ± 0.0
Glu
5.332GluAla: 5.332 ± 0.829
0.43GluCys: 0.43 ± 0.174
4.042GluAsp: 4.042 ± 0.585
6.794GluGlu: 6.794 ± 1.442
3.526GluPhe: 3.526 ± 0.62
3.44GluGly: 3.44 ± 0.63
1.376GluHis: 1.376 ± 0.367
5.246GluIle: 5.246 ± 1.052
5.848GluLys: 5.848 ± 1.026
6.966GluLeu: 6.966 ± 0.969
2.408GluMet: 2.408 ± 0.444
4.558GluAsn: 4.558 ± 0.619
2.236GluPro: 2.236 ± 0.371
4.644GluGln: 4.644 ± 0.725
3.526GluArg: 3.526 ± 0.708
4.214GluSer: 4.214 ± 0.754
3.612GluThr: 3.612 ± 0.522
4.816GluVal: 4.816 ± 0.865
0.602GluTrp: 0.602 ± 0.226
5.074GluTyr: 5.074 ± 0.697
0.0GluXaa: 0.0 ± 0.0
Phe
2.494PheAla: 2.494 ± 0.382
0.344PheCys: 0.344 ± 0.139
3.01PheAsp: 3.01 ± 0.542
4.472PheGlu: 4.472 ± 0.578
1.634PhePhe: 1.634 ± 0.419
2.666PheGly: 2.666 ± 0.503
0.688PheHis: 0.688 ± 0.294
3.096PheIle: 3.096 ± 0.456
3.784PheLys: 3.784 ± 0.489
2.838PheLeu: 2.838 ± 0.46
1.204PheMet: 1.204 ± 0.37
3.268PheAsn: 3.268 ± 0.493
0.946PhePro: 0.946 ± 0.288
1.376PheGln: 1.376 ± 0.389
1.118PheArg: 1.118 ± 0.365
2.408PheSer: 2.408 ± 0.429
3.096PheThr: 3.096 ± 0.538
2.408PheVal: 2.408 ± 0.663
0.172PheTrp: 0.172 ± 0.115
1.29PheTyr: 1.29 ± 0.303
0.0PheXaa: 0.0 ± 0.0
Gly
3.182GlyAla: 3.182 ± 0.596
0.258GlyCys: 0.258 ± 0.147
3.01GlyAsp: 3.01 ± 0.605
4.3GlyGlu: 4.3 ± 0.541
2.494GlyPhe: 2.494 ± 0.503
3.612GlyGly: 3.612 ± 0.534
1.118GlyHis: 1.118 ± 0.444
4.816GlyIle: 4.816 ± 0.668
5.074GlyLys: 5.074 ± 0.561
6.02GlyLeu: 6.02 ± 0.89
1.72GlyMet: 1.72 ± 0.399
2.924GlyAsn: 2.924 ± 0.524
0.688GlyPro: 0.688 ± 0.265
1.204GlyGln: 1.204 ± 0.26
1.892GlyArg: 1.892 ± 0.431
2.322GlySer: 2.322 ± 0.453
3.784GlyThr: 3.784 ± 0.593
4.3GlyVal: 4.3 ± 0.676
1.118GlyTrp: 1.118 ± 0.45
3.526GlyTyr: 3.526 ± 0.613
0.0GlyXaa: 0.0 ± 0.0
His
1.29HisAla: 1.29 ± 0.368
0.086HisCys: 0.086 ± 0.101
1.204HisAsp: 1.204 ± 0.319
0.946HisGlu: 0.946 ± 0.309
0.688HisPhe: 0.688 ± 0.244
1.548HisGly: 1.548 ± 0.37
0.602HisHis: 0.602 ± 0.263
1.204HisIle: 1.204 ± 0.377
0.602HisLys: 0.602 ± 0.226
1.204HisLeu: 1.204 ± 0.339
0.258HisMet: 0.258 ± 0.166
1.204HisAsn: 1.204 ± 0.326
0.344HisPro: 0.344 ± 0.147
0.774HisGln: 0.774 ± 0.261
0.344HisArg: 0.344 ± 0.144
1.548HisSer: 1.548 ± 0.35
0.516HisThr: 0.516 ± 0.219
1.204HisVal: 1.204 ± 0.317
0.086HisTrp: 0.086 ± 0.087
0.946HisTyr: 0.946 ± 0.417
0.0HisXaa: 0.0 ± 0.0
Ile
5.59IleAla: 5.59 ± 0.841
0.258IleCys: 0.258 ± 0.152
5.418IleAsp: 5.418 ± 0.744
6.45IleGlu: 6.45 ± 0.69
2.58IlePhe: 2.58 ± 0.559
4.128IleGly: 4.128 ± 0.596
1.29IleHis: 1.29 ± 0.272
4.472IleIle: 4.472 ± 0.735
7.482IleLys: 7.482 ± 0.788
4.128IleLeu: 4.128 ± 0.62
2.15IleMet: 2.15 ± 0.362
4.816IleAsn: 4.816 ± 0.624
1.892IlePro: 1.892 ± 0.444
1.72IleGln: 1.72 ± 0.365
3.956IleArg: 3.956 ± 0.647
3.956IleSer: 3.956 ± 0.62
5.676IleThr: 5.676 ± 0.896
4.3IleVal: 4.3 ± 0.714
1.634IleTrp: 1.634 ± 0.685
2.752IleTyr: 2.752 ± 0.505
0.0IleXaa: 0.0 ± 0.0
Lys
4.902LysAla: 4.902 ± 0.582
0.172LysCys: 0.172 ± 0.134
5.848LysAsp: 5.848 ± 0.815
7.654LysGlu: 7.654 ± 1.279
3.096LysPhe: 3.096 ± 0.571
4.902LysGly: 4.902 ± 0.664
1.462LysHis: 1.462 ± 0.345
6.536LysIle: 6.536 ± 0.913
8.256LysLys: 8.256 ± 1.022
6.966LysLeu: 6.966 ± 0.851
1.806LysMet: 1.806 ± 0.376
5.59LysAsn: 5.59 ± 0.694
2.752LysPro: 2.752 ± 0.565
4.816LysGln: 4.816 ± 0.739
4.386LysArg: 4.386 ± 0.669
4.73LysSer: 4.73 ± 0.742
4.73LysThr: 4.73 ± 0.829
6.622LysVal: 6.622 ± 0.784
0.688LysTrp: 0.688 ± 0.221
4.3LysTyr: 4.3 ± 0.746
0.0LysXaa: 0.0 ± 0.0
Leu
3.87LeuAla: 3.87 ± 0.559
0.258LeuCys: 0.258 ± 0.203
3.784LeuAsp: 3.784 ± 0.569
5.676LeuGlu: 5.676 ± 0.739
3.096LeuPhe: 3.096 ± 0.601
4.3LeuGly: 4.3 ± 0.567
1.204LeuHis: 1.204 ± 0.328
4.3LeuIle: 4.3 ± 0.713
7.224LeuLys: 7.224 ± 0.579
6.45LeuLeu: 6.45 ± 0.69
1.806LeuMet: 1.806 ± 0.379
5.676LeuAsn: 5.676 ± 0.695
2.838LeuPro: 2.838 ± 0.472
3.182LeuGln: 3.182 ± 0.505
3.87LeuArg: 3.87 ± 0.887
5.16LeuSer: 5.16 ± 0.555
5.59LeuThr: 5.59 ± 0.711
4.042LeuVal: 4.042 ± 0.816
1.032LeuTrp: 1.032 ± 0.313
3.182LeuTyr: 3.182 ± 0.536
0.0LeuXaa: 0.0 ± 0.0
Met
1.204MetAla: 1.204 ± 0.346
0.258MetCys: 0.258 ± 0.157
1.29MetAsp: 1.29 ± 0.405
2.322MetGlu: 2.322 ± 0.464
1.118MetPhe: 1.118 ± 0.252
0.946MetGly: 0.946 ± 0.305
0.344MetHis: 0.344 ± 0.207
1.462MetIle: 1.462 ± 0.306
2.236MetLys: 2.236 ± 0.484
3.01MetLeu: 3.01 ± 0.455
0.774MetMet: 0.774 ± 0.227
1.806MetAsn: 1.806 ± 0.49
0.774MetPro: 0.774 ± 0.248
1.548MetGln: 1.548 ± 0.338
1.032MetArg: 1.032 ± 0.318
1.548MetSer: 1.548 ± 0.368
2.408MetThr: 2.408 ± 0.52
1.118MetVal: 1.118 ± 0.296
0.516MetTrp: 0.516 ± 0.22
1.118MetTyr: 1.118 ± 0.309
0.0MetXaa: 0.0 ± 0.0
Asn
5.246AsnAla: 5.246 ± 0.852
0.172AsnCys: 0.172 ± 0.12
4.558AsnAsp: 4.558 ± 0.76
5.332AsnGlu: 5.332 ± 0.719
2.58AsnPhe: 2.58 ± 0.59
4.73AsnGly: 4.73 ± 0.623
0.946AsnHis: 0.946 ± 0.261
4.386AsnIle: 4.386 ± 0.56
6.966AsnLys: 6.966 ± 0.698
5.16AsnLeu: 5.16 ± 0.718
1.72AsnMet: 1.72 ± 0.448
3.612AsnAsn: 3.612 ± 0.564
2.58AsnPro: 2.58 ± 0.525
2.064AsnGln: 2.064 ± 0.38
2.58AsnArg: 2.58 ± 0.333
2.15AsnSer: 2.15 ± 0.393
3.698AsnThr: 3.698 ± 0.519
4.386AsnVal: 4.386 ± 0.655
0.688AsnTrp: 0.688 ± 0.255
2.838AsnTyr: 2.838 ± 0.551
0.0AsnXaa: 0.0 ± 0.0
Pro
1.118ProAla: 1.118 ± 0.29
0.172ProCys: 0.172 ± 0.113
1.634ProAsp: 1.634 ± 0.362
2.236ProGlu: 2.236 ± 0.44
1.204ProPhe: 1.204 ± 0.386
1.806ProGly: 1.806 ± 0.458
0.344ProHis: 0.344 ± 0.145
2.064ProIle: 2.064 ± 0.482
3.268ProLys: 3.268 ± 0.677
1.032ProLeu: 1.032 ± 0.267
1.204ProMet: 1.204 ± 0.297
2.064ProAsn: 2.064 ± 0.344
0.688ProPro: 0.688 ± 0.288
1.376ProGln: 1.376 ± 0.334
1.29ProArg: 1.29 ± 0.327
1.462ProSer: 1.462 ± 0.41
2.15ProThr: 2.15 ± 0.405
2.408ProVal: 2.408 ± 0.515
0.086ProTrp: 0.086 ± 0.094
1.29ProTyr: 1.29 ± 0.356
0.0ProXaa: 0.0 ± 0.0
Gln
2.58GlnAla: 2.58 ± 0.436
0.43GlnCys: 0.43 ± 0.204
2.408GlnAsp: 2.408 ± 0.444
2.924GlnGlu: 2.924 ± 0.665
1.72GlnPhe: 1.72 ± 0.404
2.15GlnGly: 2.15 ± 0.429
0.86GlnHis: 0.86 ± 0.258
2.752GlnIle: 2.752 ± 0.47
2.666GlnLys: 2.666 ± 0.55
3.268GlnLeu: 3.268 ± 0.555
1.376GlnMet: 1.376 ± 0.363
1.892GlnAsn: 1.892 ± 0.355
1.892GlnPro: 1.892 ± 0.525
1.806GlnGln: 1.806 ± 0.595
1.462GlnArg: 1.462 ± 0.34
2.236GlnSer: 2.236 ± 0.451
2.064GlnThr: 2.064 ± 0.401
1.978GlnVal: 1.978 ± 0.456
0.43GlnTrp: 0.43 ± 0.202
1.29GlnTyr: 1.29 ± 0.422
0.0GlnXaa: 0.0 ± 0.0
Arg
1.462ArgAla: 1.462 ± 0.325
0.258ArgCys: 0.258 ± 0.154
2.236ArgAsp: 2.236 ± 0.459
3.268ArgGlu: 3.268 ± 0.488
1.892ArgPhe: 1.892 ± 0.443
2.494ArgGly: 2.494 ± 0.571
1.204ArgHis: 1.204 ± 0.323
3.354ArgIle: 3.354 ± 0.595
4.3ArgLys: 4.3 ± 0.705
3.784ArgLeu: 3.784 ± 0.637
0.516ArgMet: 0.516 ± 0.196
3.956ArgAsn: 3.956 ± 0.552
0.86ArgPro: 0.86 ± 0.253
1.806ArgGln: 1.806 ± 0.409
1.892ArgArg: 1.892 ± 0.479
2.15ArgSer: 2.15 ± 0.432
2.408ArgThr: 2.408 ± 0.539
1.892ArgVal: 1.892 ± 0.456
0.344ArgTrp: 0.344 ± 0.212
2.408ArgTyr: 2.408 ± 0.465
0.0ArgXaa: 0.0 ± 0.0
Ser
3.87SerAla: 3.87 ± 0.595
0.258SerCys: 0.258 ± 0.179
5.246SerAsp: 5.246 ± 0.853
3.01SerGlu: 3.01 ± 0.595
2.15SerPhe: 2.15 ± 0.51
3.096SerGly: 3.096 ± 0.53
0.946SerHis: 0.946 ± 0.364
4.988SerIle: 4.988 ± 0.646
4.644SerLys: 4.644 ± 0.632
3.612SerLeu: 3.612 ± 0.479
1.72SerMet: 1.72 ± 0.356
4.558SerAsn: 4.558 ± 0.646
1.118SerPro: 1.118 ± 0.361
1.806SerGln: 1.806 ± 0.394
2.494SerArg: 2.494 ± 0.331
3.268SerSer: 3.268 ± 0.547
3.182SerThr: 3.182 ± 0.412
3.612SerVal: 3.612 ± 0.718
0.172SerTrp: 0.172 ± 0.124
1.978SerTyr: 1.978 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
3.612ThrAla: 3.612 ± 0.657
0.086ThrCys: 0.086 ± 0.095
3.956ThrAsp: 3.956 ± 0.701
4.3ThrGlu: 4.3 ± 0.606
2.58ThrPhe: 2.58 ± 0.557
4.3ThrGly: 4.3 ± 0.773
0.946ThrHis: 0.946 ± 0.285
6.106ThrIle: 6.106 ± 1.222
5.504ThrLys: 5.504 ± 0.571
4.386ThrLeu: 4.386 ± 0.588
1.29ThrMet: 1.29 ± 0.339
3.526ThrAsn: 3.526 ± 0.571
2.494ThrPro: 2.494 ± 0.477
2.752ThrGln: 2.752 ± 0.478
2.494ThrArg: 2.494 ± 0.461
3.44ThrSer: 3.44 ± 0.435
3.354ThrThr: 3.354 ± 0.843
3.698ThrVal: 3.698 ± 0.753
0.774ThrTrp: 0.774 ± 0.244
2.236ThrTyr: 2.236 ± 0.473
0.0ThrXaa: 0.0 ± 0.0
Val
3.354ValAla: 3.354 ± 0.826
0.43ValCys: 0.43 ± 0.183
4.386ValAsp: 4.386 ± 0.622
5.59ValGlu: 5.59 ± 0.841
2.666ValPhe: 2.666 ± 0.642
2.494ValGly: 2.494 ± 0.581
0.344ValHis: 0.344 ± 0.194
5.762ValIle: 5.762 ± 0.691
5.332ValLys: 5.332 ± 0.56
4.988ValLeu: 4.988 ± 0.713
1.978ValMet: 1.978 ± 0.366
4.644ValAsn: 4.644 ± 0.619
2.064ValPro: 2.064 ± 0.514
1.29ValGln: 1.29 ± 0.318
2.064ValArg: 2.064 ± 0.437
3.956ValSer: 3.956 ± 0.777
3.354ValThr: 3.354 ± 0.555
3.87ValVal: 3.87 ± 0.633
1.29ValTrp: 1.29 ± 0.43
2.494ValTyr: 2.494 ± 0.589
0.0ValXaa: 0.0 ± 0.0
Trp
1.118TrpAla: 1.118 ± 0.352
0.086TrpCys: 0.086 ± 0.077
0.258TrpAsp: 0.258 ± 0.159
0.516TrpGlu: 0.516 ± 0.218
0.258TrpPhe: 0.258 ± 0.122
0.86TrpGly: 0.86 ± 0.39
0.172TrpHis: 0.172 ± 0.124
0.946TrpIle: 0.946 ± 0.305
0.946TrpLys: 0.946 ± 0.337
0.43TrpLeu: 0.43 ± 0.178
0.172TrpMet: 0.172 ± 0.154
1.978TrpAsn: 1.978 ± 1.144
0.172TrpPro: 0.172 ± 0.142
0.43TrpGln: 0.43 ± 0.157
0.258TrpArg: 0.258 ± 0.135
0.774TrpSer: 0.774 ± 0.323
1.118TrpThr: 1.118 ± 0.295
0.946TrpVal: 0.946 ± 0.31
0.0TrpTrp: 0.0 ± 0.0
0.516TrpTyr: 0.516 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.978TyrAla: 1.978 ± 0.525
0.172TyrCys: 0.172 ± 0.122
2.58TyrAsp: 2.58 ± 0.592
3.096TyrGlu: 3.096 ± 0.525
2.064TyrPhe: 2.064 ± 0.493
2.838TyrGly: 2.838 ± 0.605
1.118TyrHis: 1.118 ± 0.336
3.698TyrIle: 3.698 ± 0.666
4.902TyrLys: 4.902 ± 0.63
2.322TyrLeu: 2.322 ± 0.424
1.118TyrMet: 1.118 ± 0.296
2.666TyrAsn: 2.666 ± 0.393
1.032TyrPro: 1.032 ± 0.297
2.064TyrGln: 2.064 ± 0.419
2.322TyrArg: 2.322 ± 0.465
2.322TyrSer: 2.322 ± 0.502
2.838TyrThr: 2.838 ± 0.44
2.494TyrVal: 2.494 ± 0.484
0.774TyrTrp: 0.774 ± 0.293
1.978TyrTyr: 1.978 ± 0.42
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11629 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski