Amino acid dipepetide frequency for Escherichia phage PA29

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.589AlaAla: 8.589 ± 0.688
1.026AlaCys: 1.026 ± 0.307
4.862AlaAsp: 4.862 ± 0.642
8.373AlaGlu: 8.373 ± 0.795
3.511AlaPhe: 3.511 ± 0.535
7.617AlaGly: 7.617 ± 1.048
1.296AlaHis: 1.296 ± 0.328
4.862AlaIle: 4.862 ± 0.611
4.43AlaLys: 4.43 ± 0.451
7.185AlaLeu: 7.185 ± 0.678
3.187AlaMet: 3.187 ± 0.429
2.809AlaAsn: 2.809 ± 0.371
3.133AlaPro: 3.133 ± 0.449
5.186AlaGln: 5.186 ± 0.678
5.726AlaArg: 5.726 ± 0.621
5.672AlaSer: 5.672 ± 0.447
5.564AlaThr: 5.564 ± 0.756
6.482AlaVal: 6.482 ± 0.633
1.783AlaTrp: 1.783 ± 0.325
2.593AlaTyr: 2.593 ± 0.344
0.0AlaXaa: 0.0 ± 0.0
Cys
1.188CysAla: 1.188 ± 0.306
0.324CysCys: 0.324 ± 0.146
0.648CysAsp: 0.648 ± 0.173
0.864CysGlu: 0.864 ± 0.273
0.432CysPhe: 0.432 ± 0.176
0.81CysGly: 0.81 ± 0.249
0.486CysHis: 0.486 ± 0.181
0.594CysIle: 0.594 ± 0.205
0.594CysLys: 0.594 ± 0.243
0.918CysLeu: 0.918 ± 0.233
0.054CysMet: 0.054 ± 0.054
0.27CysAsn: 0.27 ± 0.101
0.486CysPro: 0.486 ± 0.155
0.486CysGln: 0.486 ± 0.167
0.972CysArg: 0.972 ± 0.286
1.188CysSer: 1.188 ± 0.359
0.486CysThr: 0.486 ± 0.181
0.756CysVal: 0.756 ± 0.24
0.162CysTrp: 0.162 ± 0.098
0.324CysTyr: 0.324 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
6.104AspAla: 6.104 ± 0.611
0.81AspCys: 0.81 ± 0.227
4.051AspAsp: 4.051 ± 0.455
4.268AspGlu: 4.268 ± 0.488
1.783AspPhe: 1.783 ± 0.274
4.7AspGly: 4.7 ± 0.545
0.918AspHis: 0.918 ± 0.3
3.457AspIle: 3.457 ± 0.394
4.268AspLys: 4.268 ± 0.447
4.105AspLeu: 4.105 ± 0.422
1.729AspMet: 1.729 ± 0.306
2.593AspAsn: 2.593 ± 0.435
2.431AspPro: 2.431 ± 0.398
1.567AspGln: 1.567 ± 0.33
3.241AspArg: 3.241 ± 0.404
3.403AspSer: 3.403 ± 0.361
2.755AspThr: 2.755 ± 0.417
3.943AspVal: 3.943 ± 0.387
1.188AspTrp: 1.188 ± 0.341
1.675AspTyr: 1.675 ± 0.344
0.0AspXaa: 0.0 ± 0.0
Glu
6.644GluAla: 6.644 ± 0.626
1.188GluCys: 1.188 ± 0.301
2.161GluAsp: 2.161 ± 0.319
3.997GluGlu: 3.997 ± 0.481
2.431GluPhe: 2.431 ± 0.406
3.673GluGly: 3.673 ± 0.536
1.026GluHis: 1.026 ± 0.2
3.943GluIle: 3.943 ± 0.48
5.078GluLys: 5.078 ± 0.7
5.888GluLeu: 5.888 ± 0.616
2.377GluMet: 2.377 ± 0.348
3.025GluAsn: 3.025 ± 0.452
2.107GluPro: 2.107 ± 0.474
4.538GluGln: 4.538 ± 0.532
5.726GluArg: 5.726 ± 0.685
3.781GluSer: 3.781 ± 0.443
3.835GluThr: 3.835 ± 0.697
4.105GluVal: 4.105 ± 0.706
0.864GluTrp: 0.864 ± 0.206
2.161GluTyr: 2.161 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
2.755PheAla: 2.755 ± 0.43
0.432PheCys: 0.432 ± 0.185
1.837PheAsp: 1.837 ± 0.327
1.404PheGlu: 1.404 ± 0.362
0.756PhePhe: 0.756 ± 0.217
2.161PheGly: 2.161 ± 0.254
0.54PheHis: 0.54 ± 0.146
1.837PheIle: 1.837 ± 0.3
1.837PheLys: 1.837 ± 0.241
1.999PheLeu: 1.999 ± 0.303
1.026PheMet: 1.026 ± 0.236
1.404PheAsn: 1.404 ± 0.266
1.188PhePro: 1.188 ± 0.238
0.972PheGln: 0.972 ± 0.233
2.161PheArg: 2.161 ± 0.354
2.863PheSer: 2.863 ± 0.372
2.485PheThr: 2.485 ± 0.298
2.269PheVal: 2.269 ± 0.354
0.81PheTrp: 0.81 ± 0.222
1.026PheTyr: 1.026 ± 0.283
0.0PheXaa: 0.0 ± 0.0
Gly
5.996GlyAla: 5.996 ± 0.863
0.864GlyCys: 0.864 ± 0.289
4.808GlyAsp: 4.808 ± 0.753
6.266GlyGlu: 6.266 ± 1.397
2.647GlyPhe: 2.647 ± 0.346
5.24GlyGly: 5.24 ± 0.711
1.242GlyHis: 1.242 ± 0.272
3.943GlyIle: 3.943 ± 0.501
4.646GlyLys: 4.646 ± 0.835
4.754GlyLeu: 4.754 ± 0.481
2.647GlyMet: 2.647 ± 0.398
2.863GlyAsn: 2.863 ± 0.351
3.889GlyPro: 3.889 ± 2.124
2.701GlyGln: 2.701 ± 0.39
4.105GlyArg: 4.105 ± 0.524
3.889GlySer: 3.889 ± 0.503
3.673GlyThr: 3.673 ± 0.526
5.294GlyVal: 5.294 ± 0.538
1.188GlyTrp: 1.188 ± 0.243
2.593GlyTyr: 2.593 ± 0.392
0.0GlyXaa: 0.0 ± 0.0
His
1.459HisAla: 1.459 ± 0.26
0.216HisCys: 0.216 ± 0.109
0.972HisAsp: 0.972 ± 0.196
0.972HisGlu: 0.972 ± 0.225
0.702HisPhe: 0.702 ± 0.237
1.837HisGly: 1.837 ± 0.384
0.54HisHis: 0.54 ± 0.19
0.648HisIle: 0.648 ± 0.197
1.026HisLys: 1.026 ± 0.307
1.567HisLeu: 1.567 ± 0.426
0.27HisMet: 0.27 ± 0.099
0.918HisAsn: 0.918 ± 0.196
0.864HisPro: 0.864 ± 0.247
0.702HisGln: 0.702 ± 0.238
0.972HisArg: 0.972 ± 0.236
0.756HisSer: 0.756 ± 0.193
1.188HisThr: 1.188 ± 0.289
0.918HisVal: 0.918 ± 0.245
0.432HisTrp: 0.432 ± 0.197
0.486HisTyr: 0.486 ± 0.142
0.0HisXaa: 0.0 ± 0.0
Ile
4.916IleAla: 4.916 ± 0.477
1.134IleCys: 1.134 ± 0.312
4.322IleAsp: 4.322 ± 0.548
3.025IleGlu: 3.025 ± 0.507
0.918IlePhe: 0.918 ± 0.271
3.133IleGly: 3.133 ± 0.536
0.864IleHis: 0.864 ± 0.188
2.215IleIle: 2.215 ± 0.309
2.971IleLys: 2.971 ± 0.4
3.457IleLeu: 3.457 ± 0.582
1.188IleMet: 1.188 ± 0.233
2.917IleAsn: 2.917 ± 0.405
2.323IlePro: 2.323 ± 0.349
1.729IleGln: 1.729 ± 0.245
4.376IleArg: 4.376 ± 0.414
4.051IleSer: 4.051 ± 0.597
4.105IleThr: 4.105 ± 0.623
1.999IleVal: 1.999 ± 0.38
0.378IleTrp: 0.378 ± 0.156
1.459IleTyr: 1.459 ± 0.344
0.0IleXaa: 0.0 ± 0.0
Lys
6.158LysAla: 6.158 ± 0.42
0.378LysCys: 0.378 ± 0.167
3.295LysAsp: 3.295 ± 0.44
3.727LysGlu: 3.727 ± 0.468
0.918LysPhe: 0.918 ± 0.184
5.726LysGly: 5.726 ± 1.09
0.918LysHis: 0.918 ± 0.246
3.403LysIle: 3.403 ± 0.474
3.835LysLys: 3.835 ± 0.499
4.916LysLeu: 4.916 ± 0.5
1.837LysMet: 1.837 ± 0.298
3.781LysAsn: 3.781 ± 0.505
2.593LysPro: 2.593 ± 0.343
3.025LysGln: 3.025 ± 0.576
2.971LysArg: 2.971 ± 0.396
2.917LysSer: 2.917 ± 0.329
3.349LysThr: 3.349 ± 0.472
3.187LysVal: 3.187 ± 0.422
0.756LysTrp: 0.756 ± 0.255
1.783LysTyr: 1.783 ± 0.305
0.0LysXaa: 0.0 ± 0.0
Leu
7.941LeuAla: 7.941 ± 0.833
1.35LeuCys: 1.35 ± 0.321
3.835LeuAsp: 3.835 ± 0.455
4.268LeuGlu: 4.268 ± 0.55
2.809LeuPhe: 2.809 ± 0.467
4.213LeuGly: 4.213 ± 0.522
1.459LeuHis: 1.459 ± 0.282
3.781LeuIle: 3.781 ± 0.494
4.646LeuLys: 4.646 ± 0.435
6.482LeuLeu: 6.482 ± 0.564
2.053LeuMet: 2.053 ± 0.304
4.159LeuAsn: 4.159 ± 0.478
3.781LeuPro: 3.781 ± 0.498
3.187LeuGln: 3.187 ± 0.717
5.78LeuArg: 5.78 ± 0.592
4.592LeuSer: 4.592 ± 0.418
4.7LeuThr: 4.7 ± 0.495
4.538LeuVal: 4.538 ± 0.38
1.08LeuTrp: 1.08 ± 0.311
2.323LeuTyr: 2.323 ± 0.306
0.0LeuXaa: 0.0 ± 0.0
Met
3.241MetAla: 3.241 ± 0.344
0.0MetCys: 0.0 ± 0.0
1.404MetAsp: 1.404 ± 0.285
1.729MetGlu: 1.729 ± 0.267
0.81MetPhe: 0.81 ± 0.178
1.567MetGly: 1.567 ± 0.289
0.324MetHis: 0.324 ± 0.132
0.972MetIle: 0.972 ± 0.202
2.107MetLys: 2.107 ± 0.405
1.999MetLeu: 1.999 ± 0.314
0.81MetMet: 0.81 ± 0.222
1.459MetAsn: 1.459 ± 0.321
1.783MetPro: 1.783 ± 0.313
1.567MetGln: 1.567 ± 0.275
1.459MetArg: 1.459 ± 0.25
2.107MetSer: 2.107 ± 0.335
2.701MetThr: 2.701 ± 0.364
1.404MetVal: 1.404 ± 0.324
0.216MetTrp: 0.216 ± 0.098
0.486MetTyr: 0.486 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
4.7AsnAla: 4.7 ± 0.656
0.54AsnCys: 0.54 ± 0.17
2.485AsnAsp: 2.485 ± 0.382
2.917AsnGlu: 2.917 ± 0.392
1.026AsnPhe: 1.026 ± 0.221
3.133AsnGly: 3.133 ± 0.48
1.459AsnHis: 1.459 ± 0.298
2.539AsnIle: 2.539 ± 0.387
2.269AsnLys: 2.269 ± 0.337
3.295AsnLeu: 3.295 ± 0.472
1.026AsnMet: 1.026 ± 0.231
1.945AsnAsn: 1.945 ± 0.383
1.729AsnPro: 1.729 ± 0.26
1.999AsnGln: 1.999 ± 0.358
3.133AsnArg: 3.133 ± 0.5
2.647AsnSer: 2.647 ± 0.42
2.539AsnThr: 2.539 ± 0.4
2.107AsnVal: 2.107 ± 0.344
0.378AsnTrp: 0.378 ± 0.111
1.296AsnTyr: 1.296 ± 0.321
0.0AsnXaa: 0.0 ± 0.0
Pro
3.511ProAla: 3.511 ± 0.57
0.324ProCys: 0.324 ± 0.13
4.376ProAsp: 4.376 ± 0.502
4.97ProGlu: 4.97 ± 0.858
1.35ProPhe: 1.35 ± 0.228
3.349ProGly: 3.349 ± 0.622
0.594ProHis: 0.594 ± 0.183
1.08ProIle: 1.08 ± 0.238
2.971ProLys: 2.971 ± 0.737
2.809ProLeu: 2.809 ± 0.372
0.864ProMet: 0.864 ± 0.192
0.756ProAsn: 0.756 ± 0.185
1.513ProPro: 1.513 ± 0.256
2.377ProGln: 2.377 ± 0.507
1.783ProArg: 1.783 ± 0.295
2.755ProSer: 2.755 ± 0.401
1.891ProThr: 1.891 ± 0.307
4.213ProVal: 4.213 ± 0.448
0.594ProTrp: 0.594 ± 0.182
1.459ProTyr: 1.459 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
4.862GlnAla: 4.862 ± 0.609
0.756GlnCys: 0.756 ± 0.228
2.431GlnAsp: 2.431 ± 0.383
3.079GlnGlu: 3.079 ± 0.435
1.459GlnPhe: 1.459 ± 0.292
3.187GlnGly: 3.187 ± 0.675
0.864GlnHis: 0.864 ± 0.2
2.539GlnIle: 2.539 ± 0.499
3.295GlnLys: 3.295 ± 0.479
3.403GlnLeu: 3.403 ± 0.408
1.242GlnMet: 1.242 ± 0.262
1.837GlnAsn: 1.837 ± 0.333
2.053GlnPro: 2.053 ± 0.382
3.943GlnGln: 3.943 ± 0.844
3.457GlnArg: 3.457 ± 0.56
2.701GlnSer: 2.701 ± 0.42
2.053GlnThr: 2.053 ± 0.378
2.323GlnVal: 2.323 ± 0.471
0.702GlnTrp: 0.702 ± 0.194
1.404GlnTyr: 1.404 ± 0.319
0.0GlnXaa: 0.0 ± 0.0
Arg
4.916ArgAla: 4.916 ± 0.493
0.27ArgCys: 0.27 ± 0.123
4.105ArgAsp: 4.105 ± 0.613
5.456ArgGlu: 5.456 ± 0.668
2.323ArgPhe: 2.323 ± 0.336
4.808ArgGly: 4.808 ± 0.885
1.675ArgHis: 1.675 ± 0.255
3.511ArgIle: 3.511 ± 0.422
4.43ArgLys: 4.43 ± 0.661
5.348ArgLeu: 5.348 ± 0.488
2.107ArgMet: 2.107 ± 0.288
3.241ArgAsn: 3.241 ± 0.43
2.161ArgPro: 2.161 ± 0.395
3.295ArgGln: 3.295 ± 0.473
5.672ArgArg: 5.672 ± 0.603
3.781ArgSer: 3.781 ± 0.435
3.511ArgThr: 3.511 ± 0.497
3.781ArgVal: 3.781 ± 0.444
1.188ArgTrp: 1.188 ± 0.224
2.107ArgTyr: 2.107 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
5.726SerAla: 5.726 ± 0.577
0.702SerCys: 0.702 ± 0.202
3.619SerAsp: 3.619 ± 0.445
4.051SerGlu: 4.051 ± 0.549
1.621SerPhe: 1.621 ± 0.289
5.51SerGly: 5.51 ± 0.695
0.81SerHis: 0.81 ± 0.208
2.485SerIle: 2.485 ± 0.344
2.647SerLys: 2.647 ± 0.396
5.456SerLeu: 5.456 ± 0.623
1.404SerMet: 1.404 ± 0.306
2.593SerAsn: 2.593 ± 0.464
3.349SerPro: 3.349 ± 0.388
3.403SerGln: 3.403 ± 0.405
4.43SerArg: 4.43 ± 0.638
3.295SerSer: 3.295 ± 0.729
3.187SerThr: 3.187 ± 0.406
3.943SerVal: 3.943 ± 0.469
0.918SerTrp: 0.918 ± 0.207
1.783SerTyr: 1.783 ± 0.354
0.0SerXaa: 0.0 ± 0.0
Thr
5.456ThrAla: 5.456 ± 0.581
0.486ThrCys: 0.486 ± 0.182
3.349ThrAsp: 3.349 ± 0.468
3.619ThrGlu: 3.619 ± 0.421
2.269ThrPhe: 2.269 ± 0.415
5.942ThrGly: 5.942 ± 0.735
0.918ThrHis: 0.918 ± 0.229
3.565ThrIle: 3.565 ± 0.448
2.809ThrLys: 2.809 ± 0.404
5.024ThrLeu: 5.024 ± 0.455
0.864ThrMet: 0.864 ± 0.209
1.513ThrAsn: 1.513 ± 0.209
3.511ThrPro: 3.511 ± 0.359
1.999ThrGln: 1.999 ± 0.409
2.809ThrArg: 2.809 ± 0.379
3.403ThrSer: 3.403 ± 0.478
3.403ThrThr: 3.403 ± 0.426
4.268ThrVal: 4.268 ± 0.549
0.918ThrTrp: 0.918 ± 0.223
1.188ThrTyr: 1.188 ± 0.281
0.0ThrXaa: 0.0 ± 0.0
Val
5.888ValAla: 5.888 ± 0.656
0.864ValCys: 0.864 ± 0.252
3.835ValAsp: 3.835 ± 0.479
3.403ValGlu: 3.403 ± 0.357
2.107ValPhe: 2.107 ± 0.318
3.457ValGly: 3.457 ± 0.465
0.702ValHis: 0.702 ± 0.165
3.673ValIle: 3.673 ± 0.456
3.457ValLys: 3.457 ± 0.444
5.402ValLeu: 5.402 ± 0.575
1.729ValMet: 1.729 ± 0.294
3.187ValAsn: 3.187 ± 0.39
2.755ValPro: 2.755 ± 0.392
2.323ValGln: 2.323 ± 0.34
5.186ValArg: 5.186 ± 0.912
4.646ValSer: 4.646 ± 0.535
3.673ValThr: 3.673 ± 0.44
4.213ValVal: 4.213 ± 0.471
0.864ValTrp: 0.864 ± 0.228
1.675ValTyr: 1.675 ± 0.295
0.0ValXaa: 0.0 ± 0.0
Trp
0.972TrpAla: 0.972 ± 0.305
0.108TrpCys: 0.108 ± 0.079
0.702TrpAsp: 0.702 ± 0.218
0.594TrpGlu: 0.594 ± 0.143
0.648TrpPhe: 0.648 ± 0.16
0.594TrpGly: 0.594 ± 0.159
0.324TrpHis: 0.324 ± 0.122
0.756TrpIle: 0.756 ± 0.204
1.188TrpLys: 1.188 ± 0.23
1.513TrpLeu: 1.513 ± 0.357
0.864TrpMet: 0.864 ± 0.216
0.648TrpAsn: 0.648 ± 0.175
0.702TrpPro: 0.702 ± 0.227
1.08TrpGln: 1.08 ± 0.189
1.296TrpArg: 1.296 ± 0.292
0.81TrpSer: 0.81 ± 0.231
0.54TrpThr: 0.54 ± 0.195
1.296TrpVal: 1.296 ± 0.293
0.324TrpTrp: 0.324 ± 0.175
0.432TrpTyr: 0.432 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.863TyrAla: 2.863 ± 0.385
0.27TyrCys: 0.27 ± 0.132
1.891TyrAsp: 1.891 ± 0.297
1.567TyrGlu: 1.567 ± 0.374
1.242TyrPhe: 1.242 ± 0.249
2.593TyrGly: 2.593 ± 0.497
0.486TyrHis: 0.486 ± 0.15
1.675TyrIle: 1.675 ± 0.343
1.026TyrLys: 1.026 ± 0.238
1.459TyrLeu: 1.459 ± 0.284
0.81TyrMet: 0.81 ± 0.207
1.242TyrAsn: 1.242 ± 0.282
1.188TyrPro: 1.188 ± 0.259
1.513TyrGln: 1.513 ± 0.282
2.431TyrArg: 2.431 ± 0.427
1.675TyrSer: 1.675 ± 0.278
1.675TyrThr: 1.675 ± 0.397
1.999TyrVal: 1.999 ± 0.331
0.648TyrTrp: 0.648 ± 0.158
1.08TyrTyr: 1.08 ± 0.242
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (18513 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski