Amino acid dipeptide frequency for Cellulophaga phage phi47:1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
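As an illustration of how such frequencies can be obtained, the following is a minimal sketch, not the database's actual pipeline. It assumes that dipeptides are counted as overlapping windows of two residues, that pairs never span two proteins, and that counts are pooled over the whole proteome. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes in the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each sequence;
        # a pair never spans the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # 1000 * fraction gives the value in per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" of two short sequences.
toy = ["MKKLA", "AKKE"]
freqs = dipeptide_frequencies(toy)

# Under the uniform assumption each of the 400 dipeptides is expected at 2.5 per mille.
uniform_expectation = 1000.0 / 400
overrepresented = sorted(p for p, f in freqs.items() if f > uniform_expectation)
```

With only seven dipeptides in the toy input every observed pair is far above 2.5‰; on a real proteome the same comparison separates over- and underrepresented pairs.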

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
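The unit conversion can be sketched as follows, using the LysLys value from the table (12.68‰) as the worked example. The helper names are hypothetical, and comparing against the uniform expectation of 2.5‰ assumes that this is the baseline used for the red/blue marking.

```python
# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
UNIFORM_EXPECTATION_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """Ratio of an observed per-mille value to the uniform expectation."""
    return value_permille / UNIFORM_EXPECTATION_PERMILLE


lyslys = 12.68  # LysLys entry from the table, in per mille
fraction = permille_to_fraction(lyslys)  # roughly 0.0127, i.e. about 1.27%
fold = fold_vs_uniform(lyslys)           # roughly five times the uniform expectation
```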

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.723 ± 0.917
AlaCys: 0.561 ± 0.173
AlaAsp: 3.815 ± 0.528
AlaGlu: 4.04 ± 0.798
AlaPhe: 2.02 ± 0.445
AlaGly: 4.601 ± 0.7
AlaHis: 0.786 ± 0.223
AlaIle: 4.937 ± 0.413
AlaLys: 6.901 ± 1.25
AlaLeu: 5.611 ± 0.609
AlaMet: 1.066 ± 0.249
AlaAsn: 3.198 ± 0.396
AlaPro: 1.515 ± 0.453
AlaGln: 1.908 ± 0.313
AlaArg: 2.357 ± 0.375
AlaSer: 4.545 ± 0.486
AlaThr: 4.657 ± 0.641
AlaVal: 4.881 ± 0.551
AlaTrp: 0.898 ± 0.367
AlaTyr: 1.571 ± 0.257
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.337 ± 0.146
CysCys: 0.281 ± 0.116
CysAsp: 0.337 ± 0.121
CysGlu: 0.729 ± 0.258
CysPhe: 0.337 ± 0.153
CysGly: 0.673 ± 0.219
CysHis: 0.0 ± 0.0
CysIle: 0.505 ± 0.204
CysLys: 0.729 ± 0.182
CysLeu: 0.617 ± 0.265
CysMet: 0.112 ± 0.092
CysAsn: 0.281 ± 0.109
CysPro: 0.505 ± 0.202
CysGln: 0.337 ± 0.131
CysArg: 0.393 ± 0.147
CysSer: 0.561 ± 0.179
CysThr: 0.449 ± 0.154
CysVal: 0.617 ± 0.188
CysTrp: 0.056 ± 0.058
CysTyr: 0.281 ± 0.119
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.31 ± 0.455
AspCys: 0.449 ± 0.146
AspAsp: 3.759 ± 0.458
AspGlu: 4.152 ± 0.532
AspPhe: 3.423 ± 0.39
AspGly: 4.545 ± 0.498
AspHis: 0.786 ± 0.191
AspIle: 3.871 ± 0.432
AspLys: 4.601 ± 0.562
AspLeu: 5.442 ± 0.706
AspMet: 1.066 ± 0.214
AspAsn: 3.366 ± 0.471
AspPro: 1.571 ± 0.288
AspGln: 1.178 ± 0.201
AspArg: 2.469 ± 0.52
AspSer: 5.05 ± 0.497
AspThr: 2.974 ± 0.375
AspVal: 4.04 ± 0.474
AspTrp: 1.347 ± 0.278
AspTyr: 2.637 ± 0.408
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.33 ± 0.813
GluCys: 0.337 ± 0.147
GluAsp: 4.096 ± 0.463
GluGlu: 5.442 ± 1.079
GluPhe: 2.581 ± 0.344
GluGly: 3.142 ± 0.465
GluHis: 1.403 ± 0.336
GluIle: 6.228 ± 0.66
GluLys: 6.621 ± 0.829
GluLeu: 6.06 ± 0.627
GluMet: 1.234 ± 0.329
GluAsn: 4.769 ± 0.532
GluPro: 1.403 ± 0.315
GluGln: 2.469 ± 0.41
GluArg: 2.805 ± 0.807
GluSer: 4.994 ± 0.623
GluThr: 4.32 ± 0.476
GluVal: 4.881 ± 0.548
GluTrp: 0.617 ± 0.168
GluTyr: 1.852 ± 0.396
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.3 ± 0.308
PheCys: 0.168 ± 0.11
PheAsp: 3.759 ± 0.418
PheGlu: 2.693 ± 0.358
PhePhe: 1.852 ± 0.276
PheGly: 2.188 ± 0.31
PheHis: 0.842 ± 0.227
PheIle: 3.366 ± 0.439
PheLys: 4.713 ± 0.522
PheLeu: 3.535 ± 0.468
PheMet: 1.571 ± 0.291
PheAsn: 3.31 ± 0.419
PhePro: 1.515 ± 0.248
PheGln: 1.403 ± 0.333
PheArg: 1.795 ± 0.325
PheSer: 4.264 ± 0.461
PheThr: 2.525 ± 0.373
PheVal: 2.413 ± 0.345
PheTrp: 0.505 ± 0.151
PheTyr: 1.515 ± 0.24
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.984 ± 0.581
GlyCys: 0.673 ± 0.19
GlyAsp: 3.479 ± 0.453
GlyGlu: 4.096 ± 0.509
GlyPhe: 3.535 ± 0.511
GlyGly: 4.881 ± 0.887
GlyHis: 0.898 ± 0.212
GlyIle: 3.928 ± 0.522
GlyLys: 4.937 ± 0.614
GlyLeu: 4.376 ± 0.546
GlyMet: 1.178 ± 0.247
GlyAsn: 3.703 ± 0.513
GlyPro: 0.617 ± 0.236
GlyGln: 1.795 ± 0.303
GlyArg: 2.974 ± 0.57
GlySer: 4.32 ± 0.558
GlyThr: 4.152 ± 0.507
GlyVal: 4.432 ± 0.771
GlyTrp: 0.505 ± 0.142
GlyTyr: 2.02 ± 0.404
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.954 ± 0.193
HisCys: 0.224 ± 0.108
HisAsp: 0.729 ± 0.188
HisGlu: 0.786 ± 0.193
HisPhe: 0.898 ± 0.271
HisGly: 1.234 ± 0.301
HisHis: 0.281 ± 0.141
HisIle: 0.954 ± 0.257
HisLys: 1.459 ± 0.341
HisLeu: 1.571 ± 0.314
HisMet: 0.337 ± 0.157
HisAsn: 0.842 ± 0.213
HisPro: 0.673 ± 0.184
HisGln: 0.729 ± 0.23
HisArg: 0.786 ± 0.191
HisSer: 1.01 ± 0.209
HisThr: 0.617 ± 0.179
HisVal: 0.673 ± 0.179
HisTrp: 0.393 ± 0.138
HisTyr: 0.449 ± 0.146
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.928 ± 0.46
IleCys: 0.561 ± 0.182
IleAsp: 5.33 ± 0.475
IleGlu: 6.003 ± 0.72
IlePhe: 1.964 ± 0.341
IleGly: 2.918 ± 0.433
IleHis: 0.842 ± 0.241
IleIle: 3.984 ± 0.53
IleLys: 6.845 ± 0.795
IleLeu: 5.106 ± 0.567
IleMet: 1.403 ± 0.268
IleAsn: 5.106 ± 0.612
IlePro: 2.3 ± 0.341
IleGln: 1.459 ± 0.274
IleArg: 2.3 ± 0.405
IleSer: 5.667 ± 0.505
IleThr: 4.208 ± 0.56
IleVal: 4.657 ± 0.508
IleTrp: 0.786 ± 0.199
IleTyr: 2.413 ± 0.424
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.06 ± 0.637
LysCys: 0.729 ± 0.197
LysAsp: 5.162 ± 0.754
LysGlu: 7.911 ± 1.299
LysPhe: 4.264 ± 0.558
LysGly: 5.667 ± 0.745
LysHis: 1.964 ± 0.474
LysIle: 5.442 ± 0.532
LysLys: 12.68 ± 2.126
LysLeu: 6.677 ± 0.833
LysMet: 2.244 ± 0.448
LysAsn: 5.667 ± 0.688
LysPro: 1.964 ± 0.371
LysGln: 2.918 ± 0.489
LysArg: 2.805 ± 0.43
LysSer: 6.452 ± 0.759
LysThr: 6.901 ± 0.6
LysVal: 4.657 ± 0.588
LysTrp: 1.122 ± 0.251
LysTyr: 2.749 ± 0.357
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.106 ± 0.574
LeuCys: 1.178 ± 0.224
LeuAsp: 5.499 ± 0.497
LeuGlu: 6.003 ± 0.647
LeuPhe: 2.974 ± 0.418
LeuGly: 4.769 ± 0.55
LeuHis: 1.066 ± 0.253
LeuIle: 5.218 ± 0.597
LeuLys: 8.865 ± 0.79
LeuLeu: 5.891 ± 0.621
LeuMet: 2.02 ± 0.347
LeuAsn: 5.611 ± 0.609
LeuPro: 2.581 ± 0.328
LeuGln: 2.637 ± 0.343
LeuArg: 3.366 ± 0.483
LeuSer: 7.631 ± 0.618
LeuThr: 3.984 ± 0.444
LeuVal: 4.657 ± 0.442
LeuTrp: 1.01 ± 0.22
LeuTyr: 2.525 ± 0.442
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.739 ± 0.315
MetCys: 0.056 ± 0.051
MetAsp: 0.729 ± 0.185
MetGlu: 1.29 ± 0.236
MetPhe: 0.673 ± 0.187
MetGly: 1.066 ± 0.322
MetHis: 0.449 ± 0.162
MetIle: 1.01 ± 0.224
MetLys: 2.132 ± 0.354
MetLeu: 1.066 ± 0.221
MetMet: 0.561 ± 0.172
MetAsn: 1.852 ± 0.304
MetPro: 0.842 ± 0.243
MetGln: 1.403 ± 0.243
MetArg: 1.066 ± 0.236
MetSer: 1.795 ± 0.314
MetThr: 1.122 ± 0.274
MetVal: 0.561 ± 0.157
MetTrp: 0.112 ± 0.086
MetTyr: 0.561 ± 0.158
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.423 ± 0.595
AsnCys: 0.561 ± 0.162
AsnAsp: 3.198 ± 0.517
AsnGlu: 4.825 ± 0.495
AsnPhe: 3.198 ± 0.483
AsnGly: 3.984 ± 0.501
AsnHis: 0.842 ± 0.187
AsnIle: 4.376 ± 0.459
AsnLys: 5.779 ± 0.674
AsnLeu: 6.508 ± 0.673
AsnMet: 1.01 ± 0.252
AsnAsn: 3.928 ± 0.531
AsnPro: 2.02 ± 0.303
AsnGln: 2.693 ± 0.449
AsnArg: 2.469 ± 0.36
AsnSer: 5.274 ± 0.601
AsnThr: 2.749 ± 0.432
AsnVal: 4.208 ± 0.509
AsnTrp: 0.617 ± 0.153
AsnTyr: 2.469 ± 0.344
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.964 ± 0.348
ProCys: 0.168 ± 0.113
ProAsp: 2.076 ± 0.394
ProGlu: 2.3 ± 0.39
ProPhe: 1.795 ± 0.306
ProGly: 1.066 ± 0.299
ProHis: 0.673 ± 0.192
ProIle: 1.347 ± 0.286
ProLys: 1.908 ± 0.294
ProLeu: 2.581 ± 0.354
ProMet: 0.393 ± 0.146
ProAsn: 1.739 ± 0.325
ProPro: 0.898 ± 0.256
ProGln: 1.01 ± 0.209
ProArg: 0.729 ± 0.214
ProSer: 2.188 ± 0.341
ProThr: 1.739 ± 0.366
ProVal: 2.188 ± 0.437
ProTrp: 0.112 ± 0.076
ProTyr: 1.122 ± 0.231
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.581 ± 0.352
GlnCys: 0.224 ± 0.118
GlnAsp: 1.627 ± 0.27
GlnGlu: 2.188 ± 0.357
GlnPhe: 1.739 ± 0.271
GlnGly: 1.627 ± 0.309
GlnHis: 0.673 ± 0.264
GlnIle: 2.469 ± 0.451
GlnLys: 2.637 ± 0.489
GlnLeu: 2.974 ± 0.389
GlnMet: 0.673 ± 0.204
GlnAsn: 2.188 ± 0.279
GlnPro: 0.842 ± 0.202
GlnGln: 1.178 ± 0.254
GlnArg: 1.627 ± 0.46
GlnSer: 2.132 ± 0.306
GlnThr: 1.908 ± 0.279
GlnVal: 2.02 ± 0.389
GlnTrp: 0.224 ± 0.119
GlnTyr: 1.29 ± 0.276
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.198 ± 0.588
ArgCys: 0.337 ± 0.141
ArgAsp: 2.188 ± 0.366
ArgGlu: 2.076 ± 0.613
ArgPhe: 1.683 ± 0.336
ArgGly: 2.076 ± 0.508
ArgHis: 0.729 ± 0.199
ArgIle: 2.918 ± 0.42
ArgLys: 3.31 ± 0.384
ArgLeu: 3.759 ± 0.46
ArgMet: 0.954 ± 0.244
ArgAsn: 2.637 ± 0.438
ArgPro: 0.954 ± 0.259
ArgGln: 1.739 ± 0.413
ArgArg: 2.076 ± 0.383
ArgSer: 2.581 ± 0.435
ArgThr: 2.02 ± 0.317
ArgVal: 2.413 ± 0.463
ArgTrp: 0.617 ± 0.238
ArgTyr: 1.178 ± 0.271
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.713 ± 0.609
SerCys: 0.449 ± 0.187
SerAsp: 4.657 ± 0.498
SerGlu: 4.713 ± 0.45
SerPhe: 5.106 ± 0.488
SerGly: 5.499 ± 0.712
SerHis: 1.234 ± 0.277
SerIle: 5.162 ± 0.494
SerLys: 6.284 ± 0.833
SerLeu: 6.06 ± 0.616
SerMet: 1.122 ± 0.236
SerAsn: 5.555 ± 0.577
SerPro: 2.076 ± 0.309
SerGln: 2.525 ± 0.307
SerArg: 2.974 ± 0.402
SerSer: 6.396 ± 0.739
SerThr: 4.825 ± 0.569
SerVal: 5.05 ± 0.572
SerTrp: 0.729 ± 0.235
SerTyr: 2.749 ± 0.458
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.647 ± 0.517
ThrCys: 0.449 ± 0.208
ThrAsp: 3.366 ± 0.455
ThrGlu: 3.984 ± 0.501
ThrPhe: 2.693 ± 0.358
ThrGly: 4.432 ± 0.565
ThrHis: 0.729 ± 0.236
ThrIle: 5.106 ± 0.498
ThrLys: 4.545 ± 0.48
ThrLeu: 5.05 ± 0.524
ThrMet: 0.898 ± 0.259
ThrAsn: 2.918 ± 0.406
ThrPro: 2.525 ± 0.301
ThrGln: 1.795 ± 0.334
ThrArg: 2.413 ± 0.386
ThrSer: 4.264 ± 0.434
ThrThr: 3.086 ± 0.486
ThrVal: 3.871 ± 0.395
ThrTrp: 0.449 ± 0.144
ThrTyr: 2.244 ± 0.408
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.106 ± 0.989
ValCys: 0.449 ± 0.156
ValAsp: 3.871 ± 0.487
ValGlu: 4.432 ± 0.588
ValPhe: 3.366 ± 0.348
ValGly: 3.198 ± 0.481
ValHis: 0.898 ± 0.232
ValIle: 4.264 ± 0.526
ValLys: 5.33 ± 0.466
ValLeu: 5.442 ± 0.677
ValMet: 1.122 ± 0.232
ValAsn: 4.937 ± 0.51
ValPro: 1.459 ± 0.245
ValGln: 1.683 ± 0.295
ValArg: 2.244 ± 0.322
ValSer: 5.499 ± 0.63
ValThr: 3.759 ± 0.455
ValVal: 3.928 ± 0.525
ValTrp: 0.673 ± 0.13
ValTyr: 1.852 ± 0.38
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.786 ± 0.201
TrpCys: 0.112 ± 0.083
TrpAsp: 0.729 ± 0.208
TrpGlu: 0.673 ± 0.177
TrpPhe: 0.673 ± 0.226
TrpGly: 0.673 ± 0.194
TrpHis: 0.168 ± 0.08
TrpIle: 0.561 ± 0.168
TrpLys: 0.505 ± 0.159
TrpLeu: 0.898 ± 0.201
TrpMet: 0.337 ± 0.127
TrpAsn: 0.561 ± 0.218
TrpPro: 0.281 ± 0.122
TrpGln: 0.673 ± 0.231
TrpArg: 0.505 ± 0.162
TrpSer: 0.842 ± 0.233
TrpThr: 0.617 ± 0.198
TrpVal: 1.178 ± 0.249
TrpTrp: 0.168 ± 0.1
TrpTyr: 0.393 ± 0.141
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.403 ± 0.333
TyrCys: 0.168 ± 0.105
TyrAsp: 1.627 ± 0.274
TyrGlu: 2.244 ± 0.286
TyrPhe: 1.459 ± 0.295
TyrGly: 2.244 ± 0.34
TyrHis: 0.393 ± 0.143
TyrIle: 2.132 ± 0.411
TyrLys: 3.254 ± 0.454
TyrLeu: 3.366 ± 0.434
TyrMet: 0.729 ± 0.204
TyrAsn: 1.964 ± 0.283
TyrPro: 1.459 ± 0.254
TyrGln: 1.29 ± 0.261
TyrArg: 1.29 ± 0.257
TyrSer: 2.413 ± 0.347
TyrThr: 1.852 ± 0.344
TyrVal: 2.188 ± 0.315
TyrTrp: 0.393 ± 0.139
TyrTyr: 1.403 ± 0.263
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (17824 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
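Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates is reported. This is only a sketch under stated assumptions. Taking the sample standard deviation of the replicates as the error, counting overlapping pairs, and the names `dipeptide_permille` and `bootstrap_error` are all assumptions; the source states only that 100 bootstrap rounds at the protein level were used.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_permille(proteins):
    """Per-mille frequencies of overlapping dipeptides, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    # Assumption: the reported error is the standard deviation of the replicates.
    return stdev(estimates)


# Hypothetical toy proteome; real input would be the 79 protein sequences.
toy = ["MKKLA", "AKKE", "GGSGG"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # pair absent from every protein
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.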

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski