Amino acid dipeptide frequency for Faecalibacterium phage FP_Brigit

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
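A dipeptide frequency of this kind can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the single-letter input alphabet, and the choice to normalise by the total number of dipeptides counted are all assumptions, since the page does not state which denominator it uses.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of dipeptides counted.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input: "U" (selenocysteine) is non-standard and is counted as Xaa.
freqs = dipeptide_permille(["MAAK", "AAU"])
```

Here the two toy sequences yield five dipeptides (MA, AA, AK, AA, AX), so `freqs["AA"]` is 400‰.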

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every amino acid equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red and those that are underrepresented are marked in blue in the table.
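The uniform expectation and the over/under comparison can be sketched as follows. The helper `classify` is hypothetical, and comparing against the uniform value of 1/400 is an assumption based on the paragraph above; the page does not describe the exact rule behind its colouring. The three example values are taken from the table.

```python
N_STANDARD = 20

# 1 / (20 * 20) expressed in per mille: 2.5 per mille, i.e. 0.25 %.
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Observed frequencies (per mille) copied from the table.
examples = {"AlaAla": 10.365, "CysTrp": 0.21, "AspPro": 2.473}
labels = {name: classify(value) for name, value in examples.items()}
```

With these inputs AlaAla is labelled "over", while CysTrp and AspPro fall below 2.5‰ and are labelled "under".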

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
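The unit conversion is simple arithmetic; a small sketch (the helper names are illustrative only) makes the scale explicit:

```python
def permille_to_fraction(value_permille):
    """Per mille to a plain fraction: multiply by 10^-3."""
    return value_permille / 1000.0

def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0

# AlaAla is listed at 10.365 per mille in the table.
ala_ala_fraction = permille_to_fraction(10.365)
ala_ala_percent = permille_to_percent(10.365)
```

So the AlaAla entry of 10.365‰ corresponds to a fraction of about 0.0104, or roughly 1.04%.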

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.365 ± 0.975
AlaCys: 1.21 ± 0.277
AlaAsp: 6.472 ± 0.688
AlaGlu: 7.366 ± 0.654
AlaPhe: 3.262 ± 0.436
AlaGly: 6.524 ± 0.764
AlaHis: 1.421 ± 0.257
AlaIle: 4.683 ± 0.687
AlaLys: 5.525 ± 0.566
AlaLeu: 7.682 ± 0.675
AlaMet: 2.473 ± 0.33
AlaAsn: 2.683 ± 0.363
AlaPro: 2.841 ± 0.391
AlaGln: 3.63 ± 0.553
AlaArg: 4.42 ± 0.461
AlaSer: 5.156 ± 0.59
AlaThr: 5.577 ± 0.78
AlaVal: 7.103 ± 0.783
AlaTrp: 1.315 ± 0.312
AlaTyr: 2.578 ± 0.538
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.789 ± 0.204
CysCys: 0.474 ± 0.13
CysAsp: 0.894 ± 0.213
CysGlu: 1.368 ± 0.315
CysPhe: 0.631 ± 0.216
CysGly: 1.421 ± 0.284
CysHis: 0.316 ± 0.125
CysIle: 1.052 ± 0.234
CysLys: 0.526 ± 0.162
CysLeu: 0.737 ± 0.208
CysMet: 0.421 ± 0.134
CysAsn: 0.474 ± 0.167
CysPro: 0.631 ± 0.174
CysGln: 0.368 ± 0.116
CysArg: 0.789 ± 0.202
CysSer: 1.105 ± 0.248
CysThr: 1.368 ± 0.252
CysVal: 1.052 ± 0.226
CysTrp: 0.21 ± 0.094
CysTyr: 1.0 ± 0.259
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.735 ± 0.579
AspCys: 1.315 ± 0.296
AspAsp: 4.946 ± 0.564
AspGlu: 5.156 ± 0.509
AspPhe: 2.42 ± 0.382
AspGly: 7.05 ± 0.561
AspHis: 0.842 ± 0.209
AspIle: 4.262 ± 0.456
AspLys: 4.314 ± 0.512
AspLeu: 5.261 ± 0.527
AspMet: 2.105 ± 0.334
AspAsn: 2.683 ± 0.452
AspPro: 2.473 ± 0.363
AspGln: 1.631 ± 0.288
AspArg: 3.104 ± 0.443
AspSer: 3.473 ± 0.45
AspThr: 3.63 ± 0.543
AspVal: 4.683 ± 0.481
AspTrp: 1.105 ± 0.223
AspTyr: 2.526 ± 0.347
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.314 ± 0.627
GluCys: 1.0 ± 0.253
GluAsp: 4.314 ± 0.45
GluGlu: 5.525 ± 0.754
GluPhe: 2.21 ± 0.359
GluGly: 3.946 ± 0.564
GluHis: 1.21 ± 0.27
GluIle: 3.736 ± 0.441
GluLys: 4.841 ± 0.483
GluLeu: 6.209 ± 0.732
GluMet: 2.946 ± 0.444
GluAsn: 2.946 ± 0.446
GluPro: 2.262 ± 0.375
GluGln: 3.473 ± 0.44
GluArg: 4.367 ± 0.61
GluSer: 4.42 ± 0.456
GluThr: 3.525 ± 0.416
GluVal: 4.788 ± 0.583
GluTrp: 1.21 ± 0.209
GluTyr: 2.683 ± 0.4
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.631 ± 0.327
PheCys: 0.474 ± 0.135
PheAsp: 2.578 ± 0.448
PheGlu: 1.947 ± 0.278
PhePhe: 1.052 ± 0.227
PheGly: 3.315 ± 0.466
PheHis: 0.421 ± 0.158
PheIle: 1.263 ± 0.233
PheLys: 1.736 ± 0.238
PheLeu: 1.894 ± 0.33
PheMet: 0.631 ± 0.183
PheAsn: 1.315 ± 0.237
PhePro: 1.158 ± 0.296
PheGln: 0.842 ± 0.21
PheArg: 2.157 ± 0.345
PheSer: 2.157 ± 0.33
PheThr: 2.21 ± 0.387
PheVal: 2.21 ± 0.319
PheTrp: 0.526 ± 0.166
PheTyr: 1.421 ± 0.265
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.735 ± 0.708
GlyCys: 1.315 ± 0.273
GlyAsp: 4.893 ± 0.526
GlyGlu: 4.63 ± 0.453
GlyPhe: 2.631 ± 0.371
GlyGly: 4.788 ± 0.658
GlyHis: 1.315 ± 0.29
GlyIle: 4.42 ± 0.658
GlyLys: 6.209 ± 0.524
GlyLeu: 4.525 ± 0.48
GlyMet: 2.473 ± 0.29
GlyAsn: 2.578 ± 0.408
GlyPro: 1.21 ± 0.234
GlyGln: 1.631 ± 0.291
GlyArg: 3.367 ± 0.479
GlySer: 4.841 ± 0.782
GlyThr: 4.104 ± 0.528
GlyVal: 5.419 ± 0.735
GlyTrp: 1.421 ± 0.285
GlyTyr: 3.946 ± 0.418
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.105 ± 0.288
HisCys: 0.526 ± 0.172
HisAsp: 1.105 ± 0.276
HisGlu: 0.789 ± 0.2
HisPhe: 0.526 ± 0.215
HisGly: 1.052 ± 0.194
HisHis: 0.421 ± 0.133
HisIle: 0.579 ± 0.198
HisLys: 0.737 ± 0.185
HisLeu: 1.263 ± 0.254
HisMet: 0.474 ± 0.139
HisAsn: 0.631 ± 0.183
HisPro: 0.421 ± 0.141
HisGln: 0.368 ± 0.132
HisArg: 0.579 ± 0.174
HisSer: 0.947 ± 0.212
HisThr: 0.894 ± 0.215
HisVal: 0.947 ± 0.222
HisTrp: 0.21 ± 0.099
HisTyr: 0.474 ± 0.13
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.367 ± 0.54
IleCys: 0.947 ± 0.224
IleAsp: 3.946 ± 0.426
IleGlu: 3.736 ± 0.421
IlePhe: 1.736 ± 0.236
IleGly: 3.052 ± 0.613
IleHis: 0.631 ± 0.184
IleIle: 3.104 ± 0.37
IleLys: 3.473 ± 0.576
IleLeu: 3.736 ± 0.513
IleMet: 1.0 ± 0.27
IleAsn: 1.684 ± 0.301
IlePro: 2.631 ± 0.316
IleGln: 1.684 ± 0.311
IleArg: 3.104 ± 0.361
IleSer: 3.683 ± 0.521
IleThr: 3.999 ± 0.458
IleVal: 3.894 ± 0.441
IleTrp: 0.263 ± 0.113
IleTyr: 1.473 ± 0.276
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.84 ± 0.739
LysCys: 0.737 ± 0.187
LysAsp: 4.367 ± 0.438
LysGlu: 4.578 ± 0.636
LysPhe: 1.158 ± 0.278
LysGly: 3.894 ± 0.454
LysHis: 0.631 ± 0.206
LysIle: 3.315 ± 0.469
LysLys: 4.051 ± 0.495
LysLeu: 5.788 ± 0.54
LysMet: 1.842 ± 0.329
LysAsn: 3.578 ± 0.503
LysPro: 3.42 ± 0.487
LysGln: 2.526 ± 0.305
LysArg: 2.894 ± 0.429
LysSer: 3.999 ± 0.522
LysThr: 3.894 ± 0.396
LysVal: 4.946 ± 0.612
LysTrp: 0.842 ± 0.201
LysTyr: 1.789 ± 0.305
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.103 ± 0.66
LeuCys: 1.158 ± 0.241
LeuAsp: 5.998 ± 0.54
LeuGlu: 5.577 ± 0.57
LeuPhe: 2.157 ± 0.304
LeuGly: 4.262 ± 0.524
LeuHis: 1.21 ± 0.242
LeuIle: 3.841 ± 0.476
LeuLys: 5.84 ± 0.502
LeuLeu: 6.261 ± 0.71
LeuMet: 2.894 ± 0.406
LeuAsn: 3.736 ± 0.376
LeuPro: 2.999 ± 0.388
LeuGln: 2.105 ± 0.392
LeuArg: 4.42 ± 0.466
LeuSer: 4.788 ± 0.531
LeuThr: 5.261 ± 0.505
LeuVal: 4.998 ± 0.572
LeuTrp: 0.789 ± 0.171
LeuTyr: 2.894 ± 0.377
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.473 ± 0.367
MetCys: 0.263 ± 0.113
MetAsp: 1.684 ± 0.239
MetGlu: 1.368 ± 0.254
MetPhe: 0.789 ± 0.194
MetGly: 1.789 ± 0.246
MetHis: 0.21 ± 0.1
MetIle: 1.0 ± 0.24
MetLys: 1.894 ± 0.339
MetLeu: 2.368 ± 0.311
MetMet: 0.631 ± 0.172
MetAsn: 1.789 ± 0.318
MetPro: 1.368 ± 0.226
MetGln: 1.0 ± 0.242
MetArg: 1.473 ± 0.295
MetSer: 2.157 ± 0.357
MetThr: 2.368 ± 0.418
MetVal: 1.631 ± 0.32
MetTrp: 0.474 ± 0.182
MetTyr: 0.842 ± 0.205
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.683 ± 0.494
AsnCys: 0.631 ± 0.18
AsnAsp: 2.683 ± 0.395
AsnGlu: 2.526 ± 0.427
AsnPhe: 0.894 ± 0.205
AsnGly: 4.367 ± 0.57
AsnHis: 0.684 ± 0.178
AsnIle: 2.157 ± 0.305
AsnLys: 1.999 ± 0.352
AsnLeu: 2.946 ± 0.36
AsnMet: 1.052 ± 0.228
AsnAsn: 1.421 ± 0.362
AsnPro: 1.894 ± 0.329
AsnGln: 1.421 ± 0.246
AsnArg: 1.789 ± 0.318
AsnSer: 2.052 ± 0.455
AsnThr: 2.262 ± 0.396
AsnVal: 2.262 ± 0.358
AsnTrp: 0.368 ± 0.124
AsnTyr: 1.315 ± 0.258
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.735 ± 0.531
ProCys: 0.316 ± 0.145
ProAsp: 2.999 ± 0.362
ProGlu: 3.578 ± 0.535
ProPhe: 1.368 ± 0.234
ProGly: 2.368 ± 0.339
ProHis: 0.474 ± 0.147
ProIle: 1.263 ± 0.275
ProLys: 1.842 ± 0.34
ProLeu: 2.736 ± 0.425
ProMet: 0.842 ± 0.225
ProAsn: 1.368 ± 0.315
ProPro: 1.736 ± 0.352
ProGln: 1.368 ± 0.341
ProArg: 1.21 ± 0.259
ProSer: 2.315 ± 0.455
ProThr: 2.262 ± 0.32
ProVal: 3.262 ± 0.348
ProTrp: 0.526 ± 0.161
ProTyr: 1.21 ± 0.28
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.157 ± 0.514
GlnCys: 0.474 ± 0.186
GlnAsp: 1.473 ± 0.359
GlnGlu: 2.368 ± 0.391
GlnPhe: 1.473 ± 0.27
GlnGly: 1.631 ± 0.275
GlnHis: 0.526 ± 0.186
GlnIle: 1.947 ± 0.333
GlnLys: 2.999 ± 0.414
GlnLeu: 2.789 ± 0.412
GlnMet: 1.315 ± 0.217
GlnAsn: 1.842 ± 0.304
GlnPro: 1.473 ± 0.262
GlnGln: 1.842 ± 0.294
GlnArg: 1.947 ± 0.374
GlnSer: 1.789 ± 0.299
GlnThr: 2.105 ± 0.391
GlnVal: 2.368 ± 0.326
GlnTrp: 0.368 ± 0.129
GlnTyr: 0.947 ± 0.222
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.841 ± 0.467
ArgCys: 0.947 ± 0.232
ArgAsp: 2.946 ± 0.39
ArgGlu: 4.367 ± 0.446
ArgPhe: 1.842 ± 0.344
ArgGly: 3.104 ± 0.415
ArgHis: 0.737 ± 0.236
ArgIle: 2.578 ± 0.399
ArgLys: 3.788 ± 0.481
ArgLeu: 4.998 ± 0.479
ArgMet: 1.368 ± 0.254
ArgAsn: 1.736 ± 0.278
ArgPro: 2.262 ± 0.367
ArgGln: 2.21 ± 0.379
ArgArg: 3.63 ± 0.533
ArgSer: 2.999 ± 0.346
ArgThr: 2.841 ± 0.423
ArgVal: 2.736 ± 0.355
ArgTrp: 0.737 ± 0.216
ArgTyr: 2.105 ± 0.412
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.419 ± 0.55
SerCys: 0.631 ± 0.217
SerAsp: 3.946 ± 0.443
SerGlu: 4.104 ± 0.541
SerPhe: 2.157 ± 0.369
SerGly: 5.209 ± 0.67
SerHis: 0.631 ± 0.19
SerIle: 4.209 ± 0.53
SerLys: 3.788 ± 0.422
SerLeu: 4.893 ± 0.478
SerMet: 1.631 ± 0.254
SerAsn: 2.315 ± 0.398
SerPro: 1.894 ± 0.345
SerGln: 1.578 ± 0.236
SerArg: 2.736 ± 0.41
SerSer: 4.314 ± 1.022
SerThr: 4.367 ± 0.646
SerVal: 5.104 ± 0.566
SerTrp: 0.631 ± 0.221
SerTyr: 2.683 ± 0.383
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.629 ± 0.662
ThrCys: 0.789 ± 0.192
ThrAsp: 3.841 ± 0.398
ThrGlu: 3.788 ± 0.483
ThrPhe: 1.684 ± 0.268
ThrGly: 4.998 ± 0.512
ThrHis: 0.737 ± 0.198
ThrIle: 3.157 ± 0.352
ThrLys: 3.788 ± 0.474
ThrLeu: 4.209 ± 0.575
ThrMet: 1.789 ± 0.293
ThrAsn: 2.526 ± 0.488
ThrPro: 2.631 ± 0.423
ThrGln: 2.157 ± 0.353
ThrArg: 2.315 ± 0.34
ThrSer: 4.209 ± 0.46
ThrThr: 4.42 ± 0.533
ThrVal: 5.682 ± 0.661
ThrTrp: 0.579 ± 0.193
ThrTyr: 2.789 ± 0.299
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.314 ± 0.659
ValCys: 1.052 ± 0.299
ValAsp: 5.577 ± 0.527
ValGlu: 5.314 ± 0.487
ValPhe: 2.368 ± 0.321
ValGly: 4.104 ± 0.552
ValHis: 0.842 ± 0.184
ValIle: 4.051 ± 0.617
ValLys: 5.314 ± 0.513
ValLeu: 4.841 ± 0.446
ValMet: 1.315 ± 0.27
ValAsn: 1.947 ± 0.31
ValPro: 2.736 ± 0.365
ValGln: 2.999 ± 0.356
ValArg: 4.209 ± 0.423
ValSer: 5.051 ± 0.544
ValThr: 5.209 ± 0.836
ValVal: 5.104 ± 0.669
ValTrp: 1.158 ± 0.215
ValTyr: 2.262 ± 0.293
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.842 ± 0.199
TrpCys: 0.474 ± 0.175
TrpAsp: 1.263 ± 0.248
TrpGlu: 0.894 ± 0.188
TrpPhe: 0.368 ± 0.131
TrpGly: 0.368 ± 0.135
TrpHis: 0.316 ± 0.116
TrpIle: 0.421 ± 0.154
TrpLys: 0.894 ± 0.178
TrpLeu: 1.368 ± 0.267
TrpMet: 0.579 ± 0.186
TrpAsn: 0.631 ± 0.169
TrpPro: 0.474 ± 0.158
TrpGln: 0.579 ± 0.22
TrpArg: 0.737 ± 0.234
TrpSer: 1.0 ± 0.239
TrpThr: 0.737 ± 0.201
TrpVal: 0.789 ± 0.16
TrpTrp: 0.316 ± 0.117
TrpTyr: 0.474 ± 0.169
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.21 ± 0.427
TyrCys: 0.894 ± 0.179
TyrAsp: 2.789 ± 0.385
TyrGlu: 2.946 ± 0.384
TyrPhe: 1.21 ± 0.294
TyrGly: 3.21 ± 0.452
TyrHis: 0.474 ± 0.146
TyrIle: 2.105 ± 0.284
TyrLys: 1.578 ± 0.297
TyrLeu: 2.789 ± 0.372
TyrMet: 0.737 ± 0.187
TyrAsn: 1.526 ± 0.245
TyrPro: 1.263 ± 0.244
TyrGln: 1.421 ± 0.238
TyrArg: 2.526 ± 0.402
TyrSer: 1.789 ± 0.309
TyrThr: 1.684 ± 0.329
TyrVal: 2.789 ± 0.413
TyrTrp: 0.474 ± 0.144
TyrTyr: 1.473 ± 0.277
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (19007 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
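Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the function names are hypothetical, and reporting the standard deviation of the replicates as the ± value is an assumption, since the page only says that 100 bootstrap rounds were used.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy proteome (one-letter codes); real input would be the 94 protein sequences.
mean_aa, err_aa = bootstrap_error(["MAAK", "AAAG", "MKLV", "GAAS"], "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.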

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski