Amino acid dipepetide frequency for Aeromonas phage CF7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.634AlaAla: 13.634 ± 1.579
1.136AlaCys: 1.136 ± 0.278
6.135AlaAsp: 6.135 ± 0.598
7.953AlaGlu: 7.953 ± 0.819
3.03AlaPhe: 3.03 ± 0.446
9.09AlaGly: 9.09 ± 1.04
1.894AlaHis: 1.894 ± 0.349
4.393AlaIle: 4.393 ± 0.684
5.908AlaLys: 5.908 ± 0.914
9.923AlaLeu: 9.923 ± 1.039
2.651AlaMet: 2.651 ± 0.445
4.469AlaAsn: 4.469 ± 0.498
3.181AlaPro: 3.181 ± 0.447
4.999AlaGln: 4.999 ± 0.95
6.363AlaArg: 6.363 ± 0.814
4.772AlaSer: 4.772 ± 0.431
5.757AlaThr: 5.757 ± 0.792
7.272AlaVal: 7.272 ± 0.932
0.985AlaTrp: 0.985 ± 0.29
4.166AlaTyr: 4.166 ± 0.707
0.0AlaXaa: 0.0 ± 0.0
Cys
1.212CysAla: 1.212 ± 0.303
0.227CysCys: 0.227 ± 0.137
0.606CysAsp: 0.606 ± 0.242
0.53CysGlu: 0.53 ± 0.192
0.379CysPhe: 0.379 ± 0.155
0.985CysGly: 0.985 ± 0.339
0.454CysHis: 0.454 ± 0.169
0.682CysIle: 0.682 ± 0.244
0.454CysLys: 0.454 ± 0.234
0.909CysLeu: 0.909 ± 0.311
0.53CysMet: 0.53 ± 0.196
0.303CysAsn: 0.303 ± 0.135
0.53CysPro: 0.53 ± 0.178
0.379CysGln: 0.379 ± 0.162
0.757CysArg: 0.757 ± 0.264
0.53CysSer: 0.53 ± 0.218
1.288CysThr: 1.288 ± 0.371
0.985CysVal: 0.985 ± 0.252
0.151CysTrp: 0.151 ± 0.112
0.379CysTyr: 0.379 ± 0.23
0.0CysXaa: 0.0 ± 0.0
Asp
5.605AspAla: 5.605 ± 0.486
0.833AspCys: 0.833 ± 0.253
3.333AspAsp: 3.333 ± 0.607
4.09AspGlu: 4.09 ± 0.732
2.651AspPhe: 2.651 ± 0.466
4.015AspGly: 4.015 ± 0.599
0.909AspHis: 0.909 ± 0.294
3.181AspIle: 3.181 ± 0.475
3.636AspLys: 3.636 ± 0.403
4.923AspLeu: 4.923 ± 0.774
2.045AspMet: 2.045 ± 0.418
2.272AspAsn: 2.272 ± 0.333
2.424AspPro: 2.424 ± 0.4
2.424AspGln: 2.424 ± 0.369
2.954AspArg: 2.954 ± 0.421
4.09AspSer: 4.09 ± 0.442
2.878AspThr: 2.878 ± 0.607
3.333AspVal: 3.333 ± 0.474
1.06AspTrp: 1.06 ± 0.331
2.575AspTyr: 2.575 ± 0.318
0.0AspXaa: 0.0 ± 0.0
Glu
6.893GluAla: 6.893 ± 0.717
0.757GluCys: 0.757 ± 0.249
3.636GluAsp: 3.636 ± 0.749
3.181GluGlu: 3.181 ± 0.576
1.969GluPhe: 1.969 ± 0.408
4.999GluGly: 4.999 ± 0.708
1.439GluHis: 1.439 ± 0.331
1.288GluIle: 1.288 ± 0.333
3.257GluLys: 3.257 ± 0.585
6.514GluLeu: 6.514 ± 0.863
1.288GluMet: 1.288 ± 0.344
1.363GluAsn: 1.363 ± 0.254
1.818GluPro: 1.818 ± 0.426
4.09GluGln: 4.09 ± 0.686
3.939GluArg: 3.939 ± 0.557
3.181GluSer: 3.181 ± 0.423
3.409GluThr: 3.409 ± 0.457
4.242GluVal: 4.242 ± 0.433
1.212GluTrp: 1.212 ± 0.254
2.045GluTyr: 2.045 ± 0.267
0.0GluXaa: 0.0 ± 0.0
Phe
2.424PheAla: 2.424 ± 0.347
0.53PheCys: 0.53 ± 0.251
3.03PheAsp: 3.03 ± 0.389
1.818PheGlu: 1.818 ± 0.349
1.06PhePhe: 1.06 ± 0.278
3.03PheGly: 3.03 ± 0.573
0.379PheHis: 0.379 ± 0.132
1.818PheIle: 1.818 ± 0.368
2.575PheLys: 2.575 ± 0.489
2.348PheLeu: 2.348 ± 0.371
1.212PheMet: 1.212 ± 0.258
1.439PheAsn: 1.439 ± 0.29
0.682PhePro: 0.682 ± 0.211
1.515PheGln: 1.515 ± 0.327
1.818PheArg: 1.818 ± 0.273
2.121PheSer: 2.121 ± 0.491
1.439PheThr: 1.439 ± 0.322
2.424PheVal: 2.424 ± 0.35
0.303PheTrp: 0.303 ± 0.116
0.985PheTyr: 0.985 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
7.802GlyAla: 7.802 ± 1.055
1.288GlyCys: 1.288 ± 0.425
6.135GlyAsp: 6.135 ± 0.568
3.409GlyGlu: 3.409 ± 0.444
2.954GlyPhe: 2.954 ± 0.427
7.044GlyGly: 7.044 ± 1.124
1.742GlyHis: 1.742 ± 0.42
3.257GlyIle: 3.257 ± 0.416
4.696GlyLys: 4.696 ± 0.558
7.347GlyLeu: 7.347 ± 0.826
2.197GlyMet: 2.197 ± 0.518
2.348GlyAsn: 2.348 ± 0.48
2.197GlyPro: 2.197 ± 0.294
2.651GlyGln: 2.651 ± 0.438
4.848GlyArg: 4.848 ± 0.495
4.772GlySer: 4.772 ± 0.67
5.302GlyThr: 5.302 ± 0.598
5.681GlyVal: 5.681 ± 0.836
1.288GlyTrp: 1.288 ± 0.251
3.03GlyTyr: 3.03 ± 0.489
0.0GlyXaa: 0.0 ± 0.0
His
1.591HisAla: 1.591 ± 0.356
0.303HisCys: 0.303 ± 0.183
1.363HisAsp: 1.363 ± 0.315
1.363HisGlu: 1.363 ± 0.349
0.53HisPhe: 0.53 ± 0.179
1.894HisGly: 1.894 ± 0.443
0.454HisHis: 0.454 ± 0.16
0.682HisIle: 0.682 ± 0.257
0.606HisLys: 0.606 ± 0.235
1.136HisLeu: 1.136 ± 0.272
0.682HisMet: 0.682 ± 0.27
0.985HisAsn: 0.985 ± 0.324
1.288HisPro: 1.288 ± 0.416
0.606HisGln: 0.606 ± 0.223
1.969HisArg: 1.969 ± 0.46
1.515HisSer: 1.515 ± 0.471
0.833HisThr: 0.833 ± 0.185
1.212HisVal: 1.212 ± 0.31
0.227HisTrp: 0.227 ± 0.133
0.985HisTyr: 0.985 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
4.318IleAla: 4.318 ± 0.495
0.454IleCys: 0.454 ± 0.176
2.424IleAsp: 2.424 ± 0.374
2.651IleGlu: 2.651 ± 0.313
0.909IlePhe: 0.909 ± 0.292
3.333IleGly: 3.333 ± 0.519
1.288IleHis: 1.288 ± 0.326
2.045IleIle: 2.045 ± 0.432
2.5IleLys: 2.5 ± 0.56
3.333IleLeu: 3.333 ± 0.5
1.363IleMet: 1.363 ± 0.305
1.742IleAsn: 1.742 ± 0.441
1.742IlePro: 1.742 ± 0.305
1.894IleGln: 1.894 ± 0.348
2.272IleArg: 2.272 ± 0.409
2.424IleSer: 2.424 ± 0.394
2.727IleThr: 2.727 ± 0.454
3.106IleVal: 3.106 ± 0.52
0.303IleTrp: 0.303 ± 0.142
1.136IleTyr: 1.136 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
7.347LysAla: 7.347 ± 0.85
0.303LysCys: 0.303 ± 0.168
3.484LysAsp: 3.484 ± 0.626
4.166LysGlu: 4.166 ± 0.527
1.591LysPhe: 1.591 ± 0.313
3.787LysGly: 3.787 ± 0.518
1.363LysHis: 1.363 ± 0.342
1.515LysIle: 1.515 ± 0.381
2.878LysLys: 2.878 ± 0.61
6.06LysLeu: 6.06 ± 0.825
1.666LysMet: 1.666 ± 0.428
2.272LysAsn: 2.272 ± 0.389
3.56LysPro: 3.56 ± 0.738
2.121LysGln: 2.121 ± 0.294
3.181LysArg: 3.181 ± 0.444
2.651LysSer: 2.651 ± 0.399
2.424LysThr: 2.424 ± 0.455
3.56LysVal: 3.56 ± 0.502
1.06LysTrp: 1.06 ± 0.231
1.666LysTyr: 1.666 ± 0.347
0.0LysXaa: 0.0 ± 0.0
Leu
9.09LeuAla: 9.09 ± 0.872
1.363LeuCys: 1.363 ± 0.508
5.908LeuAsp: 5.908 ± 0.601
4.999LeuGlu: 4.999 ± 0.637
2.803LeuPhe: 2.803 ± 0.431
6.363LeuGly: 6.363 ± 0.524
1.591LeuHis: 1.591 ± 0.331
3.787LeuIle: 3.787 ± 0.595
5.529LeuLys: 5.529 ± 0.594
7.65LeuLeu: 7.65 ± 0.915
2.121LeuMet: 2.121 ± 0.358
3.636LeuAsn: 3.636 ± 0.438
4.469LeuPro: 4.469 ± 0.451
4.242LeuGln: 4.242 ± 0.576
5.757LeuArg: 5.757 ± 0.743
6.287LeuSer: 6.287 ± 0.686
5.529LeuThr: 5.529 ± 0.684
6.135LeuVal: 6.135 ± 0.874
0.909LeuTrp: 0.909 ± 0.321
3.484LeuTyr: 3.484 ± 0.657
0.0LeuXaa: 0.0 ± 0.0
Met
4.09MetAla: 4.09 ± 0.659
0.227MetCys: 0.227 ± 0.115
1.515MetAsp: 1.515 ± 0.339
1.515MetGlu: 1.515 ± 0.38
0.985MetPhe: 0.985 ± 0.331
1.515MetGly: 1.515 ± 0.344
0.682MetHis: 0.682 ± 0.234
1.136MetIle: 1.136 ± 0.213
1.666MetLys: 1.666 ± 0.377
2.803MetLeu: 2.803 ± 0.339
0.757MetMet: 0.757 ± 0.263
1.818MetAsn: 1.818 ± 0.384
1.136MetPro: 1.136 ± 0.236
1.288MetGln: 1.288 ± 0.289
1.742MetArg: 1.742 ± 0.361
1.742MetSer: 1.742 ± 0.326
1.666MetThr: 1.666 ± 0.336
1.969MetVal: 1.969 ± 0.398
0.303MetTrp: 0.303 ± 0.148
0.53MetTyr: 0.53 ± 0.185
0.0MetXaa: 0.0 ± 0.0
Asn
4.015AsnAla: 4.015 ± 0.476
0.303AsnCys: 0.303 ± 0.191
1.969AsnAsp: 1.969 ± 0.472
1.742AsnGlu: 1.742 ± 0.355
0.985AsnPhe: 0.985 ± 0.212
4.469AsnGly: 4.469 ± 0.625
0.682AsnHis: 0.682 ± 0.239
1.06AsnIle: 1.06 ± 0.247
2.272AsnLys: 2.272 ± 0.403
3.56AsnLeu: 3.56 ± 0.486
1.288AsnMet: 1.288 ± 0.388
1.136AsnAsn: 1.136 ± 0.229
1.666AsnPro: 1.666 ± 0.271
1.666AsnGln: 1.666 ± 0.393
2.651AsnArg: 2.651 ± 0.528
1.969AsnSer: 1.969 ± 0.635
2.121AsnThr: 2.121 ± 0.273
2.424AsnVal: 2.424 ± 0.361
0.53AsnTrp: 0.53 ± 0.181
1.06AsnTyr: 1.06 ± 0.333
0.0AsnXaa: 0.0 ± 0.0
Pro
4.545ProAla: 4.545 ± 0.556
0.53ProCys: 0.53 ± 0.22
2.5ProAsp: 2.5 ± 0.423
4.772ProGlu: 4.772 ± 0.684
1.06ProPhe: 1.06 ± 0.24
3.787ProGly: 3.787 ± 0.663
0.606ProHis: 0.606 ± 0.225
2.272ProIle: 2.272 ± 0.487
1.818ProLys: 1.818 ± 0.274
2.878ProLeu: 2.878 ± 0.521
1.136ProMet: 1.136 ± 0.291
1.666ProAsn: 1.666 ± 0.346
1.591ProPro: 1.591 ± 0.488
1.591ProGln: 1.591 ± 0.317
1.515ProArg: 1.515 ± 0.399
1.969ProSer: 1.969 ± 0.323
1.666ProThr: 1.666 ± 0.341
2.727ProVal: 2.727 ± 0.517
0.379ProTrp: 0.379 ± 0.208
0.985ProTyr: 0.985 ± 0.341
0.0ProXaa: 0.0 ± 0.0
Gln
6.06GlnAla: 6.06 ± 0.978
0.53GlnCys: 0.53 ± 0.205
1.591GlnAsp: 1.591 ± 0.417
2.651GlnGlu: 2.651 ± 0.508
1.666GlnPhe: 1.666 ± 0.298
3.181GlnGly: 3.181 ± 0.388
0.985GlnHis: 0.985 ± 0.274
1.818GlnIle: 1.818 ± 0.424
2.197GlnLys: 2.197 ± 0.502
5.151GlnLeu: 5.151 ± 0.637
1.591GlnMet: 1.591 ± 0.245
1.212GlnAsn: 1.212 ± 0.241
1.742GlnPro: 1.742 ± 0.383
3.106GlnGln: 3.106 ± 0.52
2.5GlnArg: 2.5 ± 0.518
1.742GlnSer: 1.742 ± 0.495
2.575GlnThr: 2.575 ± 0.508
2.727GlnVal: 2.727 ± 0.552
0.379GlnTrp: 0.379 ± 0.196
1.818GlnTyr: 1.818 ± 0.416
0.0GlnXaa: 0.0 ± 0.0
Arg
6.135ArgAla: 6.135 ± 0.895
0.757ArgCys: 0.757 ± 0.289
3.257ArgAsp: 3.257 ± 0.56
3.636ArgGlu: 3.636 ± 0.473
1.894ArgPhe: 1.894 ± 0.446
4.393ArgGly: 4.393 ± 0.566
1.591ArgHis: 1.591 ± 0.307
3.181ArgIle: 3.181 ± 0.46
3.712ArgLys: 3.712 ± 0.578
6.363ArgLeu: 6.363 ± 0.761
2.121ArgMet: 2.121 ± 0.611
2.424ArgAsn: 2.424 ± 0.386
2.045ArgPro: 2.045 ± 0.398
2.803ArgGln: 2.803 ± 0.437
3.712ArgArg: 3.712 ± 0.546
2.348ArgSer: 2.348 ± 0.476
2.575ArgThr: 2.575 ± 0.391
3.333ArgVal: 3.333 ± 0.565
1.212ArgTrp: 1.212 ± 0.396
2.045ArgTyr: 2.045 ± 0.378
0.0ArgXaa: 0.0 ± 0.0
Ser
6.06SerAla: 6.06 ± 0.585
0.682SerCys: 0.682 ± 0.219
2.121SerAsp: 2.121 ± 0.409
2.651SerGlu: 2.651 ± 0.446
1.969SerPhe: 1.969 ± 0.301
4.923SerGly: 4.923 ± 0.628
0.53SerHis: 0.53 ± 0.21
2.272SerIle: 2.272 ± 0.303
3.181SerLys: 3.181 ± 0.494
5.378SerLeu: 5.378 ± 0.678
1.894SerMet: 1.894 ± 0.328
2.045SerAsn: 2.045 ± 0.548
2.121SerPro: 2.121 ± 0.4
3.181SerGln: 3.181 ± 0.468
3.863SerArg: 3.863 ± 0.547
3.484SerSer: 3.484 ± 0.616
3.106SerThr: 3.106 ± 0.423
3.636SerVal: 3.636 ± 0.479
1.212SerTrp: 1.212 ± 0.324
1.742SerTyr: 1.742 ± 0.375
0.0SerXaa: 0.0 ± 0.0
Thr
5.075ThrAla: 5.075 ± 0.706
0.303ThrCys: 0.303 ± 0.195
3.106ThrAsp: 3.106 ± 0.405
3.106ThrGlu: 3.106 ± 0.617
1.894ThrPhe: 1.894 ± 0.333
5.378ThrGly: 5.378 ± 0.617
0.682ThrHis: 0.682 ± 0.185
2.5ThrIle: 2.5 ± 0.428
3.712ThrLys: 3.712 ± 0.468
4.923ThrLeu: 4.923 ± 0.607
1.591ThrMet: 1.591 ± 0.303
1.363ThrAsn: 1.363 ± 0.342
2.878ThrPro: 2.878 ± 0.462
1.439ThrGln: 1.439 ± 0.353
2.727ThrArg: 2.727 ± 0.393
4.015ThrSer: 4.015 ± 0.6
3.409ThrThr: 3.409 ± 0.628
4.772ThrVal: 4.772 ± 0.697
0.454ThrTrp: 0.454 ± 0.182
1.894ThrTyr: 1.894 ± 0.408
0.0ThrXaa: 0.0 ± 0.0
Val
7.65ValAla: 7.65 ± 0.652
0.909ValCys: 0.909 ± 0.281
3.409ValAsp: 3.409 ± 0.626
3.863ValGlu: 3.863 ± 0.549
2.803ValPhe: 2.803 ± 0.478
4.848ValGly: 4.848 ± 0.795
1.969ValHis: 1.969 ± 0.308
3.257ValIle: 3.257 ± 0.55
3.787ValLys: 3.787 ± 0.61
5.075ValLeu: 5.075 ± 0.528
1.591ValMet: 1.591 ± 0.329
2.954ValAsn: 2.954 ± 0.491
3.106ValPro: 3.106 ± 0.572
3.106ValGln: 3.106 ± 0.535
4.09ValArg: 4.09 ± 0.648
3.333ValSer: 3.333 ± 0.575
4.318ValThr: 4.318 ± 0.812
4.999ValVal: 4.999 ± 0.864
0.606ValTrp: 0.606 ± 0.226
1.894ValTyr: 1.894 ± 0.494
0.0ValXaa: 0.0 ± 0.0
Trp
1.439TrpAla: 1.439 ± 0.254
0.227TrpCys: 0.227 ± 0.144
1.06TrpAsp: 1.06 ± 0.285
0.757TrpGlu: 0.757 ± 0.251
1.212TrpPhe: 1.212 ± 0.246
0.682TrpGly: 0.682 ± 0.247
0.151TrpHis: 0.151 ± 0.102
0.0TrpIle: 0.0 ± 0.0
0.303TrpLys: 0.303 ± 0.128
2.121TrpLeu: 2.121 ± 0.417
0.53TrpMet: 0.53 ± 0.166
0.606TrpAsn: 0.606 ± 0.229
0.379TrpPro: 0.379 ± 0.196
0.53TrpGln: 0.53 ± 0.222
0.606TrpArg: 0.606 ± 0.184
0.606TrpSer: 0.606 ± 0.193
0.682TrpThr: 0.682 ± 0.206
0.682TrpVal: 0.682 ± 0.222
0.53TrpTrp: 0.53 ± 0.244
0.303TrpTyr: 0.303 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.727TyrAla: 2.727 ± 0.439
0.606TyrCys: 0.606 ± 0.216
2.651TyrAsp: 2.651 ± 0.5
1.666TyrGlu: 1.666 ± 0.425
0.606TyrPhe: 0.606 ± 0.185
2.272TyrGly: 2.272 ± 0.397
0.833TyrHis: 0.833 ± 0.266
1.818TyrIle: 1.818 ± 0.315
1.969TyrLys: 1.969 ± 0.368
3.257TyrLeu: 3.257 ± 0.631
0.757TyrMet: 0.757 ± 0.284
1.515TyrAsn: 1.515 ± 0.376
1.439TyrPro: 1.439 ± 0.284
1.591TyrGln: 1.591 ± 0.399
2.348TyrArg: 2.348 ± 0.386
2.5TyrSer: 2.5 ± 0.633
1.515TyrThr: 1.515 ± 0.292
2.348TyrVal: 2.348 ± 0.422
0.303TyrTrp: 0.303 ± 0.126
0.454TyrTyr: 0.454 ± 0.163
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (13203 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski