Amino acid dipepetide frequency for Shewanella phage Spp001

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.344AlaAla: 11.344 ± 1.597
1.086AlaCys: 1.086 ± 0.331
5.913AlaAsp: 5.913 ± 0.548
5.37AlaGlu: 5.37 ± 0.749
2.353AlaPhe: 2.353 ± 0.269
6.215AlaGly: 6.215 ± 0.68
1.75AlaHis: 1.75 ± 0.25
4.827AlaIle: 4.827 ± 0.653
4.887AlaLys: 4.887 ± 0.6
7.723AlaLeu: 7.723 ± 0.792
2.595AlaMet: 2.595 ± 0.401
3.62AlaAsn: 3.62 ± 0.547
4.525AlaPro: 4.525 ± 0.816
4.646AlaGln: 4.646 ± 0.594
6.698AlaArg: 6.698 ± 0.755
5.612AlaSer: 5.612 ± 0.934
7.301AlaThr: 7.301 ± 0.975
6.094AlaVal: 6.094 ± 0.609
1.207AlaTrp: 1.207 ± 0.274
3.077AlaTyr: 3.077 ± 0.467
0.0AlaXaa: 0.0 ± 0.0
Cys
0.603CysAla: 0.603 ± 0.16
0.121CysCys: 0.121 ± 0.089
0.483CysAsp: 0.483 ± 0.179
0.664CysGlu: 0.664 ± 0.181
0.483CysPhe: 0.483 ± 0.187
1.569CysGly: 1.569 ± 0.409
0.241CysHis: 0.241 ± 0.132
0.905CysIle: 0.905 ± 0.229
0.664CysLys: 0.664 ± 0.189
0.965CysLeu: 0.965 ± 0.278
0.362CysMet: 0.362 ± 0.164
0.483CysAsn: 0.483 ± 0.161
0.06CysPro: 0.06 ± 0.059
0.362CysGln: 0.362 ± 0.146
0.724CysArg: 0.724 ± 0.25
0.603CysSer: 0.603 ± 0.237
0.603CysThr: 0.603 ± 0.184
0.664CysVal: 0.664 ± 0.258
0.181CysTrp: 0.181 ± 0.093
0.483CysTyr: 0.483 ± 0.202
0.0CysXaa: 0.0 ± 0.0
Asp
6.215AspAla: 6.215 ± 0.655
0.422AspCys: 0.422 ± 0.149
3.017AspAsp: 3.017 ± 0.462
3.439AspGlu: 3.439 ± 0.423
2.534AspPhe: 2.534 ± 0.372
5.129AspGly: 5.129 ± 0.568
1.026AspHis: 1.026 ± 0.275
3.801AspIle: 3.801 ± 0.46
2.715AspLys: 2.715 ± 0.519
5.793AspLeu: 5.793 ± 0.583
1.207AspMet: 1.207 ± 0.367
2.595AspAsn: 2.595 ± 0.488
3.258AspPro: 3.258 ± 0.563
1.629AspGln: 1.629 ± 0.398
2.293AspArg: 2.293 ± 0.45
2.474AspSer: 2.474 ± 0.379
3.379AspThr: 3.379 ± 0.377
4.586AspVal: 4.586 ± 0.496
1.086AspTrp: 1.086 ± 0.266
1.81AspTyr: 1.81 ± 0.251
0.0AspXaa: 0.0 ± 0.0
Glu
6.275GluAla: 6.275 ± 0.766
0.241GluCys: 0.241 ± 0.141
2.715GluAsp: 2.715 ± 0.392
4.646GluGlu: 4.646 ± 0.668
1.81GluPhe: 1.81 ± 0.29
3.741GluGly: 3.741 ± 0.453
1.267GluHis: 1.267 ± 0.27
3.982GluIle: 3.982 ± 0.475
2.655GluLys: 2.655 ± 0.544
5.37GluLeu: 5.37 ± 0.604
2.052GluMet: 2.052 ± 0.417
1.931GluAsn: 1.931 ± 0.316
2.836GluPro: 2.836 ± 0.445
1.81GluGln: 1.81 ± 0.347
3.077GluArg: 3.077 ± 0.427
3.319GluSer: 3.319 ± 0.511
4.103GluThr: 4.103 ± 0.601
4.525GluVal: 4.525 ± 0.616
0.664GluTrp: 0.664 ± 0.202
2.776GluTyr: 2.776 ± 0.408
0.0GluXaa: 0.0 ± 0.0
Phe
2.172PheAla: 2.172 ± 0.442
0.905PheCys: 0.905 ± 0.247
2.474PheAsp: 2.474 ± 0.361
2.655PheGlu: 2.655 ± 0.407
1.086PhePhe: 1.086 ± 0.261
2.595PheGly: 2.595 ± 0.497
0.784PheHis: 0.784 ± 0.224
1.448PheIle: 1.448 ± 0.398
1.689PheLys: 1.689 ± 0.322
2.293PheLeu: 2.293 ± 0.335
1.388PheMet: 1.388 ± 0.358
2.233PheAsn: 2.233 ± 0.413
1.207PhePro: 1.207 ± 0.271
1.327PheGln: 1.327 ± 0.241
1.388PheArg: 1.388 ± 0.331
2.052PheSer: 2.052 ± 0.31
2.474PheThr: 2.474 ± 0.3
2.172PheVal: 2.172 ± 0.381
0.483PheTrp: 0.483 ± 0.184
0.784PheTyr: 0.784 ± 0.179
0.0PheXaa: 0.0 ± 0.0
Gly
5.612GlyAla: 5.612 ± 0.954
0.905GlyCys: 0.905 ± 0.221
4.948GlyAsp: 4.948 ± 0.534
4.103GlyGlu: 4.103 ± 0.504
2.655GlyPhe: 2.655 ± 0.413
4.586GlyGly: 4.586 ± 0.597
0.905GlyHis: 0.905 ± 0.23
4.344GlyIle: 4.344 ± 0.645
4.767GlyLys: 4.767 ± 0.524
6.517GlyLeu: 6.517 ± 0.56
1.871GlyMet: 1.871 ± 0.333
3.198GlyAsn: 3.198 ± 0.727
0.905GlyPro: 0.905 ± 0.26
2.896GlyGln: 2.896 ± 0.519
3.258GlyArg: 3.258 ± 0.518
3.801GlySer: 3.801 ± 0.579
5.189GlyThr: 5.189 ± 0.894
5.491GlyVal: 5.491 ± 0.613
0.965GlyTrp: 0.965 ± 0.252
1.991GlyTyr: 1.991 ± 0.285
0.0GlyXaa: 0.0 ± 0.0
His
1.991HisAla: 1.991 ± 0.325
0.603HisCys: 0.603 ± 0.165
1.569HisAsp: 1.569 ± 0.28
1.146HisGlu: 1.146 ± 0.239
0.965HisPhe: 0.965 ± 0.308
2.052HisGly: 2.052 ± 0.363
0.362HisHis: 0.362 ± 0.182
1.448HisIle: 1.448 ± 0.371
1.146HisLys: 1.146 ± 0.243
1.931HisLeu: 1.931 ± 0.347
0.784HisMet: 0.784 ± 0.24
0.965HisAsn: 0.965 ± 0.248
1.267HisPro: 1.267 ± 0.361
0.784HisGln: 0.784 ± 0.227
0.664HisArg: 0.664 ± 0.183
1.388HisSer: 1.388 ± 0.229
1.931HisThr: 1.931 ± 0.699
1.629HisVal: 1.629 ± 0.292
0.181HisTrp: 0.181 ± 0.106
1.267HisTyr: 1.267 ± 0.33
0.0HisXaa: 0.0 ± 0.0
Ile
6.034IleAla: 6.034 ± 0.637
0.603IleCys: 0.603 ± 0.207
4.465IleAsp: 4.465 ± 0.473
2.534IleGlu: 2.534 ± 0.394
1.086IlePhe: 1.086 ± 0.365
3.319IleGly: 3.319 ± 0.675
0.965IleHis: 0.965 ± 0.287
3.138IleIle: 3.138 ± 0.595
2.655IleLys: 2.655 ± 0.447
3.138IleLeu: 3.138 ± 0.448
1.026IleMet: 1.026 ± 0.277
2.414IleAsn: 2.414 ± 0.384
3.138IlePro: 3.138 ± 0.479
2.353IleGln: 2.353 ± 0.354
3.258IleArg: 3.258 ± 0.548
3.258IleSer: 3.258 ± 0.405
4.646IleThr: 4.646 ± 0.648
3.741IleVal: 3.741 ± 0.525
0.543IleTrp: 0.543 ± 0.197
1.207IleTyr: 1.207 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
4.284LysAla: 4.284 ± 0.544
0.603LysCys: 0.603 ± 0.2
3.741LysAsp: 3.741 ± 0.499
2.293LysGlu: 2.293 ± 0.409
1.871LysPhe: 1.871 ± 0.34
2.233LysGly: 2.233 ± 0.368
1.75LysHis: 1.75 ± 0.242
2.715LysIle: 2.715 ± 0.431
2.595LysLys: 2.595 ± 0.412
4.767LysLeu: 4.767 ± 0.653
1.81LysMet: 1.81 ± 0.37
1.508LysAsn: 1.508 ± 0.248
2.414LysPro: 2.414 ± 0.372
2.595LysGln: 2.595 ± 0.408
3.198LysArg: 3.198 ± 0.565
2.776LysSer: 2.776 ± 0.347
3.741LysThr: 3.741 ± 0.474
4.586LysVal: 4.586 ± 0.513
0.664LysTrp: 0.664 ± 0.238
1.508LysTyr: 1.508 ± 0.309
0.0LysXaa: 0.0 ± 0.0
Leu
6.939LeuAla: 6.939 ± 0.865
1.026LeuCys: 1.026 ± 0.275
4.284LeuAsp: 4.284 ± 0.439
6.456LeuGlu: 6.456 ± 0.788
2.595LeuPhe: 2.595 ± 0.406
4.284LeuGly: 4.284 ± 0.475
1.81LeuHis: 1.81 ± 0.341
3.862LeuIle: 3.862 ± 0.46
4.284LeuLys: 4.284 ± 0.519
5.37LeuLeu: 5.37 ± 0.697
1.871LeuMet: 1.871 ± 0.377
3.922LeuAsn: 3.922 ± 0.476
4.103LeuPro: 4.103 ± 0.47
3.741LeuGln: 3.741 ± 0.514
6.456LeuArg: 6.456 ± 0.652
5.068LeuSer: 5.068 ± 0.481
5.31LeuThr: 5.31 ± 0.604
5.974LeuVal: 5.974 ± 0.665
1.508LeuTrp: 1.508 ± 0.295
1.871LeuTyr: 1.871 ± 0.394
0.0LeuXaa: 0.0 ± 0.0
Met
2.595MetAla: 2.595 ± 0.411
0.241MetCys: 0.241 ± 0.137
1.026MetAsp: 1.026 ± 0.236
1.207MetGlu: 1.207 ± 0.292
0.845MetPhe: 0.845 ± 0.223
1.207MetGly: 1.207 ± 0.234
0.603MetHis: 0.603 ± 0.208
1.267MetIle: 1.267 ± 0.239
1.81MetLys: 1.81 ± 0.295
2.172MetLeu: 2.172 ± 0.371
0.724MetMet: 0.724 ± 0.226
0.784MetAsn: 0.784 ± 0.223
1.146MetPro: 1.146 ± 0.257
1.327MetGln: 1.327 ± 0.349
1.629MetArg: 1.629 ± 0.346
2.293MetSer: 2.293 ± 0.377
1.508MetThr: 1.508 ± 0.302
1.026MetVal: 1.026 ± 0.268
0.422MetTrp: 0.422 ± 0.167
1.146MetTyr: 1.146 ± 0.243
0.0MetXaa: 0.0 ± 0.0
Asn
5.129AsnAla: 5.129 ± 0.599
0.483AsnCys: 0.483 ± 0.178
1.75AsnAsp: 1.75 ± 0.281
1.871AsnGlu: 1.871 ± 0.415
1.569AsnPhe: 1.569 ± 0.267
4.284AsnGly: 4.284 ± 0.557
1.207AsnHis: 1.207 ± 0.251
1.931AsnIle: 1.931 ± 0.344
2.052AsnLys: 2.052 ± 0.398
2.534AsnLeu: 2.534 ± 0.417
0.784AsnMet: 0.784 ± 0.205
1.508AsnAsn: 1.508 ± 0.272
2.414AsnPro: 2.414 ± 0.387
1.569AsnGln: 1.569 ± 0.267
1.931AsnArg: 1.931 ± 0.311
2.172AsnSer: 2.172 ± 0.399
2.474AsnThr: 2.474 ± 0.409
3.379AsnVal: 3.379 ± 0.446
0.664AsnTrp: 0.664 ± 0.3
1.207AsnTyr: 1.207 ± 0.222
0.0AsnXaa: 0.0 ± 0.0
Pro
5.25ProAla: 5.25 ± 0.994
0.362ProCys: 0.362 ± 0.179
3.077ProAsp: 3.077 ± 0.45
3.198ProGlu: 3.198 ± 0.412
1.207ProPhe: 1.207 ± 0.276
1.448ProGly: 1.448 ± 0.35
1.388ProHis: 1.388 ± 0.354
2.474ProIle: 2.474 ± 0.409
2.414ProLys: 2.414 ± 0.363
3.862ProLeu: 3.862 ± 0.491
0.965ProMet: 0.965 ± 0.241
1.75ProAsn: 1.75 ± 0.337
1.388ProPro: 1.388 ± 0.348
1.448ProGln: 1.448 ± 0.287
2.414ProArg: 2.414 ± 0.377
3.258ProSer: 3.258 ± 0.426
3.801ProThr: 3.801 ± 0.446
4.224ProVal: 4.224 ± 0.557
0.664ProTrp: 0.664 ± 0.387
1.569ProTyr: 1.569 ± 0.38
0.0ProXaa: 0.0 ± 0.0
Gln
3.982GlnAla: 3.982 ± 0.59
0.362GlnCys: 0.362 ± 0.14
1.75GlnAsp: 1.75 ± 0.302
1.991GlnGlu: 1.991 ± 0.386
2.112GlnPhe: 2.112 ± 0.378
2.474GlnGly: 2.474 ± 0.426
1.086GlnHis: 1.086 ± 0.265
2.052GlnIle: 2.052 ± 0.406
1.448GlnLys: 1.448 ± 0.297
3.62GlnLeu: 3.62 ± 0.469
1.086GlnMet: 1.086 ± 0.22
1.689GlnAsn: 1.689 ± 0.315
1.931GlnPro: 1.931 ± 0.374
2.353GlnGln: 2.353 ± 0.459
2.534GlnArg: 2.534 ± 0.374
1.81GlnSer: 1.81 ± 0.27
2.112GlnThr: 2.112 ± 0.337
3.801GlnVal: 3.801 ± 0.407
1.267GlnTrp: 1.267 ± 0.292
1.81GlnTyr: 1.81 ± 0.318
0.0GlnXaa: 0.0 ± 0.0
Arg
4.586ArgAla: 4.586 ± 0.542
0.784ArgCys: 0.784 ± 0.234
3.62ArgAsp: 3.62 ± 0.543
3.62ArgGlu: 3.62 ± 0.532
1.991ArgPhe: 1.991 ± 0.428
4.163ArgGly: 4.163 ± 0.607
1.388ArgHis: 1.388 ± 0.332
3.62ArgIle: 3.62 ± 0.558
3.017ArgLys: 3.017 ± 0.334
4.344ArgLeu: 4.344 ± 0.619
1.569ArgMet: 1.569 ± 0.247
2.776ArgAsn: 2.776 ± 0.438
1.569ArgPro: 1.569 ± 0.342
2.595ArgGln: 2.595 ± 0.365
3.319ArgArg: 3.319 ± 0.5
3.258ArgSer: 3.258 ± 0.415
3.801ArgThr: 3.801 ± 0.378
4.344ArgVal: 4.344 ± 0.645
1.629ArgTrp: 1.629 ± 0.616
1.689ArgTyr: 1.689 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
5.612SerAla: 5.612 ± 0.503
0.845SerCys: 0.845 ± 0.192
3.439SerAsp: 3.439 ± 0.537
3.138SerGlu: 3.138 ± 0.354
2.172SerPhe: 2.172 ± 0.372
5.491SerGly: 5.491 ± 0.762
2.293SerHis: 2.293 ± 0.466
2.715SerIle: 2.715 ± 0.473
2.293SerLys: 2.293 ± 0.501
5.189SerLeu: 5.189 ± 0.578
1.086SerMet: 1.086 ± 0.174
2.112SerAsn: 2.112 ± 0.399
2.715SerPro: 2.715 ± 0.392
2.293SerGln: 2.293 ± 0.292
2.776SerArg: 2.776 ± 0.48
3.862SerSer: 3.862 ± 0.564
4.586SerThr: 4.586 ± 0.365
3.56SerVal: 3.56 ± 0.503
0.905SerTrp: 0.905 ± 0.229
2.353SerTyr: 2.353 ± 0.5
0.0SerXaa: 0.0 ± 0.0
Thr
6.396ThrAla: 6.396 ± 0.876
0.483ThrCys: 0.483 ± 0.168
3.319ThrAsp: 3.319 ± 0.326
4.163ThrGlu: 4.163 ± 0.582
2.414ThrPhe: 2.414 ± 0.387
5.672ThrGly: 5.672 ± 1.097
2.172ThrHis: 2.172 ± 0.576
3.258ThrIle: 3.258 ± 0.435
3.741ThrLys: 3.741 ± 0.444
5.25ThrLeu: 5.25 ± 0.545
0.845ThrMet: 0.845 ± 0.201
2.776ThrAsn: 2.776 ± 0.409
4.948ThrPro: 4.948 ± 0.529
2.474ThrGln: 2.474 ± 0.513
4.525ThrArg: 4.525 ± 0.674
3.862ThrSer: 3.862 ± 0.54
4.586ThrThr: 4.586 ± 0.639
5.25ThrVal: 5.25 ± 0.619
1.026ThrTrp: 1.026 ± 0.342
2.414ThrTyr: 2.414 ± 0.422
0.0ThrXaa: 0.0 ± 0.0
Val
7.482ValAla: 7.482 ± 0.883
0.603ValCys: 0.603 ± 0.21
3.862ValAsp: 3.862 ± 0.433
4.224ValGlu: 4.224 ± 0.617
2.534ValPhe: 2.534 ± 0.428
5.31ValGly: 5.31 ± 0.833
1.448ValHis: 1.448 ± 0.272
3.379ValIle: 3.379 ± 0.459
4.163ValLys: 4.163 ± 0.592
5.732ValLeu: 5.732 ± 0.619
1.689ValMet: 1.689 ± 0.326
2.715ValAsn: 2.715 ± 0.498
3.62ValPro: 3.62 ± 0.559
3.741ValGln: 3.741 ± 0.456
4.887ValArg: 4.887 ± 0.538
4.646ValSer: 4.646 ± 0.614
5.25ValThr: 5.25 ± 0.808
5.612ValVal: 5.612 ± 0.657
1.146ValTrp: 1.146 ± 0.269
1.388ValTyr: 1.388 ± 0.251
0.0ValXaa: 0.0 ± 0.0
Trp
1.629TrpAla: 1.629 ± 0.357
0.181TrpCys: 0.181 ± 0.089
0.784TrpAsp: 0.784 ± 0.273
0.603TrpGlu: 0.603 ± 0.177
0.362TrpPhe: 0.362 ± 0.177
1.026TrpGly: 1.026 ± 0.269
0.241TrpHis: 0.241 ± 0.104
0.965TrpIle: 0.965 ± 0.224
0.905TrpLys: 0.905 ± 0.257
1.388TrpLeu: 1.388 ± 0.318
0.422TrpMet: 0.422 ± 0.163
0.664TrpAsn: 0.664 ± 0.253
1.026TrpPro: 1.026 ± 0.618
0.603TrpGln: 0.603 ± 0.171
0.845TrpArg: 0.845 ± 0.197
0.965TrpSer: 0.965 ± 0.415
0.664TrpThr: 0.664 ± 0.226
1.207TrpVal: 1.207 ± 0.24
0.121TrpTrp: 0.121 ± 0.084
0.784TrpTyr: 0.784 ± 0.221
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.715TyrAla: 2.715 ± 0.53
0.362TyrCys: 0.362 ± 0.127
2.172TyrAsp: 2.172 ± 0.37
2.534TyrGlu: 2.534 ± 0.447
0.965TyrPhe: 0.965 ± 0.194
2.112TyrGly: 2.112 ± 0.423
1.388TyrHis: 1.388 ± 0.253
1.388TyrIle: 1.388 ± 0.276
1.75TyrLys: 1.75 ± 0.284
2.776TyrLeu: 2.776 ± 0.485
0.784TyrMet: 0.784 ± 0.23
1.267TyrAsn: 1.267 ± 0.299
1.689TyrPro: 1.689 ± 0.335
0.724TyrGln: 0.724 ± 0.159
1.81TyrArg: 1.81 ± 0.311
2.957TyrSer: 2.957 ± 0.503
2.112TyrThr: 2.112 ± 0.33
1.569TyrVal: 1.569 ± 0.277
0.06TyrTrp: 0.06 ± 0.062
1.267TyrTyr: 1.267 ± 0.354
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (16574 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski