Amino acid dipepetide frequency for Nocardia phage NTR1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.653AlaAla: 17.653 ± 1.694
0.455AlaCys: 0.455 ± 0.175
7.739AlaAsp: 7.739 ± 0.569
8.649AlaGlu: 8.649 ± 0.771
3.085AlaPhe: 3.085 ± 0.507
9.004AlaGly: 9.004 ± 0.792
2.479AlaHis: 2.479 ± 0.389
5.109AlaIle: 5.109 ± 1.011
3.187AlaLys: 3.187 ± 0.427
10.066AlaLeu: 10.066 ± 0.923
2.883AlaMet: 2.883 ± 0.405
2.883AlaAsn: 2.883 ± 0.53
7.081AlaPro: 7.081 ± 0.624
4.502AlaGln: 4.502 ± 0.834
7.436AlaArg: 7.436 ± 0.672
5.969AlaSer: 5.969 ± 0.647
6.576AlaThr: 6.576 ± 0.536
10.824AlaVal: 10.824 ± 0.894
2.175AlaTrp: 2.175 ± 0.281
2.428AlaTyr: 2.428 ± 0.321
0.0AlaXaa: 0.0 ± 0.0
Cys
1.214CysAla: 1.214 ± 0.267
0.152CysCys: 0.152 ± 0.087
1.113CysAsp: 1.113 ± 0.277
0.658CysGlu: 0.658 ± 0.231
0.101CysPhe: 0.101 ± 0.068
1.77CysGly: 1.77 ± 0.436
0.303CysHis: 0.303 ± 0.135
0.405CysIle: 0.405 ± 0.184
0.152CysLys: 0.152 ± 0.106
0.556CysLeu: 0.556 ± 0.198
0.202CysMet: 0.202 ± 0.118
0.303CysAsn: 0.303 ± 0.148
0.759CysPro: 0.759 ± 0.247
0.152CysGln: 0.152 ± 0.098
1.062CysArg: 1.062 ± 0.25
0.708CysSer: 0.708 ± 0.153
0.506CysThr: 0.506 ± 0.164
1.012CysVal: 1.012 ± 0.261
0.202CysTrp: 0.202 ± 0.106
0.051CysTyr: 0.051 ± 0.057
0.0CysXaa: 0.0 ± 0.0
Asp
7.739AspAla: 7.739 ± 0.587
0.809AspCys: 0.809 ± 0.301
5.008AspAsp: 5.008 ± 0.741
3.945AspGlu: 3.945 ± 0.564
2.074AspPhe: 2.074 ± 0.272
4.704AspGly: 4.704 ± 0.517
1.163AspHis: 1.163 ± 0.358
3.338AspIle: 3.338 ± 0.451
1.163AspLys: 1.163 ± 0.215
5.867AspLeu: 5.867 ± 0.461
1.77AspMet: 1.77 ± 0.312
1.669AspAsn: 1.669 ± 0.318
4.704AspPro: 4.704 ± 0.526
2.327AspGln: 2.327 ± 0.346
3.187AspArg: 3.187 ± 0.477
2.681AspSer: 2.681 ± 0.338
4.097AspThr: 4.097 ± 0.47
3.945AspVal: 3.945 ± 0.375
1.821AspTrp: 1.821 ± 0.291
1.467AspTyr: 1.467 ± 0.291
0.0AspXaa: 0.0 ± 0.0
Glu
7.486GluAla: 7.486 ± 0.672
1.062GluCys: 1.062 ± 0.245
4.148GluAsp: 4.148 ± 0.422
5.362GluGlu: 5.362 ± 0.738
1.973GluPhe: 1.973 ± 0.279
4.552GluGly: 4.552 ± 0.462
1.72GluHis: 1.72 ± 0.262
2.428GluIle: 2.428 ± 0.312
1.821GluLys: 1.821 ± 0.346
6.677GluLeu: 6.677 ± 0.661
1.669GluMet: 1.669 ± 0.296
1.77GluAsn: 1.77 ± 0.328
3.085GluPro: 3.085 ± 0.488
2.58GluGln: 2.58 ± 0.448
4.299GluArg: 4.299 ± 0.478
3.743GluSer: 3.743 ± 0.401
3.743GluThr: 3.743 ± 0.484
4.299GluVal: 4.299 ± 0.564
1.669GluTrp: 1.669 ± 0.328
1.72GluTyr: 1.72 ± 0.314
0.0GluXaa: 0.0 ± 0.0
Phe
2.833PheAla: 2.833 ± 0.467
0.506PheCys: 0.506 ± 0.173
2.124PheAsp: 2.124 ± 0.306
1.669PheGlu: 1.669 ± 0.27
0.809PhePhe: 0.809 ± 0.213
2.276PheGly: 2.276 ± 0.351
0.202PheHis: 0.202 ± 0.084
1.012PheIle: 1.012 ± 0.25
0.405PheLys: 0.405 ± 0.134
2.023PheLeu: 2.023 ± 0.274
0.809PheMet: 0.809 ± 0.2
1.113PheAsn: 1.113 ± 0.198
1.872PhePro: 1.872 ± 0.305
1.163PheGln: 1.163 ± 0.272
2.023PheArg: 2.023 ± 0.288
1.669PheSer: 1.669 ± 0.285
2.58PheThr: 2.58 ± 0.363
2.023PheVal: 2.023 ± 0.309
0.556PheTrp: 0.556 ± 0.18
0.809PheTyr: 0.809 ± 0.182
0.0PheXaa: 0.0 ± 0.0
Gly
8.245GlyAla: 8.245 ± 0.838
1.265GlyCys: 1.265 ± 0.325
4.502GlyAsp: 4.502 ± 0.483
4.198GlyGlu: 4.198 ± 0.472
2.58GlyPhe: 2.58 ± 0.377
8.042GlyGly: 8.042 ± 0.704
1.416GlyHis: 1.416 ± 0.303
4.097GlyIle: 4.097 ± 0.443
2.833GlyLys: 2.833 ± 0.405
6.576GlyLeu: 6.576 ± 0.683
1.77GlyMet: 1.77 ± 0.277
2.833GlyAsn: 2.833 ± 0.345
3.338GlyPro: 3.338 ± 0.45
2.58GlyGln: 2.58 ± 0.298
5.615GlyArg: 5.615 ± 0.611
5.109GlySer: 5.109 ± 0.523
7.537GlyThr: 7.537 ± 0.75
6.12GlyVal: 6.12 ± 0.651
2.276GlyTrp: 2.276 ± 0.478
2.782GlyTyr: 2.782 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
1.619HisAla: 1.619 ± 0.286
0.152HisCys: 0.152 ± 0.087
1.012HisAsp: 1.012 ± 0.216
1.214HisGlu: 1.214 ± 0.261
0.708HisPhe: 0.708 ± 0.196
1.467HisGly: 1.467 ± 0.317
0.759HisHis: 0.759 ± 0.195
0.809HisIle: 0.809 ± 0.184
0.405HisLys: 0.405 ± 0.166
1.77HisLeu: 1.77 ± 0.292
0.303HisMet: 0.303 ± 0.138
0.506HisAsn: 0.506 ± 0.165
1.416HisPro: 1.416 ± 0.331
0.506HisGln: 0.506 ± 0.132
1.517HisArg: 1.517 ± 0.312
0.759HisSer: 0.759 ± 0.227
2.124HisThr: 2.124 ± 0.343
1.366HisVal: 1.366 ± 0.232
0.354HisTrp: 0.354 ± 0.159
0.607HisTyr: 0.607 ± 0.168
0.0HisXaa: 0.0 ± 0.0
Ile
6.07IleAla: 6.07 ± 0.569
0.354IleCys: 0.354 ± 0.16
3.642IleAsp: 3.642 ± 0.435
3.996IleGlu: 3.996 ± 0.414
0.759IlePhe: 0.759 ± 0.177
4.401IleGly: 4.401 ± 0.752
0.658IleHis: 0.658 ± 0.183
2.124IleIle: 2.124 ± 0.309
1.366IleLys: 1.366 ± 0.349
1.922IleLeu: 1.922 ± 0.287
0.961IleMet: 0.961 ± 0.206
1.416IleAsn: 1.416 ± 0.274
2.833IlePro: 2.833 ± 0.432
2.023IleGln: 2.023 ± 0.327
2.984IleArg: 2.984 ± 0.355
2.074IleSer: 2.074 ± 0.259
3.49IleThr: 3.49 ± 0.456
3.237IleVal: 3.237 ± 0.354
0.556IleTrp: 0.556 ± 0.19
1.012IleTyr: 1.012 ± 0.316
0.0IleXaa: 0.0 ± 0.0
Lys
3.895LysAla: 3.895 ± 0.658
0.152LysCys: 0.152 ± 0.107
1.012LysAsp: 1.012 ± 0.224
1.163LysGlu: 1.163 ± 0.31
0.455LysPhe: 0.455 ± 0.136
2.023LysGly: 2.023 ± 0.289
0.354LysHis: 0.354 ± 0.123
1.012LysIle: 1.012 ± 0.272
0.91LysLys: 0.91 ± 0.205
2.124LysLeu: 2.124 ± 0.296
0.202LysMet: 0.202 ± 0.106
1.214LysAsn: 1.214 ± 0.32
1.568LysPro: 1.568 ± 0.268
0.658LysGln: 0.658 ± 0.164
1.214LysArg: 1.214 ± 0.263
1.467LysSer: 1.467 ± 0.348
1.872LysThr: 1.872 ± 0.302
3.187LysVal: 3.187 ± 0.397
0.708LysTrp: 0.708 ± 0.244
0.91LysTyr: 0.91 ± 0.24
0.0LysXaa: 0.0 ± 0.0
Leu
10.218LeuAla: 10.218 ± 1.029
0.809LeuCys: 0.809 ± 0.183
6.171LeuAsp: 6.171 ± 0.575
4.603LeuGlu: 4.603 ± 0.458
1.821LeuPhe: 1.821 ± 0.321
6.525LeuGly: 6.525 ± 0.545
1.366LeuHis: 1.366 ± 0.282
4.906LeuIle: 4.906 ± 0.588
1.872LeuLys: 1.872 ± 0.407
6.12LeuLeu: 6.12 ± 0.622
1.265LeuMet: 1.265 ± 0.24
2.023LeuAsn: 2.023 ± 0.357
4.957LeuPro: 4.957 ± 0.5
2.276LeuGln: 2.276 ± 0.294
6.93LeuArg: 6.93 ± 0.634
4.047LeuSer: 4.047 ± 0.358
6.879LeuThr: 6.879 ± 0.566
5.918LeuVal: 5.918 ± 0.653
1.315LeuTrp: 1.315 ± 0.303
2.276LeuTyr: 2.276 ± 0.32
0.0LeuXaa: 0.0 ± 0.0
Met
2.681MetAla: 2.681 ± 0.332
0.405MetCys: 0.405 ± 0.16
1.517MetAsp: 1.517 ± 0.234
1.062MetGlu: 1.062 ± 0.201
0.708MetPhe: 0.708 ± 0.205
1.265MetGly: 1.265 ± 0.26
0.658MetHis: 0.658 ± 0.157
0.91MetIle: 0.91 ± 0.201
0.556MetLys: 0.556 ± 0.168
1.72MetLeu: 1.72 ± 0.293
0.354MetMet: 0.354 ± 0.131
0.658MetAsn: 0.658 ± 0.173
1.517MetPro: 1.517 ± 0.32
0.455MetGln: 0.455 ± 0.156
0.961MetArg: 0.961 ± 0.221
2.124MetSer: 2.124 ± 0.351
2.529MetThr: 2.529 ± 0.376
1.113MetVal: 1.113 ± 0.214
0.253MetTrp: 0.253 ± 0.109
0.556MetTyr: 0.556 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
3.945AsnAla: 3.945 ± 0.617
0.506AsnCys: 0.506 ± 0.164
1.366AsnAsp: 1.366 ± 0.282
1.517AsnGlu: 1.517 ± 0.257
0.607AsnPhe: 0.607 ± 0.185
2.984AsnGly: 2.984 ± 0.407
0.708AsnHis: 0.708 ± 0.176
1.113AsnIle: 1.113 ± 0.271
0.405AsnLys: 0.405 ± 0.126
3.288AsnLeu: 3.288 ± 0.65
0.455AsnMet: 0.455 ± 0.124
0.86AsnAsn: 0.86 ± 0.225
1.973AsnPro: 1.973 ± 0.294
1.012AsnGln: 1.012 ± 0.282
1.619AsnArg: 1.619 ± 0.299
1.315AsnSer: 1.315 ± 0.248
2.377AsnThr: 2.377 ± 0.371
2.63AsnVal: 2.63 ± 0.375
0.809AsnTrp: 0.809 ± 0.186
0.405AsnTyr: 0.405 ± 0.136
0.0AsnXaa: 0.0 ± 0.0
Pro
6.222ProAla: 6.222 ± 0.662
0.759ProCys: 0.759 ± 0.231
3.895ProAsp: 3.895 ± 0.489
5.412ProGlu: 5.412 ± 0.548
1.922ProPhe: 1.922 ± 0.361
5.362ProGly: 5.362 ± 0.571
0.86ProHis: 0.86 ± 0.225
2.58ProIle: 2.58 ± 0.367
1.669ProLys: 1.669 ± 0.304
4.249ProLeu: 4.249 ± 0.473
1.113ProMet: 1.113 ± 0.241
1.72ProAsn: 1.72 ± 0.323
4.148ProPro: 4.148 ± 0.553
1.568ProGln: 1.568 ± 0.274
2.681ProArg: 2.681 ± 0.425
2.529ProSer: 2.529 ± 0.373
3.541ProThr: 3.541 ± 0.508
4.906ProVal: 4.906 ± 0.481
0.961ProTrp: 0.961 ± 0.212
1.568ProTyr: 1.568 ± 0.36
0.0ProXaa: 0.0 ± 0.0
Gln
3.895GlnAla: 3.895 ± 0.499
0.202GlnCys: 0.202 ± 0.114
1.821GlnAsp: 1.821 ± 0.298
1.872GlnGlu: 1.872 ± 0.276
1.012GlnPhe: 1.012 ± 0.197
2.276GlnGly: 2.276 ± 0.35
0.405GlnHis: 0.405 ± 0.142
1.619GlnIle: 1.619 ± 0.314
1.062GlnLys: 1.062 ± 0.275
3.844GlnLeu: 3.844 ± 0.461
1.113GlnMet: 1.113 ± 0.236
1.113GlnAsn: 1.113 ± 0.242
1.366GlnPro: 1.366 ± 0.292
1.416GlnGln: 1.416 ± 0.306
2.175GlnArg: 2.175 ± 0.339
1.72GlnSer: 1.72 ± 0.286
2.327GlnThr: 2.327 ± 0.29
2.731GlnVal: 2.731 ± 0.397
0.91GlnTrp: 0.91 ± 0.178
0.556GlnTyr: 0.556 ± 0.183
0.0GlnXaa: 0.0 ± 0.0
Arg
6.879ArgAla: 6.879 ± 0.538
0.658ArgCys: 0.658 ± 0.192
3.237ArgAsp: 3.237 ± 0.451
4.552ArgGlu: 4.552 ± 0.477
1.872ArgPhe: 1.872 ± 0.329
4.502ArgGly: 4.502 ± 0.474
1.315ArgHis: 1.315 ± 0.278
2.681ArgIle: 2.681 ± 0.35
1.922ArgLys: 1.922 ± 0.349
6.019ArgLeu: 6.019 ± 0.619
2.124ArgMet: 2.124 ± 0.279
2.226ArgAsn: 2.226 ± 0.382
3.49ArgPro: 3.49 ± 0.594
2.782ArgGln: 2.782 ± 0.496
5.159ArgArg: 5.159 ± 0.537
2.833ArgSer: 2.833 ± 0.321
4.299ArgThr: 4.299 ± 0.524
4.856ArgVal: 4.856 ± 0.522
1.669ArgTrp: 1.669 ± 0.322
1.922ArgTyr: 1.922 ± 0.376
0.0ArgXaa: 0.0 ± 0.0
Ser
6.879SerAla: 6.879 ± 0.548
0.708SerCys: 0.708 ± 0.255
2.833SerAsp: 2.833 ± 0.336
2.883SerGlu: 2.883 ± 0.371
2.124SerPhe: 2.124 ± 0.344
5.362SerGly: 5.362 ± 0.615
1.113SerHis: 1.113 ± 0.274
2.327SerIle: 2.327 ± 0.36
1.113SerLys: 1.113 ± 0.263
3.49SerLeu: 3.49 ± 0.493
1.366SerMet: 1.366 ± 0.344
1.517SerAsn: 1.517 ± 0.296
2.934SerPro: 2.934 ± 0.42
1.619SerGln: 1.619 ± 0.327
3.389SerArg: 3.389 ± 0.407
2.276SerSer: 2.276 ± 0.366
3.085SerThr: 3.085 ± 0.4
3.541SerVal: 3.541 ± 0.467
1.113SerTrp: 1.113 ± 0.233
1.265SerTyr: 1.265 ± 0.294
0.0SerXaa: 0.0 ± 0.0
Thr
8.801ThrAla: 8.801 ± 0.897
1.113ThrCys: 1.113 ± 0.288
4.704ThrAsp: 4.704 ± 0.411
5.109ThrGlu: 5.109 ± 0.519
2.479ThrPhe: 2.479 ± 0.608
6.829ThrGly: 6.829 ± 0.635
1.113ThrHis: 1.113 ± 0.232
3.035ThrIle: 3.035 ± 0.418
1.821ThrLys: 1.821 ± 0.317
4.957ThrLeu: 4.957 ± 0.646
1.163ThrMet: 1.163 ± 0.224
2.175ThrAsn: 2.175 ± 0.45
4.856ThrPro: 4.856 ± 0.526
1.821ThrGln: 1.821 ± 0.372
4.148ThrArg: 4.148 ± 0.521
3.389ThrSer: 3.389 ± 0.461
4.047ThrThr: 4.047 ± 0.486
5.716ThrVal: 5.716 ± 0.742
1.366ThrTrp: 1.366 ± 0.241
1.973ThrTyr: 1.973 ± 0.375
0.0ThrXaa: 0.0 ± 0.0
Val
8.953ValAla: 8.953 ± 0.773
0.809ValCys: 0.809 ± 0.206
4.249ValAsp: 4.249 ± 0.441
5.109ValGlu: 5.109 ± 0.615
1.973ValPhe: 1.973 ± 0.284
6.879ValGly: 6.879 ± 0.693
2.023ValHis: 2.023 ± 0.371
4.805ValIle: 4.805 ± 0.426
2.681ValLys: 2.681 ± 0.336
7.031ValLeu: 7.031 ± 0.534
1.821ValMet: 1.821 ± 0.331
2.377ValAsn: 2.377 ± 0.372
3.44ValPro: 3.44 ± 0.438
2.074ValGln: 2.074 ± 0.272
4.552ValArg: 4.552 ± 0.55
4.401ValSer: 4.401 ± 0.558
5.26ValThr: 5.26 ± 0.591
6.525ValVal: 6.525 ± 0.611
1.467ValTrp: 1.467 ± 0.273
1.467ValTyr: 1.467 ± 0.268
0.0ValXaa: 0.0 ± 0.0
Trp
2.226TrpAla: 2.226 ± 0.296
0.455TrpCys: 0.455 ± 0.177
1.163TrpAsp: 1.163 ± 0.201
1.265TrpGlu: 1.265 ± 0.301
0.708TrpPhe: 0.708 ± 0.209
1.163TrpGly: 1.163 ± 0.25
0.253TrpHis: 0.253 ± 0.114
0.86TrpIle: 0.86 ± 0.247
0.354TrpLys: 0.354 ± 0.132
1.922TrpLeu: 1.922 ± 0.287
0.303TrpMet: 0.303 ± 0.105
0.708TrpAsn: 0.708 ± 0.187
1.012TrpPro: 1.012 ± 0.307
1.012TrpGln: 1.012 ± 0.176
1.973TrpArg: 1.973 ± 0.345
0.91TrpSer: 0.91 ± 0.221
2.023TrpThr: 2.023 ± 0.309
1.669TrpVal: 1.669 ± 0.305
0.506TrpTrp: 0.506 ± 0.149
0.607TrpTyr: 0.607 ± 0.195
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.731TyrAla: 2.731 ± 0.46
0.051TyrCys: 0.051 ± 0.056
2.226TyrAsp: 2.226 ± 0.413
1.568TyrGlu: 1.568 ± 0.294
0.708TyrPhe: 0.708 ± 0.191
2.377TyrGly: 2.377 ± 0.344
0.506TyrHis: 0.506 ± 0.172
0.708TyrIle: 0.708 ± 0.22
0.405TyrLys: 0.405 ± 0.159
2.074TyrLeu: 2.074 ± 0.297
0.303TyrMet: 0.303 ± 0.123
0.809TyrAsn: 0.809 ± 0.247
1.315TyrPro: 1.315 ± 0.274
0.91TyrGln: 0.91 ± 0.201
2.074TyrArg: 2.074 ± 0.374
1.214TyrSer: 1.214 ± 0.281
1.72TyrThr: 1.72 ± 0.307
2.276TyrVal: 2.276 ± 0.36
0.405TyrTrp: 0.405 ± 0.133
0.405TyrTyr: 0.405 ± 0.148
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (19771 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski