Amino acid dipepetide frequency for Citrobacter phage NS1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.025AlaAla: 10.025 ± 1.128
0.697AlaCys: 0.697 ± 0.299
4.359AlaAsp: 4.359 ± 0.544
5.666AlaGlu: 5.666 ± 0.746
3.051AlaPhe: 3.051 ± 0.447
7.41AlaGly: 7.41 ± 0.993
0.872AlaHis: 0.872 ± 0.29
5.405AlaIle: 5.405 ± 0.672
6.102AlaLys: 6.102 ± 0.784
7.236AlaLeu: 7.236 ± 0.801
2.615AlaMet: 2.615 ± 0.373
3.749AlaAsn: 3.749 ± 0.488
2.615AlaPro: 2.615 ± 0.557
3.051AlaGln: 3.051 ± 0.541
4.01AlaArg: 4.01 ± 0.655
5.318AlaSer: 5.318 ± 0.565
3.836AlaThr: 3.836 ± 0.585
6.364AlaVal: 6.364 ± 0.968
1.918AlaTrp: 1.918 ± 0.57
2.354AlaTyr: 2.354 ± 0.546
0.0AlaXaa: 0.0 ± 0.0
Cys
0.697CysAla: 0.697 ± 0.226
0.174CysCys: 0.174 ± 0.128
0.61CysAsp: 0.61 ± 0.361
0.697CysGlu: 0.697 ± 0.236
0.436CysPhe: 0.436 ± 0.198
0.697CysGly: 0.697 ± 0.232
0.087CysHis: 0.087 ± 0.091
0.61CysIle: 0.61 ± 0.274
0.697CysLys: 0.697 ± 0.245
1.046CysLeu: 1.046 ± 0.382
0.349CysMet: 0.349 ± 0.204
0.785CysAsn: 0.785 ± 0.319
0.349CysPro: 0.349 ± 0.165
0.087CysGln: 0.087 ± 0.069
0.523CysArg: 0.523 ± 0.221
0.697CysSer: 0.697 ± 0.29
0.262CysThr: 0.262 ± 0.169
0.697CysVal: 0.697 ± 0.258
0.349CysTrp: 0.349 ± 0.251
0.087CysTyr: 0.087 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
6.102AspAla: 6.102 ± 0.775
0.436AspCys: 0.436 ± 0.237
4.097AspAsp: 4.097 ± 0.728
3.836AspGlu: 3.836 ± 0.524
1.744AspPhe: 1.744 ± 0.415
6.8AspGly: 6.8 ± 0.827
1.482AspHis: 1.482 ± 0.351
3.4AspIle: 3.4 ± 0.447
3.313AspLys: 3.313 ± 0.504
5.143AspLeu: 5.143 ± 0.509
1.744AspMet: 1.744 ± 0.401
2.267AspAsn: 2.267 ± 0.534
2.441AspPro: 2.441 ± 0.637
2.005AspGln: 2.005 ± 0.47
2.267AspArg: 2.267 ± 0.436
3.051AspSer: 3.051 ± 0.433
4.01AspThr: 4.01 ± 0.596
3.749AspVal: 3.749 ± 0.54
1.133AspTrp: 1.133 ± 0.412
2.615AspTyr: 2.615 ± 0.51
0.0AspXaa: 0.0 ± 0.0
Glu
6.625GluAla: 6.625 ± 0.988
0.436GluCys: 0.436 ± 0.222
4.446GluAsp: 4.446 ± 0.602
4.01GluGlu: 4.01 ± 0.637
2.354GluPhe: 2.354 ± 0.477
5.143GluGly: 5.143 ± 0.81
1.046GluHis: 1.046 ± 0.274
2.615GluIle: 2.615 ± 0.455
2.441GluLys: 2.441 ± 0.466
5.318GluLeu: 5.318 ± 0.622
2.005GluMet: 2.005 ± 0.399
2.877GluAsn: 2.877 ± 0.491
2.092GluPro: 2.092 ± 0.413
3.138GluGln: 3.138 ± 0.487
4.097GluArg: 4.097 ± 0.552
3.574GluSer: 3.574 ± 0.722
3.574GluThr: 3.574 ± 0.456
3.923GluVal: 3.923 ± 0.806
1.22GluTrp: 1.22 ± 0.294
2.615GluTyr: 2.615 ± 0.533
0.0GluXaa: 0.0 ± 0.0
Phe
2.964PheAla: 2.964 ± 0.448
0.262PheCys: 0.262 ± 0.151
2.528PheAsp: 2.528 ± 0.512
1.569PheGlu: 1.569 ± 0.315
0.872PhePhe: 0.872 ± 0.318
2.615PheGly: 2.615 ± 0.569
0.959PheHis: 0.959 ± 0.249
1.656PheIle: 1.656 ± 0.4
2.702PheLys: 2.702 ± 0.438
2.877PheLeu: 2.877 ± 0.407
0.785PheMet: 0.785 ± 0.299
1.918PheAsn: 1.918 ± 0.338
1.482PhePro: 1.482 ± 0.389
0.959PheGln: 0.959 ± 0.365
1.744PheArg: 1.744 ± 0.322
2.79PheSer: 2.79 ± 0.311
1.744PheThr: 1.744 ± 0.359
2.179PheVal: 2.179 ± 0.462
0.262PheTrp: 0.262 ± 0.128
1.308PheTyr: 1.308 ± 0.287
0.0PheXaa: 0.0 ± 0.0
Gly
6.538GlyAla: 6.538 ± 1.234
0.785GlyCys: 0.785 ± 0.333
5.754GlyAsp: 5.754 ± 1.017
5.231GlyGlu: 5.231 ± 0.647
2.092GlyPhe: 2.092 ± 0.305
5.318GlyGly: 5.318 ± 0.757
1.308GlyHis: 1.308 ± 0.327
4.359GlyIle: 4.359 ± 0.597
5.492GlyLys: 5.492 ± 0.968
6.102GlyLeu: 6.102 ± 0.779
2.528GlyMet: 2.528 ± 0.465
2.79GlyAsn: 2.79 ± 0.522
1.744GlyPro: 1.744 ± 0.361
2.964GlyGln: 2.964 ± 0.367
5.056GlyArg: 5.056 ± 0.636
5.318GlySer: 5.318 ± 0.577
5.143GlyThr: 5.143 ± 0.649
6.102GlyVal: 6.102 ± 0.746
1.22GlyTrp: 1.22 ± 0.353
3.487GlyTyr: 3.487 ± 0.594
0.0GlyXaa: 0.0 ± 0.0
His
0.959HisAla: 0.959 ± 0.287
0.262HisCys: 0.262 ± 0.173
1.046HisAsp: 1.046 ± 0.434
0.872HisGlu: 0.872 ± 0.289
0.697HisPhe: 0.697 ± 0.303
1.046HisGly: 1.046 ± 0.341
0.436HisHis: 0.436 ± 0.199
1.046HisIle: 1.046 ± 0.244
1.482HisLys: 1.482 ± 0.384
2.092HisLeu: 2.092 ± 0.455
0.262HisMet: 0.262 ± 0.134
0.523HisAsn: 0.523 ± 0.212
0.436HisPro: 0.436 ± 0.196
0.959HisGln: 0.959 ± 0.243
1.22HisArg: 1.22 ± 0.327
1.046HisSer: 1.046 ± 0.266
1.395HisThr: 1.395 ± 0.411
0.959HisVal: 0.959 ± 0.258
0.523HisTrp: 0.523 ± 0.217
0.523HisTyr: 0.523 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
4.359IleAla: 4.359 ± 0.571
0.697IleCys: 0.697 ± 0.263
2.702IleAsp: 2.702 ± 0.412
2.877IleGlu: 2.877 ± 0.494
1.308IlePhe: 1.308 ± 0.373
3.836IleGly: 3.836 ± 0.656
1.046IleHis: 1.046 ± 0.332
1.918IleIle: 1.918 ± 0.422
3.313IleLys: 3.313 ± 0.551
3.4IleLeu: 3.4 ± 0.689
1.046IleMet: 1.046 ± 0.284
2.267IleAsn: 2.267 ± 0.399
2.179IlePro: 2.179 ± 0.615
1.656IleGln: 1.656 ± 0.472
2.964IleArg: 2.964 ± 0.405
3.4IleSer: 3.4 ± 0.612
3.661IleThr: 3.661 ± 0.913
4.272IleVal: 4.272 ± 0.56
0.697IleTrp: 0.697 ± 0.198
1.395IleTyr: 1.395 ± 0.251
0.0IleXaa: 0.0 ± 0.0
Lys
7.584LysAla: 7.584 ± 0.819
0.61LysCys: 0.61 ± 0.262
3.836LysAsp: 3.836 ± 0.595
3.313LysGlu: 3.313 ± 0.473
2.267LysPhe: 2.267 ± 0.416
3.749LysGly: 3.749 ± 0.579
1.22LysHis: 1.22 ± 0.449
1.918LysIle: 1.918 ± 0.473
3.836LysLys: 3.836 ± 0.78
5.666LysLeu: 5.666 ± 0.77
1.656LysMet: 1.656 ± 0.403
2.005LysAsn: 2.005 ± 0.39
2.615LysPro: 2.615 ± 0.47
1.482LysGln: 1.482 ± 0.368
4.01LysArg: 4.01 ± 0.66
3.923LysSer: 3.923 ± 0.525
4.533LysThr: 4.533 ± 0.505
5.405LysVal: 5.405 ± 0.835
1.308LysTrp: 1.308 ± 0.339
2.005LysTyr: 2.005 ± 0.39
0.0LysXaa: 0.0 ± 0.0
Leu
6.887LeuAla: 6.887 ± 0.699
0.523LeuCys: 0.523 ± 0.212
4.62LeuAsp: 4.62 ± 0.484
5.928LeuGlu: 5.928 ± 0.746
1.831LeuPhe: 1.831 ± 0.391
4.969LeuGly: 4.969 ± 0.642
0.785LeuHis: 0.785 ± 0.23
4.184LeuIle: 4.184 ± 0.613
6.538LeuLys: 6.538 ± 0.813
6.364LeuLeu: 6.364 ± 0.716
3.226LeuMet: 3.226 ± 0.459
4.708LeuAsn: 4.708 ± 0.816
3.313LeuPro: 3.313 ± 0.502
4.097LeuGln: 4.097 ± 0.827
5.492LeuArg: 5.492 ± 0.645
5.928LeuSer: 5.928 ± 0.832
5.405LeuThr: 5.405 ± 0.653
4.969LeuVal: 4.969 ± 0.607
1.133LeuTrp: 1.133 ± 0.459
2.005LeuTyr: 2.005 ± 0.58
0.0LeuXaa: 0.0 ± 0.0
Met
2.79MetAla: 2.79 ± 0.374
0.523MetCys: 0.523 ± 0.217
1.133MetAsp: 1.133 ± 0.345
2.267MetGlu: 2.267 ± 0.321
1.395MetPhe: 1.395 ± 0.389
2.441MetGly: 2.441 ± 0.414
0.436MetHis: 0.436 ± 0.192
1.308MetIle: 1.308 ± 0.222
0.785MetLys: 0.785 ± 0.225
2.267MetLeu: 2.267 ± 0.397
0.61MetMet: 0.61 ± 0.232
0.959MetAsn: 0.959 ± 0.208
0.61MetPro: 0.61 ± 0.214
0.785MetGln: 0.785 ± 0.312
1.395MetArg: 1.395 ± 0.394
2.092MetSer: 2.092 ± 0.474
2.092MetThr: 2.092 ± 0.45
3.138MetVal: 3.138 ± 0.46
0.349MetTrp: 0.349 ± 0.176
0.872MetTyr: 0.872 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
4.272AsnAla: 4.272 ± 0.656
0.785AsnCys: 0.785 ± 0.302
2.005AsnAsp: 2.005 ± 0.57
1.918AsnGlu: 1.918 ± 0.467
1.308AsnPhe: 1.308 ± 0.309
4.446AsnGly: 4.446 ± 0.638
0.785AsnHis: 0.785 ± 0.231
2.267AsnIle: 2.267 ± 0.407
2.267AsnLys: 2.267 ± 0.508
3.313AsnLeu: 3.313 ± 0.687
0.872AsnMet: 0.872 ± 0.268
1.918AsnAsn: 1.918 ± 0.45
2.964AsnPro: 2.964 ± 0.485
1.831AsnGln: 1.831 ± 0.388
2.615AsnArg: 2.615 ± 0.451
2.615AsnSer: 2.615 ± 0.567
2.528AsnThr: 2.528 ± 0.371
3.661AsnVal: 3.661 ± 0.696
0.174AsnTrp: 0.174 ± 0.1
1.831AsnTyr: 1.831 ± 0.555
0.0AsnXaa: 0.0 ± 0.0
Pro
3.138ProAla: 3.138 ± 0.551
0.262ProCys: 0.262 ± 0.19
2.441ProAsp: 2.441 ± 0.381
2.964ProGlu: 2.964 ± 0.448
1.046ProPhe: 1.046 ± 0.225
1.918ProGly: 1.918 ± 0.283
0.436ProHis: 0.436 ± 0.164
2.005ProIle: 2.005 ± 0.428
2.877ProLys: 2.877 ± 0.621
2.179ProLeu: 2.179 ± 0.396
1.22ProMet: 1.22 ± 0.259
2.092ProAsn: 2.092 ± 0.456
0.436ProPro: 0.436 ± 0.171
1.482ProGln: 1.482 ± 0.39
1.918ProArg: 1.918 ± 0.458
2.267ProSer: 2.267 ± 0.445
3.051ProThr: 3.051 ± 0.399
2.528ProVal: 2.528 ± 0.333
0.697ProTrp: 0.697 ± 0.275
0.785ProTyr: 0.785 ± 0.213
0.0ProXaa: 0.0 ± 0.0
Gln
2.615GlnAla: 2.615 ± 0.567
0.087GlnCys: 0.087 ± 0.104
2.964GlnAsp: 2.964 ± 0.826
2.528GlnGlu: 2.528 ± 0.385
2.092GlnPhe: 2.092 ± 0.405
2.877GlnGly: 2.877 ± 0.53
0.523GlnHis: 0.523 ± 0.24
0.872GlnIle: 0.872 ± 0.284
1.918GlnLys: 1.918 ± 0.372
4.097GlnLeu: 4.097 ± 0.679
1.046GlnMet: 1.046 ± 0.34
1.569GlnAsn: 1.569 ± 0.429
0.959GlnPro: 0.959 ± 0.349
2.354GlnGln: 2.354 ± 0.63
2.354GlnArg: 2.354 ± 0.731
3.226GlnSer: 3.226 ± 0.486
2.092GlnThr: 2.092 ± 0.52
3.226GlnVal: 3.226 ± 0.384
0.697GlnTrp: 0.697 ± 0.237
1.308GlnTyr: 1.308 ± 0.381
0.0GlnXaa: 0.0 ± 0.0
Arg
4.359ArgAla: 4.359 ± 0.973
0.523ArgCys: 0.523 ± 0.18
4.359ArgAsp: 4.359 ± 0.507
4.184ArgGlu: 4.184 ± 0.53
2.354ArgPhe: 2.354 ± 0.444
3.836ArgGly: 3.836 ± 0.452
0.872ArgHis: 0.872 ± 0.292
3.051ArgIle: 3.051 ± 0.508
3.923ArgLys: 3.923 ± 0.66
5.928ArgLeu: 5.928 ± 0.741
0.959ArgMet: 0.959 ± 0.309
2.79ArgAsn: 2.79 ± 0.528
1.395ArgPro: 1.395 ± 0.366
2.441ArgGln: 2.441 ± 0.408
2.354ArgArg: 2.354 ± 0.389
4.01ArgSer: 4.01 ± 0.502
3.138ArgThr: 3.138 ± 0.484
3.4ArgVal: 3.4 ± 0.695
1.308ArgTrp: 1.308 ± 0.302
1.482ArgTyr: 1.482 ± 0.309
0.0ArgXaa: 0.0 ± 0.0
Ser
4.184SerAla: 4.184 ± 0.623
1.133SerCys: 1.133 ± 0.443
4.969SerAsp: 4.969 ± 0.556
3.749SerGlu: 3.749 ± 0.612
3.226SerPhe: 3.226 ± 0.449
6.19SerGly: 6.19 ± 0.701
2.441SerHis: 2.441 ± 0.449
3.487SerIle: 3.487 ± 0.596
3.313SerLys: 3.313 ± 0.519
4.184SerLeu: 4.184 ± 0.59
1.831SerMet: 1.831 ± 0.453
2.615SerAsn: 2.615 ± 0.514
3.138SerPro: 3.138 ± 0.506
2.179SerGln: 2.179 ± 0.409
3.661SerArg: 3.661 ± 0.758
4.533SerSer: 4.533 ± 0.932
3.313SerThr: 3.313 ± 0.463
3.923SerVal: 3.923 ± 0.577
0.61SerTrp: 0.61 ± 0.197
2.702SerTyr: 2.702 ± 0.553
0.0SerXaa: 0.0 ± 0.0
Thr
3.661ThrAla: 3.661 ± 0.577
0.697ThrCys: 0.697 ± 0.192
3.487ThrAsp: 3.487 ± 0.549
4.359ThrGlu: 4.359 ± 0.655
2.354ThrPhe: 2.354 ± 0.527
5.754ThrGly: 5.754 ± 0.67
0.697ThrHis: 0.697 ± 0.238
3.4ThrIle: 3.4 ± 0.598
3.4ThrLys: 3.4 ± 0.585
6.19ThrLeu: 6.19 ± 0.698
1.918ThrMet: 1.918 ± 0.338
2.528ThrAsn: 2.528 ± 0.546
2.877ThrPro: 2.877 ± 0.381
2.79ThrGln: 2.79 ± 0.504
2.702ThrArg: 2.702 ± 0.521
3.749ThrSer: 3.749 ± 0.83
4.01ThrThr: 4.01 ± 0.804
4.097ThrVal: 4.097 ± 0.491
0.872ThrTrp: 0.872 ± 0.314
1.308ThrTyr: 1.308 ± 0.261
0.0ThrXaa: 0.0 ± 0.0
Val
4.969ValAla: 4.969 ± 0.763
0.436ValCys: 0.436 ± 0.188
3.4ValAsp: 3.4 ± 0.476
5.056ValGlu: 5.056 ± 0.816
2.092ValPhe: 2.092 ± 0.427
6.451ValGly: 6.451 ± 0.85
1.133ValHis: 1.133 ± 0.379
3.487ValIle: 3.487 ± 0.514
5.579ValLys: 5.579 ± 0.689
4.882ValLeu: 4.882 ± 0.557
2.528ValMet: 2.528 ± 0.52
3.749ValAsn: 3.749 ± 0.609
2.615ValPro: 2.615 ± 0.481
2.702ValGln: 2.702 ± 0.41
4.882ValArg: 4.882 ± 0.719
5.056ValSer: 5.056 ± 0.538
4.097ValThr: 4.097 ± 0.57
5.666ValVal: 5.666 ± 0.924
1.046ValTrp: 1.046 ± 0.392
2.528ValTyr: 2.528 ± 0.517
0.0ValXaa: 0.0 ± 0.0
Trp
0.349TrpAla: 0.349 ± 0.176
0.174TrpCys: 0.174 ± 0.144
0.872TrpAsp: 0.872 ± 0.274
1.22TrpGlu: 1.22 ± 0.428
0.523TrpPhe: 0.523 ± 0.22
0.959TrpGly: 0.959 ± 0.396
0.61TrpHis: 0.61 ± 0.23
0.523TrpIle: 0.523 ± 0.306
1.046TrpLys: 1.046 ± 0.279
2.528TrpLeu: 2.528 ± 0.392
0.262TrpMet: 0.262 ± 0.182
0.872TrpAsn: 0.872 ± 0.23
0.436TrpPro: 0.436 ± 0.197
0.697TrpGln: 0.697 ± 0.282
0.785TrpArg: 0.785 ± 0.278
1.22TrpSer: 1.22 ± 0.515
0.697TrpThr: 0.697 ± 0.274
1.482TrpVal: 1.482 ± 0.397
0.436TrpTrp: 0.436 ± 0.172
0.523TrpTyr: 0.523 ± 0.179
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.574TyrAla: 3.574 ± 0.623
0.436TyrCys: 0.436 ± 0.236
2.005TyrAsp: 2.005 ± 0.337
1.308TyrGlu: 1.308 ± 0.48
1.133TyrPhe: 1.133 ± 0.274
2.877TyrGly: 2.877 ± 0.496
0.61TyrHis: 0.61 ± 0.226
1.482TyrIle: 1.482 ± 0.486
1.918TyrLys: 1.918 ± 0.416
2.092TyrLeu: 2.092 ± 0.246
0.697TyrMet: 0.697 ± 0.234
1.482TyrAsn: 1.482 ± 0.47
1.046TyrPro: 1.046 ± 0.377
1.744TyrGln: 1.744 ± 0.553
2.615TyrArg: 2.615 ± 0.578
1.569TyrSer: 1.569 ± 0.463
2.267TyrThr: 2.267 ± 0.373
2.615TyrVal: 2.615 ± 0.533
0.262TyrTrp: 0.262 ± 0.149
1.133TyrTyr: 1.133 ± 0.261
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11472 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski