Amino acid dipeptide frequency for Microbacterium phage Piperis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
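The calculation described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the database's actual code: the function names `dipeptide_per_mille` and `classify` are hypothetical, dipeptides are counted as overlapping pairs within each protein (the exact counting convention used for the table is not stated here), and any residue outside the 20 standard one-letter codes is mapped to `X` (Xaa).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard residue to X before counting.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"      # shown in red in the table
    if freq_per_mille < expected:
        return "under"     # shown in blue in the table
    return "as expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACA"]  # toy sequences, not taken from the proteome
    freqs = dipeptide_per_mille(toy)
    print(freqs["AA"], classify(freqs["AA"]))
```

With the values from the table, `classify(17.672)` (AlaAla) falls above the 2.5‰ uniform expectation and `classify(0.827)` (AlaCys) falls below it.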

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions (for example, 2.5‰ = 0.0025 = 0.25%).

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 17.672 ± 1.561 | 0.827 ± 0.261 | 8.334 ± 0.759 | 7.861 ± 0.839 | 3.369 ± 0.393 | 8.748 ± 0.922 | 2.482 ± 0.386 | 5.497 ± 0.648 | 3.605 ± 0.468 | 11.821 ± 1.39 | 3.251 ± 0.366 | 3.192 ± 0.369 | 6.206 ± 0.63 | 4.788 ± 0.571 | 7.388 ± 0.799 | 6.62 ± 0.659 | 7.684 ± 0.628 | 6.679 ± 0.695 | 3.073 ± 0.397 | 2.896 ± 0.42 | 0.0 ± 0.0
Cys | 0.768 ± 0.24 | 0.177 ± 0.13 | 1.241 ± 0.322 | 0.591 ± 0.212 | 0.236 ± 0.107 | 1.3 ± 0.308 | 0.355 ± 0.142 | 0.236 ± 0.119 | 0.0 ± 0.0 | 0.355 ± 0.133 | 0.177 ± 0.093 | 0.177 ± 0.094 | 0.768 ± 0.183 | 0.236 ± 0.114 | 0.591 ± 0.166 | 0.473 ± 0.293 | 0.355 ± 0.128 | 0.65 ± 0.19 | 0.118 ± 0.072 | 0.118 ± 0.067 | 0.0 ± 0.0
Asp | 9.043 ± 0.799 | 0.65 ± 0.264 | 5.142 ± 0.732 | 5.201 ± 0.778 | 2.128 ± 0.297 | 6.383 ± 0.609 | 1.241 ± 0.259 | 2.896 ± 0.459 | 1.241 ± 0.231 | 5.851 ± 0.606 | 1.596 ± 0.348 | 1.596 ± 0.285 | 4.019 ± 0.515 | 2.246 ± 0.39 | 5.142 ± 0.688 | 3.428 ± 0.414 | 2.955 ± 0.384 | 4.847 ± 0.583 | 0.946 ± 0.262 | 1.714 ± 0.349 | 0.0 ± 0.0
Glu | 8.334 ± 0.958 | 0.473 ± 0.141 | 4.847 ± 0.698 | 4.433 ± 0.662 | 1.596 ± 0.355 | 4.906 ± 0.573 | 1.891 ± 0.379 | 1.891 ± 0.46 | 1.773 ± 0.302 | 3.133 ± 0.349 | 1.241 ± 0.236 | 1.478 ± 0.289 | 5.083 ± 0.8 | 2.128 ± 0.352 | 5.615 ± 0.601 | 2.896 ± 0.462 | 3.605 ± 0.548 | 5.733 ± 0.654 | 1.3 ± 0.268 | 2.01 ± 0.351 | 0.0 ± 0.0
Phe | 3.133 ± 0.462 | 0.236 ± 0.155 | 2.305 ± 0.373 | 1.95 ± 0.327 | 0.65 ± 0.215 | 2.542 ± 0.432 | 0.355 ± 0.133 | 1.478 ± 0.294 | 0.709 ± 0.197 | 1.596 ± 0.259 | 0.65 ± 0.186 | 0.414 ± 0.14 | 1.3 ± 0.282 | 0.532 ± 0.208 | 1.478 ± 0.284 | 1.241 ± 0.274 | 2.364 ± 0.337 | 1.655 ± 0.301 | 0.236 ± 0.108 | 0.709 ± 0.232 | 0.0 ± 0.0
Gly | 7.802 ± 0.911 | 0.709 ± 0.216 | 6.265 ± 0.624 | 5.438 ± 0.549 | 3.014 ± 0.391 | 7.27 ± 0.566 | 1.891 ± 0.314 | 4.374 ± 0.581 | 2.719 ± 0.383 | 6.62 ± 1.064 | 3.428 ± 0.47 | 1.95 ± 0.339 | 3.487 ± 0.456 | 3.133 ± 0.425 | 5.97 ± 0.541 | 4.847 ± 0.57 | 6.088 ± 0.558 | 5.911 ± 0.635 | 2.364 ± 0.345 | 2.778 ± 0.329 | 0.0 ± 0.0
His | 2.069 ± 0.385 | 0.177 ± 0.096 | 1.419 ± 0.251 | 1.182 ± 0.237 | 0.532 ± 0.158 | 1.95 ± 0.308 | 0.355 ± 0.175 | 0.768 ± 0.196 | 0.473 ± 0.178 | 2.01 ± 0.421 | 0.355 ± 0.13 | 0.414 ± 0.138 | 1.359 ± 0.328 | 0.355 ± 0.149 | 1.064 ± 0.305 | 0.827 ± 0.203 | 1.359 ± 0.344 | 1.596 ± 0.303 | 0.355 ± 0.149 | 0.887 ± 0.235 | 0.0 ± 0.0
Ile | 4.61 ± 0.43 | 0.414 ± 0.186 | 3.546 ± 0.488 | 3.605 ± 0.434 | 0.768 ± 0.206 | 3.428 ± 0.412 | 0.709 ± 0.185 | 2.01 ± 0.493 | 1.005 ± 0.187 | 2.542 ± 0.375 | 0.946 ± 0.273 | 0.946 ± 0.258 | 3.251 ± 0.656 | 1.419 ± 0.329 | 3.369 ± 0.446 | 2.542 ± 0.426 | 4.492 ± 0.577 | 4.019 ± 0.433 | 0.532 ± 0.152 | 0.532 ± 0.159 | 0.0 ± 0.0
Lys | 3.783 ± 0.519 | 0.355 ± 0.134 | 1.005 ± 0.247 | 0.591 ± 0.198 | 0.768 ± 0.239 | 2.364 ± 0.388 | 0.591 ± 0.185 | 1.064 ± 0.215 | 0.65 ± 0.219 | 1.123 ± 0.226 | 0.768 ± 0.178 | 0.236 ± 0.118 | 1.95 ± 0.361 | 0.709 ± 0.177 | 2.482 ± 0.459 | 1.714 ± 0.278 | 1.596 ± 0.378 | 2.955 ± 0.415 | 0.532 ± 0.199 | 0.355 ± 0.146 | 0.0 ± 0.0
Leu | 9.811 ± 0.945 | 0.296 ± 0.116 | 4.847 ± 0.655 | 4.433 ± 0.512 | 1.596 ± 0.394 | 6.561 ± 0.983 | 1.064 ± 0.238 | 4.137 ± 0.659 | 0.65 ± 0.163 | 5.438 ± 0.739 | 2.01 ± 0.306 | 1.773 ± 0.379 | 4.906 ± 0.553 | 2.187 ± 0.34 | 4.788 ± 0.595 | 5.615 ± 0.702 | 6.442 ± 0.677 | 6.088 ± 0.673 | 1.182 ± 0.262 | 1.3 ± 0.313 | 0.0 ± 0.0
Met | 3.251 ± 0.396 | 0.177 ± 0.11 | 1.064 ± 0.275 | 0.473 ± 0.157 | 0.473 ± 0.154 | 2.246 ± 0.372 | 0.414 ± 0.15 | 1.714 ± 0.39 | 0.296 ± 0.114 | 1.832 ± 0.337 | 0.236 ± 0.099 | 0.768 ± 0.198 | 1.419 ± 0.318 | 0.768 ± 0.231 | 1.832 ± 0.298 | 2.542 ± 0.33 | 2.601 ± 0.344 | 1.655 ± 0.255 | 0.177 ± 0.096 | 0.709 ± 0.225 | 0.0 ± 0.0
Asn | 3.428 ± 0.395 | 0.296 ± 0.118 | 1.064 ± 0.235 | 0.946 ± 0.239 | 0.355 ± 0.161 | 3.133 ± 0.499 | 0.414 ± 0.18 | 0.532 ± 0.195 | 0.414 ± 0.165 | 2.423 ± 0.305 | 0.414 ± 0.193 | 0.65 ± 0.222 | 2.482 ± 0.447 | 0.591 ± 0.171 | 2.01 ± 0.409 | 1.714 ± 0.339 | 2.246 ± 0.315 | 1.714 ± 0.338 | 0.177 ± 0.093 | 0.473 ± 0.149 | 0.0 ± 0.0
Pro | 7.861 ± 0.897 | 0.65 ± 0.193 | 4.965 ± 0.587 | 5.851 ± 0.805 | 1.478 ± 0.296 | 5.438 ± 0.457 | 1.123 ± 0.271 | 1.537 ± 0.301 | 1.596 ± 0.338 | 3.783 ± 0.432 | 1.359 ± 0.285 | 1.832 ± 0.389 | 3.901 ± 0.633 | 1.714 ± 0.465 | 2.896 ± 0.37 | 3.665 ± 0.419 | 4.61 ± 0.754 | 3.96 ± 0.5 | 1.005 ± 0.245 | 1.419 ± 0.246 | 0.0 ± 0.0
Gln | 3.842 ± 0.574 | 0.296 ± 0.124 | 1.773 ± 0.279 | 1.3 ± 0.228 | 1.123 ± 0.237 | 2.364 ± 0.335 | 0.768 ± 0.212 | 1.655 ± 0.36 | 0.827 ± 0.208 | 1.064 ± 0.323 | 1.3 ± 0.284 | 1.3 ± 0.302 | 1.714 ± 0.332 | 1.182 ± 0.27 | 2.187 ± 0.4 | 2.01 ± 0.324 | 2.305 ± 0.411 | 2.542 ± 0.424 | 0.768 ± 0.196 | 1.005 ± 0.231 | 0.0 ± 0.0
Arg | 7.329 ± 0.74 | 0.946 ± 0.229 | 4.492 ± 0.644 | 5.083 ± 0.71 | 1.714 ± 0.358 | 6.029 ± 0.545 | 1.655 ± 0.342 | 2.955 ± 0.428 | 2.246 ± 0.322 | 6.147 ± 0.635 | 1.419 ± 0.277 | 1.359 ± 0.262 | 3.251 ± 0.511 | 3.073 ± 0.41 | 7.329 ± 0.849 | 3.487 ± 0.48 | 3.724 ± 0.522 | 4.374 ± 0.607 | 1.655 ± 0.362 | 2.246 ± 0.385 | 0.0 ± 0.0
Ser | 8.275 ± 1.114 | 0.177 ± 0.103 | 5.024 ± 0.498 | 3.605 ± 0.516 | 1.419 ± 0.297 | 4.551 ± 0.617 | 1.123 ± 0.25 | 2.955 ± 0.472 | 1.832 ± 0.299 | 4.137 ± 0.561 | 1.714 ± 0.337 | 1.891 ± 0.357 | 2.66 ± 0.39 | 1.064 ± 0.226 | 3.487 ± 0.393 | 2.66 ± 0.539 | 4.433 ± 0.585 | 4.551 ± 0.539 | 1.064 ± 0.217 | 1.3 ± 0.317 | 0.0 ± 0.0
Thr | 8.097 ± 0.885 | 0.591 ± 0.181 | 3.783 ± 0.538 | 4.256 ± 0.481 | 1.478 ± 0.284 | 6.974 ± 0.796 | 1.419 ± 0.279 | 3.428 ± 0.517 | 2.069 ± 0.461 | 5.792 ± 0.635 | 1.3 ± 0.253 | 1.478 ± 0.324 | 5.851 ± 0.682 | 1.419 ± 0.232 | 4.019 ± 0.555 | 3.842 ± 0.561 | 5.201 ± 0.472 | 6.679 ± 0.599 | 0.887 ± 0.2 | 1.891 ± 0.327 | 0.0 ± 0.0
Val | 8.038 ± 0.654 | 0.827 ± 0.217 | 3.546 ± 0.517 | 4.728 ± 0.529 | 1.537 ± 0.326 | 6.147 ± 0.65 | 1.005 ± 0.259 | 3.783 ± 0.475 | 2.128 ± 0.402 | 5.911 ± 0.527 | 1.655 ± 0.347 | 2.896 ± 0.425 | 5.319 ± 0.499 | 2.246 ± 0.43 | 5.024 ± 0.637 | 4.847 ± 0.592 | 6.029 ± 0.427 | 5.142 ± 0.632 | 1.537 ± 0.339 | 2.128 ± 0.332 | 0.0 ± 0.0
Trp | 2.069 ± 0.38 | 0.355 ± 0.144 | 0.887 ± 0.28 | 1.3 ± 0.283 | 0.887 ± 0.2 | 1.241 ± 0.266 | 0.414 ± 0.171 | 0.709 ± 0.2 | 0.709 ± 0.178 | 1.596 ± 0.345 | 0.177 ± 0.1 | 0.532 ± 0.181 | 0.709 ± 0.177 | 0.65 ± 0.179 | 1.714 ± 0.358 | 1.359 ± 0.306 | 0.768 ± 0.213 | 1.714 ± 0.286 | 0.296 ± 0.13 | 0.65 ± 0.186 | 0.0 ± 0.0
Tyr | 2.778 ± 0.418 | 0.296 ± 0.144 | 2.719 ± 0.428 | 1.537 ± 0.264 | 0.414 ± 0.201 | 2.482 ± 0.421 | 0.236 ± 0.106 | 1.064 ± 0.234 | 0.65 ± 0.205 | 1.95 ± 0.342 | 0.591 ± 0.208 | 0.591 ± 0.172 | 1.064 ± 0.27 | 0.887 ± 0.222 | 2.128 ± 0.333 | 1.596 ± 0.363 | 1.596 ± 0.245 | 2.01 ± 0.359 | 0.532 ± 0.148 | 0.65 ± 0.268 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 92 proteins (16920 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
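As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and reports the standard deviation of the resulting dipeptide frequency. It is an assumption-laden sketch, not the procedure actually used for the table: the function name `bootstrap_error`, the use of the sample standard deviation as the error, the fixed random seed, and the within-protein counting of pairs are all choices made here for the example.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together (protein-level bootstrap).
    """
    rng = random.Random(seed)
    # Count overlapping dipeptides once per protein, then reuse the counts.
    per_protein = [Counter(seq[i:i + 2] for i in range(len(seq) - 1))
                   for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Spread of the resampled frequencies, used here as the error estimate.
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "AACC"]  # toy sequences, not from the proteome
    print(round(bootstrap_error(toy, "AA"), 3))
```

Resampling proteins rather than individual dipeptides keeps the correlation between residues of the same protein, which is why a proteome with few, very different proteins tends to show larger errors.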

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski