Amino acid dipepetide frequency for Gordonia Phage Lollipop1437

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.28AlaAla: 12.28 ± 1.109
0.526AlaCys: 0.526 ± 0.153
7.693AlaAsp: 7.693 ± 0.569
7.789AlaGlu: 7.789 ± 0.646
3.01AlaPhe: 3.01 ± 0.36
7.741AlaGly: 7.741 ± 0.656
2.15AlaHis: 2.15 ± 0.336
5.399AlaIle: 5.399 ± 0.723
5.161AlaLys: 5.161 ± 0.535
7.932AlaLeu: 7.932 ± 0.881
2.246AlaMet: 2.246 ± 0.322
3.536AlaAsn: 3.536 ± 0.361
3.727AlaPro: 3.727 ± 0.555
3.679AlaGln: 3.679 ± 0.694
5.686AlaArg: 5.686 ± 0.558
4.874AlaSer: 4.874 ± 0.658
6.164AlaThr: 6.164 ± 0.572
7.789AlaVal: 7.789 ± 0.605
1.433AlaTrp: 1.433 ± 0.286
2.246AlaTyr: 2.246 ± 0.328
0.0AlaXaa: 0.0 ± 0.0
Cys
0.334CysAla: 0.334 ± 0.133
0.0CysCys: 0.0 ± 0.0
0.717CysAsp: 0.717 ± 0.203
0.43CysGlu: 0.43 ± 0.12
0.191CysPhe: 0.191 ± 0.092
0.765CysGly: 0.765 ± 0.196
0.334CysHis: 0.334 ± 0.144
0.382CysIle: 0.382 ± 0.141
0.239CysLys: 0.239 ± 0.122
0.382CysLeu: 0.382 ± 0.122
0.048CysMet: 0.048 ± 0.043
0.143CysAsn: 0.143 ± 0.077
0.478CysPro: 0.478 ± 0.166
0.191CysGln: 0.191 ± 0.097
0.334CysArg: 0.334 ± 0.134
0.382CysSer: 0.382 ± 0.162
0.43CysThr: 0.43 ± 0.161
0.478CysVal: 0.478 ± 0.196
0.239CysTrp: 0.239 ± 0.103
0.239CysTyr: 0.239 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
6.498AspAla: 6.498 ± 0.534
0.382AspCys: 0.382 ± 0.133
6.69AspAsp: 6.69 ± 0.973
5.925AspGlu: 5.925 ± 0.787
2.915AspPhe: 2.915 ± 0.404
7.215AspGly: 7.215 ± 0.574
1.386AspHis: 1.386 ± 0.227
3.631AspIle: 3.631 ± 0.453
2.389AspLys: 2.389 ± 0.31
5.304AspLeu: 5.304 ± 0.539
1.577AspMet: 1.577 ± 0.277
1.768AspAsn: 1.768 ± 0.296
5.83AspPro: 5.83 ± 0.72
2.867AspGln: 2.867 ± 0.334
4.587AspArg: 4.587 ± 0.594
2.963AspSer: 2.963 ± 0.363
2.819AspThr: 2.819 ± 0.382
4.014AspVal: 4.014 ± 0.55
1.147AspTrp: 1.147 ± 0.216
2.102AspTyr: 2.102 ± 0.332
0.0AspXaa: 0.0 ± 0.0
Glu
6.737GluAla: 6.737 ± 0.595
0.526GluCys: 0.526 ± 0.18
4.109GluAsp: 4.109 ± 0.49
4.587GluGlu: 4.587 ± 0.589
2.532GluPhe: 2.532 ± 0.354
4.444GluGly: 4.444 ± 0.593
1.242GluHis: 1.242 ± 0.234
4.348GluIle: 4.348 ± 0.468
3.01GluLys: 3.01 ± 0.474
6.164GluLeu: 6.164 ± 0.636
1.816GluMet: 1.816 ± 0.314
2.15GluAsn: 2.15 ± 0.395
2.294GluPro: 2.294 ± 0.412
2.102GluGln: 2.102 ± 0.329
5.161GluArg: 5.161 ± 0.661
2.819GluSer: 2.819 ± 0.305
4.348GluThr: 4.348 ± 0.514
4.969GluVal: 4.969 ± 0.51
1.529GluTrp: 1.529 ± 0.238
1.911GluTyr: 1.911 ± 0.324
0.0GluXaa: 0.0 ± 0.0
Phe
2.628PheAla: 2.628 ± 0.364
0.382PheCys: 0.382 ± 0.121
2.867PheAsp: 2.867 ± 0.383
2.867PheGlu: 2.867 ± 0.33
1.195PhePhe: 1.195 ± 0.244
3.154PheGly: 3.154 ± 0.387
0.908PheHis: 0.908 ± 0.222
1.29PheIle: 1.29 ± 0.179
0.956PheLys: 0.956 ± 0.198
2.676PheLeu: 2.676 ± 0.332
0.669PheMet: 0.669 ± 0.167
0.86PheAsn: 0.86 ± 0.184
1.338PhePro: 1.338 ± 0.267
1.003PheGln: 1.003 ± 0.194
2.485PheArg: 2.485 ± 0.267
2.055PheSer: 2.055 ± 0.36
1.959PheThr: 1.959 ± 0.317
2.102PheVal: 2.102 ± 0.378
0.717PheTrp: 0.717 ± 0.207
0.956PheTyr: 0.956 ± 0.188
0.0PheXaa: 0.0 ± 0.0
Gly
8.362GlyAla: 8.362 ± 0.774
0.526GlyCys: 0.526 ± 0.192
6.021GlyAsp: 6.021 ± 0.503
5.543GlyGlu: 5.543 ± 0.529
3.106GlyPhe: 3.106 ± 0.344
8.744GlyGly: 8.744 ± 1.456
2.055GlyHis: 2.055 ± 0.367
3.775GlyIle: 3.775 ± 0.474
4.157GlyLys: 4.157 ± 0.475
6.355GlyLeu: 6.355 ± 0.692
2.15GlyMet: 2.15 ± 0.407
2.724GlyAsn: 2.724 ± 0.448
4.539GlyPro: 4.539 ± 0.533
3.01GlyGln: 3.01 ± 0.435
5.782GlyArg: 5.782 ± 0.588
5.113GlySer: 5.113 ± 0.753
5.734GlyThr: 5.734 ± 0.759
5.495GlyVal: 5.495 ± 0.578
1.672GlyTrp: 1.672 ± 0.298
2.58GlyTyr: 2.58 ± 0.331
0.0GlyXaa: 0.0 ± 0.0
His
1.386HisAla: 1.386 ± 0.212
0.048HisCys: 0.048 ± 0.043
1.099HisAsp: 1.099 ± 0.243
0.812HisGlu: 0.812 ± 0.203
0.765HisPhe: 0.765 ± 0.206
1.386HisGly: 1.386 ± 0.247
0.43HisHis: 0.43 ± 0.139
0.86HisIle: 0.86 ± 0.169
0.382HisLys: 0.382 ± 0.128
1.338HisLeu: 1.338 ± 0.265
0.573HisMet: 0.573 ± 0.155
0.526HisAsn: 0.526 ± 0.149
1.433HisPro: 1.433 ± 0.348
0.334HisGln: 0.334 ± 0.114
2.007HisArg: 2.007 ± 0.329
0.956HisSer: 0.956 ± 0.224
1.625HisThr: 1.625 ± 0.296
1.911HisVal: 1.911 ± 0.346
0.478HisTrp: 0.478 ± 0.139
0.908HisTyr: 0.908 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
5.161IleAla: 5.161 ± 0.556
0.191IleCys: 0.191 ± 0.111
3.679IleAsp: 3.679 ± 0.406
4.492IleGlu: 4.492 ± 0.446
1.577IlePhe: 1.577 ± 0.276
4.253IleGly: 4.253 ± 0.473
1.195IleHis: 1.195 ± 0.274
2.198IleIle: 2.198 ± 0.328
2.246IleLys: 2.246 ± 0.334
3.297IleLeu: 3.297 ± 0.301
1.003IleMet: 1.003 ± 0.197
2.055IleAsn: 2.055 ± 0.299
2.628IlePro: 2.628 ± 0.334
1.338IleGln: 1.338 ± 0.3
3.44IleArg: 3.44 ± 0.377
2.724IleSer: 2.724 ± 0.398
3.536IleThr: 3.536 ± 0.476
3.536IleVal: 3.536 ± 0.435
1.003IleTrp: 1.003 ± 0.184
0.908IleTyr: 0.908 ± 0.221
0.0IleXaa: 0.0 ± 0.0
Lys
3.775LysAla: 3.775 ± 0.41
0.382LysCys: 0.382 ± 0.134
2.771LysAsp: 2.771 ± 0.328
1.72LysGlu: 1.72 ± 0.268
1.29LysPhe: 1.29 ± 0.248
3.01LysGly: 3.01 ± 0.392
0.621LysHis: 0.621 ± 0.171
2.676LysIle: 2.676 ± 0.399
2.771LysLys: 2.771 ± 0.449
3.106LysLeu: 3.106 ± 0.412
1.242LysMet: 1.242 ± 0.235
1.29LysAsn: 1.29 ± 0.317
2.437LysPro: 2.437 ± 0.306
1.672LysGln: 1.672 ± 0.28
3.536LysArg: 3.536 ± 0.41
2.485LysSer: 2.485 ± 0.311
2.676LysThr: 2.676 ± 0.347
3.249LysVal: 3.249 ± 0.365
0.86LysTrp: 0.86 ± 0.21
1.242LysTyr: 1.242 ± 0.215
0.0LysXaa: 0.0 ± 0.0
Leu
8.888LeuAla: 8.888 ± 0.981
0.669LeuCys: 0.669 ± 0.252
6.307LeuAsp: 6.307 ± 0.76
4.109LeuGlu: 4.109 ± 0.472
1.864LeuPhe: 1.864 ± 0.286
6.785LeuGly: 6.785 ± 0.763
1.051LeuHis: 1.051 ± 0.235
3.679LeuIle: 3.679 ± 0.477
3.154LeuLys: 3.154 ± 0.392
5.256LeuLeu: 5.256 ± 0.485
1.864LeuMet: 1.864 ± 0.326
2.246LeuAsn: 2.246 ± 0.29
4.253LeuPro: 4.253 ± 0.438
1.911LeuGln: 1.911 ± 0.268
6.307LeuArg: 6.307 ± 0.556
3.966LeuSer: 3.966 ± 0.413
5.161LeuThr: 5.161 ± 0.513
5.113LeuVal: 5.113 ± 0.429
1.338LeuTrp: 1.338 ± 0.219
1.864LeuTyr: 1.864 ± 0.299
0.0LeuXaa: 0.0 ± 0.0
Met
2.771MetAla: 2.771 ± 0.518
0.048MetCys: 0.048 ± 0.047
1.433MetAsp: 1.433 ± 0.255
1.386MetGlu: 1.386 ± 0.231
0.908MetPhe: 0.908 ± 0.229
1.242MetGly: 1.242 ± 0.228
0.334MetHis: 0.334 ± 0.135
1.195MetIle: 1.195 ± 0.259
1.433MetLys: 1.433 ± 0.226
1.864MetLeu: 1.864 ± 0.265
0.334MetMet: 0.334 ± 0.14
0.669MetAsn: 0.669 ± 0.169
1.816MetPro: 1.816 ± 0.269
0.669MetGln: 0.669 ± 0.195
1.433MetArg: 1.433 ± 0.253
2.055MetSer: 2.055 ± 0.307
1.672MetThr: 1.672 ± 0.317
1.481MetVal: 1.481 ± 0.273
0.478MetTrp: 0.478 ± 0.149
0.621MetTyr: 0.621 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
2.915AsnAla: 2.915 ± 0.408
0.287AsnCys: 0.287 ± 0.112
2.485AsnAsp: 2.485 ± 0.406
1.911AsnGlu: 1.911 ± 0.32
1.195AsnPhe: 1.195 ± 0.237
3.584AsnGly: 3.584 ± 0.385
0.526AsnHis: 0.526 ± 0.165
1.29AsnIle: 1.29 ± 0.22
1.051AsnLys: 1.051 ± 0.232
2.102AsnLeu: 2.102 ± 0.239
0.621AsnMet: 0.621 ± 0.194
1.099AsnAsn: 1.099 ± 0.216
2.341AsnPro: 2.341 ± 0.376
0.86AsnGln: 0.86 ± 0.19
2.676AsnArg: 2.676 ± 0.362
1.625AsnSer: 1.625 ± 0.298
1.911AsnThr: 1.911 ± 0.293
2.389AsnVal: 2.389 ± 0.36
1.051AsnTrp: 1.051 ± 0.221
0.908AsnTyr: 0.908 ± 0.223
0.0AsnXaa: 0.0 ± 0.0
Pro
4.826ProAla: 4.826 ± 0.506
0.287ProCys: 0.287 ± 0.148
4.969ProAsp: 4.969 ± 0.472
4.396ProGlu: 4.396 ± 0.663
1.72ProPhe: 1.72 ± 0.24
5.256ProGly: 5.256 ± 0.48
0.956ProHis: 0.956 ± 0.228
3.154ProIle: 3.154 ± 0.403
2.294ProLys: 2.294 ± 0.318
4.253ProLeu: 4.253 ± 0.507
1.242ProMet: 1.242 ± 0.209
2.15ProAsn: 2.15 ± 0.318
2.532ProPro: 2.532 ± 0.416
1.099ProGln: 1.099 ± 0.224
3.679ProArg: 3.679 ± 0.425
2.724ProSer: 2.724 ± 0.411
3.201ProThr: 3.201 ± 0.363
3.727ProVal: 3.727 ± 0.461
0.621ProTrp: 0.621 ± 0.182
0.765ProTyr: 0.765 ± 0.148
0.0ProXaa: 0.0 ± 0.0
Gln
3.393GlnAla: 3.393 ± 0.453
0.287GlnCys: 0.287 ± 0.111
1.195GlnAsp: 1.195 ± 0.278
1.864GlnGlu: 1.864 ± 0.302
1.242GlnPhe: 1.242 ± 0.284
2.055GlnGly: 2.055 ± 0.323
0.334GlnHis: 0.334 ± 0.131
1.577GlnIle: 1.577 ± 0.245
0.765GlnLys: 0.765 ± 0.162
2.724GlnLeu: 2.724 ± 0.424
1.003GlnMet: 1.003 ± 0.187
1.481GlnAsn: 1.481 ± 0.347
1.625GlnPro: 1.625 ± 0.285
1.529GlnGln: 1.529 ± 0.246
3.154GlnArg: 3.154 ± 0.348
1.481GlnSer: 1.481 ± 0.321
1.625GlnThr: 1.625 ± 0.234
2.341GlnVal: 2.341 ± 0.404
0.478GlnTrp: 0.478 ± 0.169
0.621GlnTyr: 0.621 ± 0.179
0.0GlnXaa: 0.0 ± 0.0
Arg
8.171ArgAla: 8.171 ± 0.736
0.669ArgCys: 0.669 ± 0.203
4.731ArgAsp: 4.731 ± 0.545
4.683ArgGlu: 4.683 ± 0.635
2.676ArgPhe: 2.676 ± 0.429
6.929ArgGly: 6.929 ± 0.54
1.577ArgHis: 1.577 ± 0.338
3.679ArgIle: 3.679 ± 0.47
3.727ArgLys: 3.727 ± 0.514
4.683ArgLeu: 4.683 ± 0.49
2.246ArgMet: 2.246 ± 0.292
2.198ArgAsn: 2.198 ± 0.381
3.44ArgPro: 3.44 ± 0.4
2.102ArgGln: 2.102 ± 0.333
7.215ArgArg: 7.215 ± 0.716
2.819ArgSer: 2.819 ± 0.443
4.348ArgThr: 4.348 ± 0.545
4.157ArgVal: 4.157 ± 0.561
1.481ArgTrp: 1.481 ± 0.256
1.768ArgTyr: 1.768 ± 0.283
0.0ArgXaa: 0.0 ± 0.0
Ser
6.307SerAla: 6.307 ± 0.692
0.191SerCys: 0.191 ± 0.095
3.154SerAsp: 3.154 ± 0.396
3.87SerGlu: 3.87 ± 0.417
1.481SerPhe: 1.481 ± 0.378
4.969SerGly: 4.969 ± 0.632
0.86SerHis: 0.86 ± 0.22
2.294SerIle: 2.294 ± 0.26
2.389SerLys: 2.389 ± 0.388
3.823SerLeu: 3.823 ± 0.411
1.768SerMet: 1.768 ± 0.276
1.959SerAsn: 1.959 ± 0.391
2.294SerPro: 2.294 ± 0.316
1.386SerGln: 1.386 ± 0.258
3.44SerArg: 3.44 ± 0.501
2.963SerSer: 2.963 ± 0.605
3.249SerThr: 3.249 ± 0.417
3.393SerVal: 3.393 ± 0.356
1.242SerTrp: 1.242 ± 0.214
1.099SerTyr: 1.099 ± 0.218
0.0SerXaa: 0.0 ± 0.0
Thr
5.638ThrAla: 5.638 ± 0.605
0.334ThrCys: 0.334 ± 0.141
3.727ThrAsp: 3.727 ± 0.394
3.345ThrGlu: 3.345 ± 0.42
1.768ThrPhe: 1.768 ± 0.286
6.021ThrGly: 6.021 ± 0.682
0.956ThrHis: 0.956 ± 0.274
3.44ThrIle: 3.44 ± 0.449
3.345ThrLys: 3.345 ± 0.353
4.731ThrLeu: 4.731 ± 0.48
1.242ThrMet: 1.242 ± 0.237
1.768ThrAsn: 1.768 ± 0.31
4.205ThrPro: 4.205 ± 0.398
1.72ThrGln: 1.72 ± 0.268
4.062ThrArg: 4.062 ± 0.514
3.201ThrSer: 3.201 ± 0.409
5.543ThrThr: 5.543 ± 0.679
5.782ThrVal: 5.782 ± 0.647
1.242ThrTrp: 1.242 ± 0.274
1.625ThrTyr: 1.625 ± 0.273
0.0ThrXaa: 0.0 ± 0.0
Val
6.881ValAla: 6.881 ± 0.616
0.334ValCys: 0.334 ± 0.119
4.539ValAsp: 4.539 ± 0.467
4.205ValGlu: 4.205 ± 0.399
2.198ValPhe: 2.198 ± 0.315
7.024ValGly: 7.024 ± 0.878
1.147ValHis: 1.147 ± 0.226
3.727ValIle: 3.727 ± 0.421
2.055ValLys: 2.055 ± 0.323
5.352ValLeu: 5.352 ± 0.439
1.529ValMet: 1.529 ± 0.228
2.819ValAsn: 2.819 ± 0.379
3.823ValPro: 3.823 ± 0.487
2.246ValGln: 2.246 ± 0.339
5.113ValArg: 5.113 ± 0.485
4.062ValSer: 4.062 ± 0.378
4.969ValThr: 4.969 ± 0.532
4.778ValVal: 4.778 ± 0.616
1.051ValTrp: 1.051 ± 0.214
1.625ValTyr: 1.625 ± 0.269
0.0ValXaa: 0.0 ± 0.0
Trp
1.577TrpAla: 1.577 ± 0.241
0.334TrpCys: 0.334 ± 0.125
1.72TrpAsp: 1.72 ± 0.307
1.386TrpGlu: 1.386 ± 0.279
0.717TrpPhe: 0.717 ± 0.222
1.242TrpGly: 1.242 ± 0.213
0.717TrpHis: 0.717 ± 0.228
1.051TrpIle: 1.051 ± 0.219
0.717TrpLys: 0.717 ± 0.191
1.529TrpLeu: 1.529 ± 0.259
0.191TrpMet: 0.191 ± 0.103
0.478TrpAsn: 0.478 ± 0.182
0.573TrpPro: 0.573 ± 0.155
0.621TrpGln: 0.621 ± 0.189
1.147TrpArg: 1.147 ± 0.311
1.529TrpSer: 1.529 ± 0.301
1.433TrpThr: 1.433 ± 0.25
1.577TrpVal: 1.577 ± 0.286
0.956TrpTrp: 0.956 ± 0.196
0.334TrpTyr: 0.334 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.532TyrAla: 2.532 ± 0.293
0.334TyrCys: 0.334 ± 0.112
2.102TyrAsp: 2.102 ± 0.313
1.481TyrGlu: 1.481 ± 0.278
0.717TyrPhe: 0.717 ± 0.178
1.625TyrGly: 1.625 ± 0.253
0.478TyrHis: 0.478 ± 0.174
0.812TyrIle: 0.812 ± 0.186
0.573TyrLys: 0.573 ± 0.197
2.915TyrLeu: 2.915 ± 0.401
0.43TyrMet: 0.43 ± 0.127
0.908TyrAsn: 0.908 ± 0.192
2.294TyrPro: 2.294 ± 0.363
0.43TyrGln: 0.43 ± 0.141
2.055TyrArg: 2.055 ± 0.302
1.338TyrSer: 1.338 ± 0.289
1.29TyrThr: 1.29 ± 0.263
1.195TyrVal: 1.195 ± 0.225
0.812TyrTrp: 0.812 ± 0.253
0.526TyrTyr: 0.526 ± 0.189
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (20929 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski