Amino acid dipepetide frequency for Gordonia phage DelRio

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.072AlaAla: 21.072 ± 1.735
0.559AlaCys: 0.559 ± 0.191
9.324AlaAsp: 9.324 ± 0.899
7.397AlaGlu: 7.397 ± 0.958
3.294AlaPhe: 3.294 ± 0.567
15.54AlaGly: 15.54 ± 1.359
2.176AlaHis: 2.176 ± 0.373
4.848AlaIle: 4.848 ± 0.429
3.978AlaLys: 3.978 ± 0.827
9.635AlaLeu: 9.635 ± 1.08
2.611AlaMet: 2.611 ± 0.435
3.543AlaAsn: 3.543 ± 0.542
7.583AlaPro: 7.583 ± 1.184
3.978AlaGln: 3.978 ± 0.557
8.64AlaArg: 8.64 ± 1.079
5.47AlaSer: 5.47 ± 0.687
6.962AlaThr: 6.962 ± 0.813
9.635AlaVal: 9.635 ± 0.877
2.238AlaTrp: 2.238 ± 0.452
2.051AlaTyr: 2.051 ± 0.408
0.0AlaXaa: 0.0 ± 0.0
Cys
1.057CysAla: 1.057 ± 0.293
0.062CysCys: 0.062 ± 0.052
0.311CysAsp: 0.311 ± 0.129
0.622CysGlu: 0.622 ± 0.206
0.062CysPhe: 0.062 ± 0.043
1.616CysGly: 1.616 ± 0.384
0.062CysHis: 0.062 ± 0.065
0.124CysIle: 0.124 ± 0.093
0.124CysLys: 0.124 ± 0.098
0.559CysLeu: 0.559 ± 0.162
0.062CysMet: 0.062 ± 0.054
0.497CysAsn: 0.497 ± 0.251
0.684CysPro: 0.684 ± 0.233
0.124CysGln: 0.124 ± 0.099
0.746CysArg: 0.746 ± 0.206
0.373CysSer: 0.373 ± 0.157
0.808CysThr: 0.808 ± 0.242
0.497CysVal: 0.497 ± 0.22
0.249CysTrp: 0.249 ± 0.141
0.124CysTyr: 0.124 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
7.894AspAla: 7.894 ± 0.639
0.622AspCys: 0.622 ± 0.239
6.216AspAsp: 6.216 ± 0.773
3.978AspGlu: 3.978 ± 0.593
1.367AspPhe: 1.367 ± 0.296
8.081AspGly: 8.081 ± 1.044
1.865AspHis: 1.865 ± 0.362
2.735AspIle: 2.735 ± 0.41
2.113AspLys: 2.113 ± 0.36
6.278AspLeu: 6.278 ± 0.571
1.243AspMet: 1.243 ± 0.26
2.113AspAsn: 2.113 ± 0.393
4.227AspPro: 4.227 ± 0.552
2.921AspGln: 2.921 ± 0.395
4.848AspArg: 4.848 ± 0.708
3.17AspSer: 3.17 ± 0.48
4.848AspThr: 4.848 ± 0.566
4.973AspVal: 4.973 ± 0.713
1.678AspTrp: 1.678 ± 0.25
0.87AspTyr: 0.87 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
5.097GluAla: 5.097 ± 0.63
0.373GluCys: 0.373 ± 0.147
2.797GluAsp: 2.797 ± 0.457
1.927GluGlu: 1.927 ± 0.399
1.181GluPhe: 1.181 ± 0.237
2.673GluGly: 2.673 ± 0.421
1.305GluHis: 1.305 ± 0.315
2.176GluIle: 2.176 ± 0.319
2.051GluLys: 2.051 ± 0.44
6.029GluLeu: 6.029 ± 0.559
1.305GluMet: 1.305 ± 0.261
1.865GluAsn: 1.865 ± 0.333
3.481GluPro: 3.481 ± 0.819
2.486GluGln: 2.486 ± 0.376
4.6GluArg: 4.6 ± 0.494
3.046GluSer: 3.046 ± 0.527
3.978GluThr: 3.978 ± 0.617
3.543GluVal: 3.543 ± 0.582
1.243GluTrp: 1.243 ± 0.253
1.43GluTyr: 1.43 ± 0.279
0.0GluXaa: 0.0 ± 0.0
Phe
3.978PheAla: 3.978 ± 0.465
0.124PheCys: 0.124 ± 0.094
2.176PheAsp: 2.176 ± 0.305
1.554PheGlu: 1.554 ± 0.312
0.311PhePhe: 0.311 ± 0.145
2.238PheGly: 2.238 ± 0.513
0.186PheHis: 0.186 ± 0.095
0.746PheIle: 0.746 ± 0.282
0.932PheLys: 0.932 ± 0.414
1.119PheLeu: 1.119 ± 0.248
0.497PheMet: 0.497 ± 0.136
0.497PheAsn: 0.497 ± 0.158
1.119PhePro: 1.119 ± 0.277
0.684PheGln: 0.684 ± 0.235
1.927PheArg: 1.927 ± 0.309
1.927PheSer: 1.927 ± 0.311
1.927PheThr: 1.927 ± 0.348
2.424PheVal: 2.424 ± 0.397
0.373PheTrp: 0.373 ± 0.147
0.497PheTyr: 0.497 ± 0.165
0.0PheXaa: 0.0 ± 0.0
Gly
9.945GlyAla: 9.945 ± 0.801
0.932GlyCys: 0.932 ± 0.282
7.832GlyAsp: 7.832 ± 0.95
3.481GlyGlu: 3.481 ± 0.555
3.046GlyPhe: 3.046 ± 0.442
9.635GlyGly: 9.635 ± 1.42
2.859GlyHis: 2.859 ± 0.532
5.408GlyIle: 5.408 ± 0.954
2.921GlyLys: 2.921 ± 0.38
7.148GlyLeu: 7.148 ± 0.987
1.305GlyMet: 1.305 ± 0.256
2.113GlyAsn: 2.113 ± 0.356
4.165GlyPro: 4.165 ± 0.538
3.17GlyGln: 3.17 ± 0.328
6.837GlyArg: 6.837 ± 0.816
5.656GlySer: 5.656 ± 0.604
6.962GlyThr: 6.962 ± 0.761
6.775GlyVal: 6.775 ± 0.778
1.43GlyTrp: 1.43 ± 0.31
2.424GlyTyr: 2.424 ± 0.431
0.0GlyXaa: 0.0 ± 0.0
His
2.797HisAla: 2.797 ± 0.486
0.124HisCys: 0.124 ± 0.085
1.803HisAsp: 1.803 ± 0.434
0.932HisGlu: 0.932 ± 0.226
0.435HisPhe: 0.435 ± 0.189
1.492HisGly: 1.492 ± 0.345
0.808HisHis: 0.808 ± 0.274
0.995HisIle: 0.995 ± 0.269
0.746HisLys: 0.746 ± 0.202
1.865HisLeu: 1.865 ± 0.373
0.186HisMet: 0.186 ± 0.105
0.622HisAsn: 0.622 ± 0.167
1.554HisPro: 1.554 ± 0.359
0.186HisGln: 0.186 ± 0.107
1.554HisArg: 1.554 ± 0.489
0.559HisSer: 0.559 ± 0.157
1.305HisThr: 1.305 ± 0.29
1.74HisVal: 1.74 ± 0.346
0.186HisTrp: 0.186 ± 0.099
0.373HisTyr: 0.373 ± 0.134
0.0HisXaa: 0.0 ± 0.0
Ile
6.962IleAla: 6.962 ± 0.635
0.249IleCys: 0.249 ± 0.121
3.481IleAsp: 3.481 ± 0.513
3.357IleGlu: 3.357 ± 0.407
0.559IlePhe: 0.559 ± 0.217
3.978IleGly: 3.978 ± 0.469
0.559IleHis: 0.559 ± 0.167
0.559IleIle: 0.559 ± 0.198
1.74IleLys: 1.74 ± 0.741
2.424IleLeu: 2.424 ± 0.283
0.559IleMet: 0.559 ± 0.146
0.87IleAsn: 0.87 ± 0.229
1.927IlePro: 1.927 ± 0.372
0.746IleGln: 0.746 ± 0.212
3.481IleArg: 3.481 ± 0.434
1.243IleSer: 1.243 ± 0.261
2.921IleThr: 2.921 ± 0.392
2.984IleVal: 2.984 ± 0.43
1.057IleTrp: 1.057 ± 0.297
0.373IleTyr: 0.373 ± 0.138
0.0IleXaa: 0.0 ± 0.0
Lys
3.792LysAla: 3.792 ± 0.704
0.311LysCys: 0.311 ± 0.146
1.43LysAsp: 1.43 ± 0.362
1.057LysGlu: 1.057 ± 0.256
0.87LysPhe: 0.87 ± 0.22
2.113LysGly: 2.113 ± 0.414
0.559LysHis: 0.559 ± 0.149
1.616LysIle: 1.616 ± 0.332
0.622LysLys: 0.622 ± 0.251
2.797LysLeu: 2.797 ± 0.657
0.87LysMet: 0.87 ± 0.201
1.057LysAsn: 1.057 ± 0.407
1.803LysPro: 1.803 ± 0.299
0.559LysGln: 0.559 ± 0.202
2.3LysArg: 2.3 ± 0.472
1.927LysSer: 1.927 ± 0.363
1.678LysThr: 1.678 ± 0.348
3.046LysVal: 3.046 ± 0.559
0.497LysTrp: 0.497 ± 0.175
0.559LysTyr: 0.559 ± 0.185
0.0LysXaa: 0.0 ± 0.0
Leu
10.07LeuAla: 10.07 ± 0.714
0.622LeuCys: 0.622 ± 0.213
5.719LeuAsp: 5.719 ± 0.582
6.029LeuGlu: 6.029 ± 0.574
2.176LeuPhe: 2.176 ± 0.294
6.589LeuGly: 6.589 ± 0.669
1.181LeuHis: 1.181 ± 0.231
3.108LeuIle: 3.108 ± 0.415
1.119LeuLys: 1.119 ± 0.227
6.154LeuLeu: 6.154 ± 0.655
1.989LeuMet: 1.989 ± 0.383
2.611LeuAsn: 2.611 ± 0.411
4.289LeuPro: 4.289 ± 0.521
2.673LeuGln: 2.673 ± 0.428
6.464LeuArg: 6.464 ± 0.835
4.6LeuSer: 4.6 ± 0.481
5.346LeuThr: 5.346 ± 0.677
5.905LeuVal: 5.905 ± 0.654
1.616LeuTrp: 1.616 ± 0.254
1.678LeuTyr: 1.678 ± 0.373
0.0LeuXaa: 0.0 ± 0.0
Met
2.486MetAla: 2.486 ± 0.363
0.186MetCys: 0.186 ± 0.108
0.311MetAsp: 0.311 ± 0.166
0.559MetGlu: 0.559 ± 0.205
1.367MetPhe: 1.367 ± 0.315
1.119MetGly: 1.119 ± 0.282
0.249MetHis: 0.249 ± 0.135
0.684MetIle: 0.684 ± 0.186
0.435MetLys: 0.435 ± 0.154
0.995MetLeu: 0.995 ± 0.282
0.186MetMet: 0.186 ± 0.092
0.497MetAsn: 0.497 ± 0.176
1.554MetPro: 1.554 ± 0.298
0.622MetGln: 0.622 ± 0.179
0.808MetArg: 0.808 ± 0.197
1.678MetSer: 1.678 ± 0.354
2.486MetThr: 2.486 ± 0.39
1.554MetVal: 1.554 ± 0.276
0.373MetTrp: 0.373 ± 0.116
0.186MetTyr: 0.186 ± 0.105
0.0MetXaa: 0.0 ± 0.0
Asn
2.673AsnAla: 2.673 ± 0.284
0.186AsnCys: 0.186 ± 0.103
1.554AsnAsp: 1.554 ± 0.293
1.119AsnGlu: 1.119 ± 0.256
0.746AsnPhe: 0.746 ± 0.213
3.605AsnGly: 3.605 ± 0.433
0.435AsnHis: 0.435 ± 0.234
0.87AsnIle: 0.87 ± 0.271
1.367AsnLys: 1.367 ± 0.35
2.611AsnLeu: 2.611 ± 0.456
0.435AsnMet: 0.435 ± 0.154
0.746AsnAsn: 0.746 ± 0.195
1.803AsnPro: 1.803 ± 0.314
0.995AsnGln: 0.995 ± 0.243
1.865AsnArg: 1.865 ± 0.297
0.87AsnSer: 0.87 ± 0.184
1.554AsnThr: 1.554 ± 0.272
2.113AsnVal: 2.113 ± 0.496
0.808AsnTrp: 0.808 ± 0.2
0.684AsnTyr: 0.684 ± 0.215
0.0AsnXaa: 0.0 ± 0.0
Pro
9.386ProAla: 9.386 ± 1.162
0.684ProCys: 0.684 ± 0.226
4.351ProAsp: 4.351 ± 0.62
2.859ProGlu: 2.859 ± 0.694
0.87ProPhe: 0.87 ± 0.228
5.47ProGly: 5.47 ± 0.797
1.492ProHis: 1.492 ± 0.309
2.051ProIle: 2.051 ± 0.375
2.113ProLys: 2.113 ± 0.404
3.17ProLeu: 3.17 ± 0.483
0.87ProMet: 0.87 ± 0.233
1.803ProAsn: 1.803 ± 0.31
2.859ProPro: 2.859 ± 0.513
2.238ProGln: 2.238 ± 0.441
3.17ProArg: 3.17 ± 0.553
2.424ProSer: 2.424 ± 0.304
4.6ProThr: 4.6 ± 0.594
4.351ProVal: 4.351 ± 0.435
1.181ProTrp: 1.181 ± 0.253
1.243ProTyr: 1.243 ± 0.374
0.0ProXaa: 0.0 ± 0.0
Gln
4.848GlnAla: 4.848 ± 0.79
0.186GlnCys: 0.186 ± 0.115
1.057GlnAsp: 1.057 ± 0.265
0.995GlnGlu: 0.995 ± 0.192
1.243GlnPhe: 1.243 ± 0.416
2.176GlnGly: 2.176 ± 0.473
0.87GlnHis: 0.87 ± 0.179
1.678GlnIle: 1.678 ± 0.318
0.87GlnLys: 0.87 ± 0.22
3.108GlnLeu: 3.108 ± 0.45
0.559GlnMet: 0.559 ± 0.154
0.808GlnAsn: 0.808 ± 0.263
2.176GlnPro: 2.176 ± 0.376
1.181GlnGln: 1.181 ± 0.312
1.74GlnArg: 1.74 ± 0.34
1.305GlnSer: 1.305 ± 0.27
1.678GlnThr: 1.678 ± 0.329
3.108GlnVal: 3.108 ± 0.451
0.932GlnTrp: 0.932 ± 0.304
0.373GlnTyr: 0.373 ± 0.162
0.0GlnXaa: 0.0 ± 0.0
Arg
9.262ArgAla: 9.262 ± 0.848
0.995ArgCys: 0.995 ± 0.32
5.656ArgAsp: 5.656 ± 0.609
4.475ArgGlu: 4.475 ± 0.495
1.989ArgPhe: 1.989 ± 0.467
5.594ArgGly: 5.594 ± 0.608
1.678ArgHis: 1.678 ± 0.462
3.854ArgIle: 3.854 ± 0.523
1.927ArgLys: 1.927 ± 0.337
6.278ArgLeu: 6.278 ± 0.55
1.492ArgMet: 1.492 ± 0.244
2.362ArgAsn: 2.362 ± 0.419
3.792ArgPro: 3.792 ± 0.72
1.865ArgGln: 1.865 ± 0.358
7.21ArgArg: 7.21 ± 0.899
2.673ArgSer: 2.673 ± 0.472
3.792ArgThr: 3.792 ± 0.446
5.221ArgVal: 5.221 ± 0.521
2.113ArgTrp: 2.113 ± 0.413
1.554ArgTyr: 1.554 ± 0.419
0.0ArgXaa: 0.0 ± 0.0
Ser
6.278SerAla: 6.278 ± 0.652
0.311SerCys: 0.311 ± 0.137
2.673SerAsp: 2.673 ± 0.477
2.362SerGlu: 2.362 ± 0.417
1.616SerPhe: 1.616 ± 0.358
5.905SerGly: 5.905 ± 0.53
1.057SerHis: 1.057 ± 0.255
1.74SerIle: 1.74 ± 0.319
2.051SerLys: 2.051 ± 0.479
4.102SerLeu: 4.102 ± 0.446
1.119SerMet: 1.119 ± 0.235
0.932SerAsn: 0.932 ± 0.205
2.735SerPro: 2.735 ± 0.494
1.616SerGln: 1.616 ± 0.294
3.046SerArg: 3.046 ± 0.43
3.667SerSer: 3.667 ± 0.818
4.786SerThr: 4.786 ± 0.587
3.667SerVal: 3.667 ± 0.453
0.932SerTrp: 0.932 ± 0.237
0.995SerTyr: 0.995 ± 0.223
0.0SerXaa: 0.0 ± 0.0
Thr
8.267ThrAla: 8.267 ± 0.696
1.057ThrCys: 1.057 ± 0.313
5.283ThrAsp: 5.283 ± 0.574
3.419ThrGlu: 3.419 ± 0.768
1.74ThrPhe: 1.74 ± 0.258
7.148ThrGly: 7.148 ± 0.701
1.243ThrHis: 1.243 ± 0.256
2.797ThrIle: 2.797 ± 0.403
1.927ThrLys: 1.927 ± 0.456
5.159ThrLeu: 5.159 ± 0.646
0.808ThrMet: 0.808 ± 0.211
1.243ThrAsn: 1.243 ± 0.204
4.102ThrPro: 4.102 ± 0.527
1.554ThrGln: 1.554 ± 0.361
5.035ThrArg: 5.035 ± 0.557
3.792ThrSer: 3.792 ± 0.553
5.035ThrThr: 5.035 ± 0.713
7.335ThrVal: 7.335 ± 0.711
0.87ThrTrp: 0.87 ± 0.223
1.305ThrTyr: 1.305 ± 0.258
0.0ThrXaa: 0.0 ± 0.0
Val
10.753ValAla: 10.753 ± 0.906
0.559ValCys: 0.559 ± 0.192
7.335ValAsp: 7.335 ± 0.675
5.283ValGlu: 5.283 ± 0.612
1.554ValPhe: 1.554 ± 0.327
6.029ValGly: 6.029 ± 0.744
0.932ValHis: 0.932 ± 0.246
2.424ValIle: 2.424 ± 0.38
1.492ValLys: 1.492 ± 0.306
6.651ValLeu: 6.651 ± 0.685
1.119ValMet: 1.119 ± 0.321
1.989ValAsn: 1.989 ± 0.332
5.283ValPro: 5.283 ± 0.464
1.865ValGln: 1.865 ± 0.367
6.091ValArg: 6.091 ± 0.749
4.413ValSer: 4.413 ± 0.65
5.905ValThr: 5.905 ± 0.593
7.583ValVal: 7.583 ± 0.718
1.678ValTrp: 1.678 ± 0.29
0.995ValTyr: 0.995 ± 0.303
0.0ValXaa: 0.0 ± 0.0
Trp
1.678TrpAla: 1.678 ± 0.304
0.435TrpCys: 0.435 ± 0.147
1.243TrpAsp: 1.243 ± 0.348
0.559TrpGlu: 0.559 ± 0.183
0.559TrpPhe: 0.559 ± 0.181
1.119TrpGly: 1.119 ± 0.245
0.435TrpHis: 0.435 ± 0.156
0.932TrpIle: 0.932 ± 0.241
0.497TrpLys: 0.497 ± 0.164
2.611TrpLeu: 2.611 ± 0.346
0.559TrpMet: 0.559 ± 0.167
0.622TrpAsn: 0.622 ± 0.201
1.367TrpPro: 1.367 ± 0.256
0.746TrpGln: 0.746 ± 0.265
1.181TrpArg: 1.181 ± 0.231
1.989TrpSer: 1.989 ± 0.43
0.995TrpThr: 0.995 ± 0.25
1.492TrpVal: 1.492 ± 0.404
0.373TrpTrp: 0.373 ± 0.151
0.808TrpTyr: 0.808 ± 0.279
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.238TyrAla: 2.238 ± 0.379
0.186TyrCys: 0.186 ± 0.114
1.554TyrAsp: 1.554 ± 0.34
0.622TyrGlu: 0.622 ± 0.182
0.249TyrPhe: 0.249 ± 0.122
1.927TyrGly: 1.927 ± 0.361
0.311TyrHis: 0.311 ± 0.151
0.684TyrIle: 0.684 ± 0.268
0.435TyrLys: 0.435 ± 0.162
1.243TyrLeu: 1.243 ± 0.323
0.497TyrMet: 0.497 ± 0.145
0.249TyrAsn: 0.249 ± 0.126
0.684TyrPro: 0.684 ± 0.24
0.684TyrGln: 0.684 ± 0.264
2.238TyrArg: 2.238 ± 0.488
0.808TyrSer: 0.808 ± 0.191
1.554TyrThr: 1.554 ± 0.452
1.989TyrVal: 1.989 ± 0.365
0.373TyrTrp: 0.373 ± 0.146
0.497TyrTyr: 0.497 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (16089 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski