Amino acid dipeptide frequency for Arthrobacter phage Gordon

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
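As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes (with X standing for Xaa), and the choice of the total pair count as denominator are all assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: the denominator (total number of counted pairs)
    is an assumption, not something the source page specifies.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the phage proteome.
freqs = dipeptide_per_mille(["MAAK", "AAG"])
print(freqs["AA"])  # 2 of 5 pairs -> 400.0
```

Counting pairs per protein keeps the last residue of one protein from being paired with the first residue of the next.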

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones in blue.
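The uniform baseline can be worked out directly. The sketch below computes it for both alphabets and classifies three values taken from the table. The names `EXPECTED_20`, `EXPECTED_21` and `classify` are hypothetical, and the page does not say which baseline (or whether the error margin) drives the red/blue colouring, so the default threshold here is an assumption.

```python
# Uniform expectation in per mille: 1000 / (alphabet size squared).
EXPECTED_20 = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25%
EXPECTED_21 = 1000.0 / (21 * 21)  # about 2.268 per mille when Xaa is included

def classify(per_mille, expected=EXPECTED_20):
    """Label a frequency relative to the uniform baseline (assumed rule)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"

# Values copied from the table on this page (per mille).
for pair, value in [("AlaAla", 8.337), ("CysCys", 0.0), ("AspThr", 2.468)]:
    print(pair, classify(value), classify(value, EXPECTED_21))
```

AspThr shows why the choice of baseline matters: 2.468‰ falls below 2.5‰ but above 1000/441‰.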

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
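For illustration, the unit conversion amounts to the following. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 8.337  # AlaAla entry from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # roughly 0.008337
print(per_mille_to_percent(ala_ala))   # roughly 0.8337
```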

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 8.337 ± 1.327
AlaCys: 0.329 ± 0.132
AlaAsp: 3.894 ± 0.475
AlaGlu: 6.527 ± 0.615
AlaPhe: 3.455 ± 0.478
AlaGly: 7.075 ± 0.911
AlaHis: 1.591 ± 0.305
AlaIle: 5.759 ± 0.595
AlaLys: 5.21 ± 0.888
AlaLeu: 6.801 ± 1.037
AlaMet: 2.687 ± 0.509
AlaAsn: 4.059 ± 0.463
AlaPro: 2.962 ± 0.43
AlaGln: 2.687 ± 0.383
AlaArg: 4.497 ± 0.562
AlaSer: 4.717 ± 0.526
AlaThr: 4.936 ± 0.547
AlaVal: 6.307 ± 0.721
AlaTrp: 1.481 ± 0.346
AlaTyr: 2.523 ± 0.283
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.439 ± 0.178
CysCys: 0.0 ± 0.0
CysAsp: 0.548 ± 0.147
CysGlu: 0.494 ± 0.167
CysPhe: 0.384 ± 0.204
CysGly: 0.274 ± 0.13
CysHis: 0.165 ± 0.094
CysIle: 0.384 ± 0.133
CysLys: 0.219 ± 0.131
CysLeu: 0.603 ± 0.2
CysMet: 0.055 ± 0.048
CysAsn: 0.055 ± 0.046
CysPro: 0.329 ± 0.16
CysGln: 0.11 ± 0.082
CysArg: 0.165 ± 0.093
CysSer: 0.274 ± 0.115
CysThr: 0.11 ± 0.089
CysVal: 0.494 ± 0.146
CysTrp: 0.055 ± 0.062
CysTyr: 0.055 ± 0.052
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.923 ± 0.591
AspCys: 0.274 ± 0.137
AspAsp: 4.881 ± 0.653
AspGlu: 4.168 ± 0.479
AspPhe: 2.687 ± 0.437
AspGly: 5.046 ± 0.533
AspHis: 0.932 ± 0.27
AspIle: 3.949 ± 0.392
AspLys: 3.071 ± 0.454
AspLeu: 5.375 ± 0.545
AspMet: 1.81 ± 0.258
AspAsn: 2.633 ± 0.446
AspPro: 2.523 ± 0.339
AspGln: 2.139 ± 0.363
AspArg: 2.797 ± 0.461
AspSer: 2.852 ± 0.412
AspThr: 2.468 ± 0.334
AspVal: 4.059 ± 0.553
AspTrp: 0.932 ± 0.202
AspTyr: 1.371 ± 0.231
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.772 ± 0.515
GluCys: 0.548 ± 0.167
GluAsp: 3.236 ± 0.42
GluGlu: 3.949 ± 0.526
GluPhe: 2.962 ± 0.368
GluGly: 4.168 ± 0.432
GluHis: 1.645 ± 0.392
GluIle: 3.455 ± 0.359
GluLys: 3.565 ± 0.529
GluLeu: 5.485 ± 0.639
GluMet: 2.084 ± 0.34
GluAsn: 2.249 ± 0.318
GluPro: 2.578 ± 0.394
GluGln: 2.742 ± 0.358
GluArg: 3.51 ± 0.477
GluSer: 3.017 ± 0.397
GluThr: 3.784 ± 0.455
GluVal: 5.814 ± 0.713
GluTrp: 1.645 ± 0.34
GluTyr: 2.029 ± 0.343
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.742 ± 0.423
PheCys: 0.329 ± 0.144
PheAsp: 2.742 ± 0.388
PheGlu: 2.468 ± 0.347
PhePhe: 1.207 ± 0.283
PheGly: 2.742 ± 0.339
PheHis: 0.439 ± 0.154
PheIle: 2.249 ± 0.331
PheLys: 1.974 ± 0.298
PheLeu: 2.962 ± 0.333
PheMet: 0.768 ± 0.212
PheAsn: 2.523 ± 0.35
PhePro: 1.591 ± 0.252
PheGln: 0.878 ± 0.183
PheArg: 1.92 ± 0.323
PheSer: 2.633 ± 0.406
PheThr: 2.194 ± 0.36
PheVal: 2.413 ± 0.34
PheTrp: 0.603 ± 0.205
PheTyr: 1.536 ± 0.295
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.843 ± 0.929
GlyCys: 0.439 ± 0.176
GlyAsp: 4.607 ± 0.596
GlyGlu: 4.278 ± 0.536
GlyPhe: 2.852 ± 0.384
GlyGly: 5.101 ± 0.884
GlyHis: 1.316 ± 0.288
GlyIle: 4.168 ± 0.596
GlyLys: 4.662 ± 0.591
GlyLeu: 5.978 ± 0.901
GlyMet: 2.962 ± 0.634
GlyAsn: 3.51 ± 0.674
GlyPro: 2.523 ± 0.572
GlyGln: 2.084 ± 0.292
GlyArg: 2.742 ± 0.408
GlySer: 5.265 ± 0.763
GlyThr: 6.307 ± 0.837
GlyVal: 5.759 ± 0.572
GlyTrp: 1.645 ± 0.362
GlyTyr: 2.523 ± 0.414
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.591 ± 0.498
HisCys: 0.274 ± 0.112
HisAsp: 0.768 ± 0.297
HisGlu: 1.371 ± 0.301
HisPhe: 0.494 ± 0.191
HisGly: 1.152 ± 0.278
HisHis: 0.603 ± 0.157
HisIle: 1.481 ± 0.302
HisLys: 0.987 ± 0.242
HisLeu: 1.207 ± 0.213
HisMet: 0.384 ± 0.162
HisAsn: 1.097 ± 0.271
HisPro: 0.932 ± 0.198
HisGln: 0.713 ± 0.206
HisArg: 0.768 ± 0.267
HisSer: 1.536 ± 0.294
HisThr: 0.987 ± 0.256
HisVal: 1.426 ± 0.317
HisTrp: 0.219 ± 0.103
HisTyr: 0.878 ± 0.25
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.375 ± 0.87
IleCys: 0.219 ± 0.133
IleAsp: 4.004 ± 0.507
IleGlu: 4.113 ± 0.49
IlePhe: 1.645 ± 0.303
IleGly: 3.51 ± 0.816
IleHis: 1.097 ± 0.271
IleIle: 3.181 ± 0.478
IleLys: 3.949 ± 0.571
IleLeu: 5.101 ± 0.457
IleMet: 2.029 ± 0.497
IleAsn: 3.126 ± 0.403
IlePro: 2.797 ± 0.485
IleGln: 2.304 ± 0.329
IleArg: 2.194 ± 0.373
IleSer: 4.168 ± 0.449
IleThr: 4.552 ± 0.596
IleVal: 4.552 ± 0.531
IleTrp: 0.932 ± 0.221
IleTyr: 1.974 ± 0.333
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.759 ± 0.703
LysCys: 0.384 ± 0.142
LysAsp: 2.907 ± 0.585
LysGlu: 4.333 ± 0.643
LysPhe: 2.578 ± 0.359
LysGly: 4.881 ± 0.572
LysHis: 1.371 ± 0.368
LysIle: 3.4 ± 0.54
LysLys: 5.485 ± 0.884
LysLeu: 4.881 ± 0.619
LysMet: 2.413 ± 0.374
LysAsn: 1.974 ± 0.364
LysPro: 2.797 ± 0.442
LysGln: 2.084 ± 0.406
LysArg: 3.565 ± 0.444
LysSer: 3.51 ± 0.46
LysThr: 3.51 ± 0.476
LysVal: 3.565 ± 0.424
LysTrp: 0.878 ± 0.236
LysTyr: 2.194 ± 0.357
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.13 ± 0.701
LeuCys: 0.219 ± 0.13
LeuAsp: 5.32 ± 0.481
LeuGlu: 4.497 ± 0.529
LeuPhe: 2.413 ± 0.416
LeuGly: 5.649 ± 1.15
LeuHis: 1.371 ± 0.289
LeuIle: 6.143 ± 0.552
LeuLys: 5.21 ± 0.75
LeuLeu: 5.101 ± 0.775
LeuMet: 1.974 ± 0.322
LeuAsn: 3.784 ± 0.39
LeuPro: 3.346 ± 0.379
LeuGln: 2.633 ± 0.359
LeuArg: 4.333 ± 0.592
LeuSer: 4.662 ± 0.657
LeuThr: 5.32 ± 0.498
LeuVal: 6.198 ± 0.627
LeuTrp: 0.823 ± 0.191
LeuTyr: 2.742 ± 0.396
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.017 ± 0.361
MetCys: 0.11 ± 0.08
MetAsp: 1.92 ± 0.298
MetGlu: 1.316 ± 0.274
MetPhe: 1.042 ± 0.209
MetGly: 1.865 ± 0.541
MetHis: 0.603 ± 0.175
MetIle: 1.7 ± 0.467
MetLys: 1.536 ± 0.266
MetLeu: 2.687 ± 0.336
MetMet: 0.548 ± 0.223
MetAsn: 0.878 ± 0.168
MetPro: 1.152 ± 0.222
MetGln: 0.768 ± 0.303
MetArg: 1.536 ± 0.334
MetSer: 2.358 ± 0.424
MetThr: 2.084 ± 0.344
MetVal: 1.81 ± 0.272
MetTrp: 0.329 ± 0.117
MetTyr: 0.932 ± 0.224
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.346 ± 0.421
AsnCys: 0.219 ± 0.114
AsnAsp: 2.742 ± 0.342
AsnGlu: 2.962 ± 0.47
AsnPhe: 1.92 ± 0.305
AsnGly: 4.826 ± 0.712
AsnHis: 0.823 ± 0.251
AsnIle: 3.017 ± 0.529
AsnLys: 2.797 ± 0.325
AsnLeu: 3.017 ± 0.498
AsnMet: 1.316 ± 0.256
AsnAsn: 2.852 ± 0.339
AsnPro: 2.194 ± 0.283
AsnGln: 1.755 ± 0.32
AsnArg: 2.523 ± 0.478
AsnSer: 3.236 ± 0.432
AsnThr: 3.071 ± 0.4
AsnVal: 2.304 ± 0.342
AsnTrp: 0.932 ± 0.276
AsnTyr: 1.645 ± 0.325
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.852 ± 0.415
ProCys: 0.384 ± 0.143
ProAsp: 3.071 ± 0.427
ProGlu: 3.455 ± 0.428
ProPhe: 0.932 ± 0.2
ProGly: 3.51 ± 0.406
ProHis: 1.042 ± 0.194
ProIle: 2.194 ± 0.289
ProLys: 2.358 ± 0.316
ProLeu: 2.358 ± 0.279
ProMet: 1.097 ± 0.261
ProAsn: 2.304 ± 0.412
ProPro: 1.865 ± 0.428
ProGln: 1.207 ± 0.31
ProArg: 2.194 ± 0.339
ProSer: 2.578 ± 0.372
ProThr: 3.346 ± 0.519
ProVal: 3.675 ± 0.444
ProTrp: 0.329 ± 0.144
ProTyr: 1.097 ± 0.282
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.962 ± 0.453
GlnCys: 0.165 ± 0.103
GlnAsp: 1.81 ± 0.255
GlnGlu: 2.194 ± 0.393
GlnPhe: 0.823 ± 0.189
GlnGly: 2.578 ± 0.414
GlnHis: 0.548 ± 0.171
GlnIle: 2.468 ± 0.294
GlnLys: 2.249 ± 0.37
GlnLeu: 3.4 ± 0.427
GlnMet: 0.823 ± 0.177
GlnAsn: 1.371 ± 0.3
GlnPro: 1.7 ± 0.286
GlnGln: 2.194 ± 0.302
GlnArg: 1.81 ± 0.335
GlnSer: 1.591 ± 0.278
GlnThr: 1.974 ± 0.357
GlnVal: 2.139 ± 0.426
GlnTrp: 0.658 ± 0.207
GlnTyr: 1.481 ± 0.22
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.236 ± 0.447
ArgCys: 0.329 ± 0.153
ArgAsp: 2.249 ± 0.322
ArgGlu: 4.113 ± 0.568
ArgPhe: 1.755 ± 0.367
ArgGly: 2.962 ± 0.451
ArgHis: 1.152 ± 0.294
ArgIle: 3.071 ± 0.389
ArgLys: 4.113 ± 0.531
ArgLeu: 4.552 ± 0.603
ArgMet: 0.768 ± 0.223
ArgAsn: 3.126 ± 0.386
ArgPro: 2.249 ± 0.386
ArgGln: 1.974 ± 0.441
ArgArg: 3.839 ± 0.664
ArgSer: 2.797 ± 0.353
ArgThr: 3.017 ± 0.435
ArgVal: 4.333 ± 0.493
ArgTrp: 0.603 ± 0.184
ArgTyr: 1.974 ± 0.378
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.826 ± 0.499
SerCys: 0.165 ± 0.103
SerAsp: 3.017 ± 0.359
SerGlu: 3.017 ± 0.403
SerPhe: 2.468 ± 0.402
SerGly: 6.252 ± 0.658
SerHis: 1.426 ± 0.259
SerIle: 3.51 ± 0.472
SerLys: 4.223 ± 0.565
SerLeu: 4.717 ± 0.493
SerMet: 1.7 ± 0.247
SerAsn: 3.126 ± 0.39
SerPro: 2.578 ± 0.361
SerGln: 2.084 ± 0.285
SerArg: 3.236 ± 0.34
SerSer: 4.333 ± 0.829
SerThr: 3.4 ± 0.429
SerVal: 3.565 ± 0.469
SerTrp: 1.261 ± 0.241
SerTyr: 2.249 ± 0.369
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.814 ± 0.59
ThrCys: 0.274 ± 0.176
ThrAsp: 3.839 ± 0.431
ThrGlu: 3.565 ± 0.459
ThrPhe: 2.797 ± 0.323
ThrGly: 5.759 ± 0.695
ThrHis: 0.768 ± 0.189
ThrIle: 3.51 ± 0.473
ThrLys: 3.949 ± 0.441
ThrLeu: 5.978 ± 0.458
ThrMet: 1.81 ± 0.314
ThrAsn: 2.907 ± 0.335
ThrPro: 3.126 ± 0.432
ThrGln: 1.426 ± 0.296
ThrArg: 2.578 ± 0.388
ThrSer: 2.468 ± 0.343
ThrThr: 5.594 ± 0.652
ThrVal: 4.552 ± 0.586
ThrTrp: 0.932 ± 0.214
ThrTyr: 2.249 ± 0.333
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.978 ± 0.633
ValCys: 0.329 ± 0.143
ValAsp: 4.552 ± 0.502
ValGlu: 4.223 ± 0.577
ValPhe: 2.358 ± 0.333
ValGly: 5.101 ± 0.574
ValHis: 1.207 ± 0.313
ValIle: 4.607 ± 0.435
ValLys: 4.278 ± 0.45
ValLeu: 5.375 ± 0.6
ValMet: 1.865 ± 0.357
ValAsn: 3.346 ± 0.366
ValPro: 2.852 ± 0.4
ValGln: 2.468 ± 0.399
ValArg: 4.717 ± 0.595
ValSer: 5.704 ± 0.749
ValThr: 3.62 ± 0.42
ValVal: 5.155 ± 0.601
ValTrp: 1.042 ± 0.202
ValTyr: 2.742 ± 0.524
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.878 ± 0.263
TrpCys: 0.0 ± 0.0
TrpAsp: 1.536 ± 0.253
TrpGlu: 0.658 ± 0.195
TrpPhe: 0.548 ± 0.207
TrpGly: 0.987 ± 0.202
TrpHis: 0.274 ± 0.129
TrpIle: 0.823 ± 0.276
TrpLys: 0.658 ± 0.19
TrpLeu: 1.207 ± 0.254
TrpMet: 0.603 ± 0.154
TrpAsn: 0.878 ± 0.245
TrpPro: 0.439 ± 0.184
TrpGln: 0.932 ± 0.212
TrpArg: 1.207 ± 0.316
TrpSer: 1.097 ± 0.239
TrpThr: 1.207 ± 0.22
TrpVal: 0.932 ± 0.238
TrpTrp: 0.165 ± 0.099
TrpTyr: 0.768 ± 0.216
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.017 ± 0.446
TyrCys: 0.219 ± 0.108
TyrAsp: 2.304 ± 0.517
TyrGlu: 1.536 ± 0.29
TyrPhe: 1.591 ± 0.299
TyrGly: 3.126 ± 0.453
TyrHis: 0.494 ± 0.163
TyrIle: 1.755 ± 0.291
TyrLys: 1.974 ± 0.37
TyrLeu: 2.084 ± 0.36
TyrMet: 0.439 ± 0.159
TyrAsn: 1.7 ± 0.303
TyrPro: 1.316 ± 0.235
TyrGln: 1.755 ± 0.313
TyrArg: 2.084 ± 0.294
TyrSer: 2.358 ± 0.37
TyrThr: 2.413 ± 0.387
TyrVal: 2.468 ± 0.383
TyrTrp: 0.329 ± 0.129
TyrTyr: 1.316 ± 0.254
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 89 proteins (18234 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
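A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread across replicates is reported. This is an illustration only. The function names, the one-letter codes, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions; the page states only that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not taken from the phage proteome.
toy = ["MAAKAA", "MGGSAA", "MKLV", "MAAG"]
print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residue pairs keeps each protein's internal composition intact, so the error reflects variation between proteins.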

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski