Amino acid dipeptide frequency for Guinea pig adenovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
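The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function name `dipeptide_per_mille`, the one-letter alphabet, and the toy sequences are invented here, and dipeptides are counted as overlapping pairs that never span two proteins, which may differ from how the table was actually computed.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before counting.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs; pairs never span two proteins (assumption).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / 400

# Toy sequences, invented purely for illustration.
toy = ["MAARRPPAL", "MKAARLLPP"]
freqs = dipeptide_per_mille(toy)
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED)
under = sorted(dp for dp, f in freqs.items() if f < EXPECTED)
print(len(freqs), len(over), len(under))
```

Comparing each value against the uniform 2.5‰ baseline mirrors the over/under marking described in the text; a baseline derived from the single amino acid frequencies would be a different, stricter comparison.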

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
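As a quick check of the unit, the AlaAla value from the table can be converted as follows. The helper name `per_mille_to_fraction` is made up for this sketch.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 18.046  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100
print(f"{ala_ala} per mille = {fraction:.6f} = {percent:.4f}%")
```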

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.046 ± 2.206
AlaCys: 1.325 ± 0.389
AlaAsp: 5.298 ± 0.684
AlaGlu: 3.974 ± 0.662
AlaPhe: 3.311 ± 0.587
AlaGly: 7.202 ± 0.9
AlaHis: 2.897 ± 0.703
AlaIle: 1.904 ± 0.453
AlaLys: 2.235 ± 0.535
AlaLeu: 8.278 ± 0.976
AlaMet: 2.235 ± 0.365
AlaAsn: 2.318 ± 0.507
AlaPro: 7.285 ± 0.952
AlaGln: 3.477 ± 0.48
AlaArg: 9.437 ± 0.877
AlaSer: 8.195 ± 0.768
AlaThr: 6.043 ± 0.687
AlaVal: 9.272 ± 1.218
AlaTrp: 0.911 ± 0.235
AlaTyr: 2.401 ± 0.451
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.325 ± 0.473
CysCys: 0.662 ± 0.322
CysAsp: 1.407 ± 0.419
CysGlu: 0.911 ± 0.248
CysPhe: 1.076 ± 0.382
CysGly: 1.738 ± 0.548
CysHis: 0.579 ± 0.227
CysIle: 0.414 ± 0.195
CysLys: 0.248 ± 0.135
CysLeu: 1.821 ± 0.433
CysMet: 0.497 ± 0.165
CysAsn: 0.662 ± 0.232
CysPro: 0.414 ± 0.221
CysGln: 0.331 ± 0.148
CysArg: 1.159 ± 0.379
CysSer: 0.745 ± 0.282
CysThr: 1.076 ± 0.29
CysVal: 1.242 ± 0.409
CysTrp: 0.331 ± 0.179
CysTyr: 0.993 ± 0.269
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.381 ± 0.905
AspCys: 0.662 ± 0.314
AspAsp: 5.546 ± 0.903
AspGlu: 3.228 ± 0.485
AspPhe: 1.821 ± 0.366
AspGly: 5.215 ± 0.472
AspHis: 0.993 ± 0.33
AspIle: 1.821 ± 0.427
AspLys: 0.828 ± 0.315
AspLeu: 4.387 ± 0.674
AspMet: 0.662 ± 0.238
AspAsn: 1.738 ± 0.412
AspPro: 4.636 ± 0.658
AspGln: 2.401 ± 0.422
AspArg: 5.05 ± 0.6
AspSer: 3.228 ± 0.494
AspThr: 2.732 ± 0.479
AspVal: 4.884 ± 0.551
AspTrp: 0.579 ± 0.159
AspTyr: 1.904 ± 0.45
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.47 ± 0.65
GluCys: 0.911 ± 0.368
GluAsp: 3.725 ± 0.746
GluGlu: 3.394 ± 0.764
GluPhe: 0.745 ± 0.225
GluGly: 4.056 ± 0.708
GluHis: 1.656 ± 0.379
GluIle: 1.325 ± 0.307
GluLys: 1.407 ± 0.36
GluLeu: 2.483 ± 0.375
GluMet: 0.745 ± 0.191
GluAsn: 1.904 ± 0.319
GluPro: 3.56 ± 0.732
GluGln: 1.407 ± 0.362
GluArg: 4.801 ± 0.722
GluSer: 1.987 ± 0.611
GluThr: 4.553 ± 0.593
GluVal: 3.477 ± 0.613
GluTrp: 0.497 ± 0.205
GluTyr: 0.911 ± 0.326
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.815 ± 0.417
PheCys: 0.662 ± 0.244
PheAsp: 2.07 ± 0.581
PheGlu: 2.152 ± 0.499
PhePhe: 1.904 ± 0.447
PheGly: 1.573 ± 0.461
PheHis: 1.987 ± 0.335
PheIle: 0.911 ± 0.255
PheLys: 0.911 ± 0.261
PheLeu: 2.566 ± 0.417
PheMet: 1.159 ± 0.244
PheAsn: 0.745 ± 0.204
PhePro: 1.904 ± 0.347
PheGln: 1.573 ± 0.344
PheArg: 2.649 ± 0.526
PheSer: 1.573 ± 0.265
PheThr: 1.987 ± 0.369
PheVal: 3.228 ± 0.507
PheTrp: 0.331 ± 0.174
PheTyr: 1.076 ± 0.238
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.775 ± 1.015
GlyCys: 0.993 ± 0.456
GlyAsp: 4.884 ± 0.83
GlyGlu: 3.891 ± 0.72
GlyPhe: 2.483 ± 0.407
GlyGly: 9.023 ± 2.423
GlyHis: 2.318 ± 0.462
GlyIle: 1.573 ± 0.359
GlyLys: 1.49 ± 0.39
GlyLeu: 5.381 ± 0.856
GlyMet: 1.49 ± 0.399
GlyAsn: 2.732 ± 0.691
GlyPro: 5.215 ± 1.514
GlyGln: 2.318 ± 0.484
GlyArg: 6.705 ± 0.715
GlySer: 4.967 ± 0.613
GlyThr: 3.146 ± 0.412
GlyVal: 6.54 ± 1.227
GlyTrp: 0.579 ± 0.206
GlyTyr: 1.49 ± 0.325
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.401 ± 0.723
HisCys: 0.662 ± 0.274
HisAsp: 1.656 ± 0.389
HisGlu: 1.076 ± 0.29
HisPhe: 0.828 ± 0.266
HisGly: 3.311 ± 0.691
HisHis: 0.828 ± 0.297
HisIle: 0.662 ± 0.18
HisLys: 0.579 ± 0.227
HisLeu: 2.649 ± 0.385
HisMet: 0.579 ± 0.171
HisAsn: 0.993 ± 0.322
HisPro: 2.566 ± 0.563
HisGln: 0.911 ± 0.33
HisArg: 3.642 ± 0.599
HisSer: 1.49 ± 0.32
HisThr: 1.49 ± 0.456
HisVal: 2.649 ± 0.53
HisTrp: 0.248 ± 0.143
HisTyr: 0.662 ± 0.196
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.649 ± 0.411
IleCys: 0.497 ± 0.174
IleAsp: 1.821 ± 0.383
IleGlu: 0.911 ± 0.263
IlePhe: 0.414 ± 0.245
IleGly: 1.821 ± 0.411
IleHis: 0.414 ± 0.186
IleIle: 0.497 ± 0.172
IleLys: 0.497 ± 0.162
IleLeu: 1.49 ± 0.305
IleMet: 0.662 ± 0.186
IleAsn: 1.325 ± 0.308
IlePro: 1.738 ± 0.48
IleGln: 0.828 ± 0.294
IleArg: 2.235 ± 0.488
IleSer: 2.318 ± 0.481
IleThr: 1.49 ± 0.313
IleVal: 1.656 ± 0.326
IleTrp: 0.0 ± 0.0
IleTyr: 0.828 ± 0.187
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.738 ± 0.429
LysCys: 0.497 ± 0.257
LysAsp: 1.325 ± 0.362
LysGlu: 0.911 ± 0.274
LysPhe: 0.828 ± 0.329
LysGly: 1.325 ± 0.33
LysHis: 0.662 ± 0.282
LysIle: 1.325 ± 0.265
LysLys: 1.159 ± 0.323
LysLeu: 1.987 ± 0.341
LysMet: 0.414 ± 0.184
LysAsn: 0.497 ± 0.158
LysPro: 0.828 ± 0.31
LysGln: 0.579 ± 0.191
LysArg: 2.649 ± 0.59
LysSer: 1.159 ± 0.282
LysThr: 1.49 ± 0.451
LysVal: 1.242 ± 0.381
LysTrp: 0.248 ± 0.145
LysTyr: 0.166 ± 0.102
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.113 ± 1.227
LeuCys: 1.656 ± 0.414
LeuAsp: 4.056 ± 0.433
LeuGlu: 2.897 ± 0.479
LeuPhe: 2.566 ± 0.428
LeuGly: 5.132 ± 0.802
LeuHis: 3.063 ± 0.756
LeuIle: 2.07 ± 0.369
LeuLys: 2.649 ± 0.481
LeuLeu: 8.278 ± 0.982
LeuMet: 1.738 ± 0.483
LeuAsn: 2.483 ± 0.514
LeuPro: 5.464 ± 0.726
LeuGln: 3.642 ± 0.915
LeuArg: 9.437 ± 1.136
LeuSer: 7.616 ± 0.983
LeuThr: 5.464 ± 0.742
LeuVal: 6.043 ± 0.75
LeuTrp: 1.242 ± 0.255
LeuTyr: 2.732 ± 0.56
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.904 ± 0.424
MetCys: 0.083 ± 0.082
MetAsp: 1.076 ± 0.408
MetGlu: 1.159 ± 0.296
MetPhe: 0.579 ± 0.17
MetGly: 0.828 ± 0.241
MetHis: 0.579 ± 0.202
MetIle: 0.579 ± 0.179
MetLys: 0.248 ± 0.139
MetLeu: 1.159 ± 0.364
MetMet: 0.331 ± 0.145
MetAsn: 0.497 ± 0.201
MetPro: 0.828 ± 0.319
MetGln: 1.076 ± 0.25
MetArg: 1.656 ± 0.404
MetSer: 1.656 ± 0.391
MetThr: 1.407 ± 0.343
MetVal: 1.407 ± 0.268
MetTrp: 0.166 ± 0.103
MetTyr: 0.414 ± 0.194
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.56 ± 0.528
AsnCys: 0.248 ± 0.12
AsnAsp: 1.573 ± 0.474
AsnGlu: 1.159 ± 0.346
AsnPhe: 1.738 ± 0.359
AsnGly: 1.656 ± 0.45
AsnHis: 0.911 ± 0.274
AsnIle: 1.242 ± 0.35
AsnLys: 0.414 ± 0.176
AsnLeu: 2.732 ± 0.733
AsnMet: 0.579 ± 0.248
AsnAsn: 0.911 ± 0.304
AsnPro: 2.649 ± 0.616
AsnGln: 0.745 ± 0.32
AsnArg: 2.235 ± 0.38
AsnSer: 1.987 ± 0.388
AsnThr: 2.235 ± 0.365
AsnVal: 2.815 ± 0.504
AsnTrp: 0.579 ± 0.186
AsnTyr: 1.076 ± 0.344
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.526 ± 1.036
ProCys: 0.745 ± 0.213
ProAsp: 3.063 ± 0.569
ProGlu: 3.808 ± 0.822
ProPhe: 1.987 ± 0.425
ProGly: 5.298 ± 1.479
ProHis: 1.738 ± 0.432
ProIle: 1.656 ± 0.339
ProLys: 0.911 ± 0.224
ProLeu: 5.96 ± 0.908
ProMet: 0.414 ± 0.171
ProAsn: 1.738 ± 0.483
ProPro: 11.424 ± 1.967
ProGln: 2.732 ± 0.531
ProArg: 5.381 ± 0.84
ProSer: 5.381 ± 0.784
ProThr: 3.228 ± 0.584
ProVal: 5.629 ± 1.081
ProTrp: 0.579 ± 0.219
ProTyr: 2.318 ± 0.435
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.228 ± 0.408
GlnCys: 0.828 ± 0.32
GlnAsp: 1.573 ± 0.301
GlnGlu: 1.325 ± 0.302
GlnPhe: 1.159 ± 0.226
GlnGly: 2.566 ± 0.613
GlnHis: 1.656 ± 0.321
GlnIle: 1.242 ± 0.239
GlnLys: 1.242 ± 0.354
GlnLeu: 3.146 ± 0.615
GlnMet: 0.662 ± 0.212
GlnAsn: 1.573 ± 0.411
GlnPro: 1.987 ± 0.493
GlnGln: 1.904 ± 0.741
GlnArg: 3.974 ± 0.575
GlnSer: 1.987 ± 0.331
GlnThr: 1.738 ± 0.463
GlnVal: 2.318 ± 0.397
GlnTrp: 0.414 ± 0.157
GlnTyr: 0.993 ± 0.305
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.692 ± 0.949
ArgCys: 1.987 ± 0.511
ArgAsp: 6.043 ± 0.975
ArgGlu: 4.884 ± 0.683
ArgPhe: 2.235 ± 0.519
ArgGly: 7.781 ± 0.901
ArgHis: 2.401 ± 0.478
ArgIle: 1.49 ± 0.381
ArgLys: 1.407 ± 0.395
ArgLeu: 9.272 ± 1.072
ArgMet: 1.325 ± 0.426
ArgAsn: 2.566 ± 0.672
ArgPro: 4.553 ± 0.58
ArgGln: 3.642 ± 0.644
ArgArg: 14.57 ± 2.205
ArgSer: 6.291 ± 1.077
ArgThr: 3.974 ± 0.581
ArgVal: 7.616 ± 0.811
ArgTrp: 1.738 ± 0.462
ArgTyr: 3.146 ± 0.515
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.457 ± 0.959
SerCys: 1.242 ± 0.366
SerAsp: 2.732 ± 0.445
SerGlu: 3.063 ± 0.701
SerPhe: 2.566 ± 0.407
SerGly: 4.967 ± 0.584
SerHis: 1.573 ± 0.346
SerIle: 1.242 ± 0.277
SerLys: 1.656 ± 0.358
SerLeu: 7.119 ± 0.941
SerMet: 0.911 ± 0.219
SerAsn: 2.649 ± 0.446
SerPro: 5.712 ± 0.718
SerGln: 2.07 ± 0.396
SerArg: 5.298 ± 0.837
SerSer: 6.871 ± 1.224
SerThr: 4.056 ± 0.487
SerVal: 4.967 ± 0.581
SerTrp: 0.662 ± 0.227
SerTyr: 2.401 ± 0.484
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.623 ± 0.838
ThrCys: 1.242 ± 0.315
ThrAsp: 3.146 ± 0.418
ThrGlu: 3.063 ± 0.452
ThrPhe: 2.566 ± 0.528
ThrGly: 4.222 ± 0.575
ThrHis: 1.904 ± 0.252
ThrIle: 1.407 ± 0.313
ThrLys: 0.579 ± 0.209
ThrLeu: 7.781 ± 1.081
ThrMet: 0.497 ± 0.176
ThrAsn: 1.407 ± 0.249
ThrPro: 4.305 ± 0.745
ThrGln: 1.325 ± 0.372
ThrArg: 4.884 ± 0.729
ThrSer: 2.897 ± 0.546
ThrThr: 3.146 ± 0.605
ThrVal: 4.801 ± 0.537
ThrTrp: 0.331 ± 0.156
ThrTyr: 2.152 ± 0.498
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.947 ± 0.726
ValCys: 1.573 ± 0.575
ValAsp: 4.47 ± 0.859
ValGlu: 3.56 ± 0.493
ValPhe: 3.063 ± 0.456
ValGly: 5.546 ± 1.112
ValHis: 2.649 ± 0.632
ValIle: 1.904 ± 0.374
ValLys: 1.738 ± 0.505
ValLeu: 6.54 ± 0.635
ValMet: 1.656 ± 0.507
ValAsn: 2.732 ± 0.609
ValPro: 5.464 ± 0.793
ValGln: 2.815 ± 0.449
ValArg: 6.209 ± 0.806
ValSer: 5.215 ± 0.845
ValThr: 6.457 ± 0.886
ValVal: 7.368 ± 1.201
ValTrp: 0.662 ± 0.205
ValTyr: 2.98 ± 0.524
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.497 ± 0.188
TrpCys: 0.331 ± 0.195
TrpAsp: 0.579 ± 0.238
TrpGlu: 0.662 ± 0.246
TrpPhe: 0.248 ± 0.124
TrpGly: 0.828 ± 0.306
TrpHis: 0.414 ± 0.173
TrpIle: 0.248 ± 0.147
TrpLys: 0.248 ± 0.155
TrpLeu: 0.662 ± 0.225
TrpMet: 0.248 ± 0.124
TrpAsn: 0.662 ± 0.234
TrpPro: 0.745 ± 0.201
TrpGln: 0.662 ± 0.249
TrpArg: 0.993 ± 0.302
TrpSer: 0.911 ± 0.307
TrpThr: 0.745 ± 0.238
TrpVal: 0.579 ± 0.186
TrpTrp: 0.248 ± 0.151
TrpTyr: 0.166 ± 0.103
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.897 ± 0.664
TyrCys: 0.828 ± 0.252
TyrAsp: 1.656 ± 0.301
TyrGlu: 1.987 ± 0.443
TyrPhe: 1.656 ± 0.403
TyrGly: 1.904 ± 0.305
TyrHis: 0.662 ± 0.245
TyrIle: 0.579 ± 0.172
TyrLys: 0.579 ± 0.171
TyrLeu: 2.649 ± 0.439
TyrMet: 0.745 ± 0.244
TyrAsn: 0.911 ± 0.325
TyrPro: 1.242 ± 0.353
TyrGln: 1.076 ± 0.275
TyrArg: 2.649 ± 0.411
TyrSer: 1.821 ± 0.264
TyrThr: 1.821 ± 0.426
TyrVal: 2.815 ± 0.429
TyrTrp: 0.331 ± 0.134
TyrTyr: 0.662 ± 0.175
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 32 proteins (12081 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
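A protein-level bootstrap of this kind can be sketched as follows. The source states only that the error comes from 100 bootstrap rounds at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the proteome size is kept fixed, and the error is taken as the standard deviation of the resampled frequencies. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy one-letter sequences, invented purely for illustration.
toy = ["MAARRPPAL", "MKAARLLPP", "MGGSSTTVV"]
mean = dipeptide_freq(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AlaAla: {mean:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a dipeptide's frequency depends on which proteins happen to be in the proteome.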

You can download the above dipeptide statistics (among other stats for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski