Amino acid dipepetide frequency for White sturgeon adenovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.363AlaAla: 4.363 ± 0.729
1.295AlaCys: 1.295 ± 0.305
2.999AlaAsp: 2.999 ± 0.439
2.863AlaGlu: 2.863 ± 0.493
2.249AlaPhe: 2.249 ± 0.453
2.318AlaGly: 2.318 ± 0.448
1.022AlaHis: 1.022 ± 0.281
3.272AlaIle: 3.272 ± 0.488
3.272AlaLys: 3.272 ± 0.575
4.908AlaLeu: 4.908 ± 0.483
2.454AlaMet: 2.454 ± 0.455
3.817AlaAsn: 3.817 ± 0.559
4.294AlaPro: 4.294 ± 0.595
2.795AlaGln: 2.795 ± 0.505
2.386AlaArg: 2.386 ± 0.417
3.954AlaSer: 3.954 ± 0.517
3.408AlaThr: 3.408 ± 0.424
3.749AlaVal: 3.749 ± 0.442
0.613AlaTrp: 0.613 ± 0.174
2.522AlaTyr: 2.522 ± 0.482
0.0AlaXaa: 0.0 ± 0.0
Cys
1.091CysAla: 1.091 ± 0.292
0.75CysCys: 0.75 ± 0.257
1.159CysAsp: 1.159 ± 0.366
1.022CysGlu: 1.022 ± 0.345
1.159CysPhe: 1.159 ± 0.299
1.363CysGly: 1.363 ± 0.299
0.341CysHis: 0.341 ± 0.127
1.227CysIle: 1.227 ± 0.296
1.84CysLys: 1.84 ± 0.448
2.795CysLeu: 2.795 ± 0.394
0.954CysMet: 0.954 ± 0.322
1.909CysAsn: 1.909 ± 0.356
1.159CysPro: 1.159 ± 0.254
1.091CysGln: 1.091 ± 0.259
1.091CysArg: 1.091 ± 0.289
2.386CysSer: 2.386 ± 0.452
1.022CysThr: 1.022 ± 0.304
1.091CysVal: 1.091 ± 0.259
0.204CysTrp: 0.204 ± 0.102
1.227CysTyr: 1.227 ± 0.301
0.0CysXaa: 0.0 ± 0.0
Asp
2.454AspAla: 2.454 ± 0.486
1.227AspCys: 1.227 ± 0.314
2.795AspAsp: 2.795 ± 0.617
3.681AspGlu: 3.681 ± 0.491
2.454AspPhe: 2.454 ± 0.46
2.999AspGly: 2.999 ± 0.447
1.091AspHis: 1.091 ± 0.265
3.613AspIle: 3.613 ± 0.504
2.931AspLys: 2.931 ± 0.409
5.044AspLeu: 5.044 ± 0.5
2.113AspMet: 2.113 ± 0.399
3.545AspAsn: 3.545 ± 0.573
3.272AspPro: 3.272 ± 0.545
1.227AspGln: 1.227 ± 0.28
1.772AspArg: 1.772 ± 0.348
3.613AspSer: 3.613 ± 0.442
2.795AspThr: 2.795 ± 0.464
3.272AspVal: 3.272 ± 0.455
0.477AspTrp: 0.477 ± 0.174
2.863AspTyr: 2.863 ± 0.455
0.0AspXaa: 0.0 ± 0.0
Glu
2.318GluAla: 2.318 ± 0.457
0.75GluCys: 0.75 ± 0.274
2.863GluAsp: 2.863 ± 0.446
5.317GluGlu: 5.317 ± 0.936
2.318GluPhe: 2.318 ± 0.398
2.931GluGly: 2.931 ± 0.512
1.431GluHis: 1.431 ± 0.287
3.817GluIle: 3.817 ± 0.575
2.658GluLys: 2.658 ± 0.476
5.044GluLeu: 5.044 ± 0.566
1.636GluMet: 1.636 ± 0.308
2.249GluAsn: 2.249 ± 0.425
1.977GluPro: 1.977 ± 0.299
1.909GluGln: 1.909 ± 0.445
2.386GluArg: 2.386 ± 0.418
3.817GluSer: 3.817 ± 0.6
4.294GluThr: 4.294 ± 0.523
3.749GluVal: 3.749 ± 0.453
0.477GluTrp: 0.477 ± 0.164
1.909GluTyr: 1.909 ± 0.345
0.0GluXaa: 0.0 ± 0.0
Phe
2.318PheAla: 2.318 ± 0.398
1.636PheCys: 1.636 ± 0.383
2.386PheAsp: 2.386 ± 0.337
3.067PheGlu: 3.067 ± 0.499
2.045PhePhe: 2.045 ± 0.466
1.909PheGly: 1.909 ± 0.315
1.227PheHis: 1.227 ± 0.345
2.522PheIle: 2.522 ± 0.424
2.931PheLys: 2.931 ± 0.425
2.999PheLeu: 2.999 ± 0.493
1.5PheMet: 1.5 ± 0.343
4.09PheAsn: 4.09 ± 0.677
2.249PhePro: 2.249 ± 0.326
1.636PheGln: 1.636 ± 0.227
2.59PheArg: 2.59 ± 0.488
3.476PheSer: 3.476 ± 0.451
2.863PheThr: 2.863 ± 0.462
3.408PheVal: 3.408 ± 0.523
0.613PheTrp: 0.613 ± 0.192
2.454PheTyr: 2.454 ± 0.352
0.0PheXaa: 0.0 ± 0.0
Gly
3.545GlyAla: 3.545 ± 0.562
0.818GlyCys: 0.818 ± 0.293
2.522GlyAsp: 2.522 ± 0.437
3.204GlyGlu: 3.204 ± 0.693
2.522GlyPhe: 2.522 ± 0.397
3.885GlyGly: 3.885 ± 0.6
1.772GlyHis: 1.772 ± 0.301
2.999GlyIle: 2.999 ± 0.461
2.931GlyLys: 2.931 ± 0.441
4.09GlyLeu: 4.09 ± 0.697
1.022GlyMet: 1.022 ± 0.272
2.454GlyAsn: 2.454 ± 0.559
2.454GlyPro: 2.454 ± 0.322
2.863GlyGln: 2.863 ± 0.355
2.795GlyArg: 2.795 ± 0.457
4.499GlySer: 4.499 ± 0.727
3.749GlyThr: 3.749 ± 0.711
4.158GlyVal: 4.158 ± 0.615
1.022GlyTrp: 1.022 ± 0.268
2.863GlyTyr: 2.863 ± 0.455
0.0GlyXaa: 0.0 ± 0.0
His
1.977HisAla: 1.977 ± 0.364
1.431HisCys: 1.431 ± 0.339
0.818HisAsp: 0.818 ± 0.229
1.022HisGlu: 1.022 ± 0.363
0.886HisPhe: 0.886 ± 0.243
1.227HisGly: 1.227 ± 0.297
0.613HisHis: 0.613 ± 0.243
1.704HisIle: 1.704 ± 0.386
1.022HisLys: 1.022 ± 0.246
1.5HisLeu: 1.5 ± 0.364
0.477HisMet: 0.477 ± 0.218
1.022HisAsn: 1.022 ± 0.248
1.091HisPro: 1.091 ± 0.273
0.545HisGln: 0.545 ± 0.186
1.022HisArg: 1.022 ± 0.281
1.772HisSer: 1.772 ± 0.294
1.704HisThr: 1.704 ± 0.324
1.5HisVal: 1.5 ± 0.359
0.273HisTrp: 0.273 ± 0.143
1.704HisTyr: 1.704 ± 0.308
0.0HisXaa: 0.0 ± 0.0
Ile
3.272IleAla: 3.272 ± 0.396
1.431IleCys: 1.431 ± 0.336
3.476IleAsp: 3.476 ± 0.464
3.272IleGlu: 3.272 ± 0.468
1.977IlePhe: 1.977 ± 0.297
2.999IleGly: 2.999 ± 0.441
1.5IleHis: 1.5 ± 0.316
2.318IleIle: 2.318 ± 0.486
3.272IleLys: 3.272 ± 0.337
5.112IleLeu: 5.112 ± 0.676
0.954IleMet: 0.954 ± 0.263
3.613IleAsn: 3.613 ± 0.556
3.476IlePro: 3.476 ± 0.399
1.772IleGln: 1.772 ± 0.353
2.454IleArg: 2.454 ± 0.432
4.84IleSer: 4.84 ± 0.625
4.226IleThr: 4.226 ± 0.658
3.613IleVal: 3.613 ± 0.457
0.273IleTrp: 0.273 ± 0.149
2.249IleTyr: 2.249 ± 0.422
0.0IleXaa: 0.0 ± 0.0
Lys
3.272LysAla: 3.272 ± 0.483
1.295LysCys: 1.295 ± 0.389
1.977LysAsp: 1.977 ± 0.329
2.931LysGlu: 2.931 ± 0.509
3.408LysPhe: 3.408 ± 0.483
1.772LysGly: 1.772 ± 0.466
1.091LysHis: 1.091 ± 0.304
3.204LysIle: 3.204 ± 0.463
3.885LysLys: 3.885 ± 0.695
4.84LysLeu: 4.84 ± 0.808
1.5LysMet: 1.5 ± 0.353
3.545LysAsn: 3.545 ± 0.657
2.181LysPro: 2.181 ± 0.469
2.386LysGln: 2.386 ± 0.408
3.408LysArg: 3.408 ± 0.465
2.999LysSer: 2.999 ± 0.385
3.204LysThr: 3.204 ± 0.417
3.476LysVal: 3.476 ± 0.534
0.682LysTrp: 0.682 ± 0.257
2.931LysTyr: 2.931 ± 0.443
0.0LysXaa: 0.0 ± 0.0
Leu
5.181LeuAla: 5.181 ± 0.678
1.909LeuCys: 1.909 ± 0.397
5.521LeuAsp: 5.521 ± 0.672
3.613LeuGlu: 3.613 ± 0.448
3.885LeuPhe: 3.885 ± 0.696
4.84LeuGly: 4.84 ± 0.41
1.568LeuHis: 1.568 ± 0.293
4.772LeuIle: 4.772 ± 0.567
4.772LeuLys: 4.772 ± 0.558
7.021LeuLeu: 7.021 ± 0.759
2.045LeuMet: 2.045 ± 0.4
5.317LeuAsn: 5.317 ± 0.639
6.135LeuPro: 6.135 ± 0.621
3.545LeuGln: 3.545 ± 0.446
3.476LeuArg: 3.476 ± 0.531
6.817LeuSer: 6.817 ± 0.543
6.817LeuThr: 6.817 ± 0.684
4.84LeuVal: 4.84 ± 0.511
1.5LeuTrp: 1.5 ± 0.292
2.795LeuTyr: 2.795 ± 0.424
0.0LeuXaa: 0.0 ± 0.0
Met
1.84MetAla: 1.84 ± 0.427
0.613MetCys: 0.613 ± 0.211
1.636MetAsp: 1.636 ± 0.297
0.886MetGlu: 0.886 ± 0.225
1.568MetPhe: 1.568 ± 0.33
1.568MetGly: 1.568 ± 0.266
0.75MetHis: 0.75 ± 0.225
1.227MetIle: 1.227 ± 0.256
1.704MetLys: 1.704 ± 0.312
2.931MetLeu: 2.931 ± 0.331
0.818MetMet: 0.818 ± 0.234
1.159MetAsn: 1.159 ± 0.28
1.431MetPro: 1.431 ± 0.391
0.886MetGln: 0.886 ± 0.303
1.091MetArg: 1.091 ± 0.222
2.181MetSer: 2.181 ± 0.563
2.249MetThr: 2.249 ± 0.399
1.227MetVal: 1.227 ± 0.262
0.136MetTrp: 0.136 ± 0.088
1.5MetTyr: 1.5 ± 0.256
0.0MetXaa: 0.0 ± 0.0
Asn
3.067AsnAla: 3.067 ± 0.582
1.227AsnCys: 1.227 ± 0.254
1.977AsnAsp: 1.977 ± 0.397
2.454AsnGlu: 2.454 ± 0.541
3.681AsnPhe: 3.681 ± 0.559
4.431AsnGly: 4.431 ± 0.647
1.772AsnHis: 1.772 ± 0.315
3.749AsnIle: 3.749 ± 0.422
3.272AsnLys: 3.272 ± 0.431
5.385AsnLeu: 5.385 ± 0.691
1.977AsnMet: 1.977 ± 0.444
3.681AsnAsn: 3.681 ± 0.542
3.067AsnPro: 3.067 ± 0.343
2.181AsnGln: 2.181 ± 0.308
2.658AsnArg: 2.658 ± 0.561
4.158AsnSer: 4.158 ± 0.608
4.09AsnThr: 4.09 ± 0.67
3.476AsnVal: 3.476 ± 0.692
0.75AsnTrp: 0.75 ± 0.231
3.272AsnTyr: 3.272 ± 0.536
0.0AsnXaa: 0.0 ± 0.0
Pro
3.954ProAla: 3.954 ± 0.607
1.363ProCys: 1.363 ± 0.273
3.817ProAsp: 3.817 ± 0.642
3.136ProGlu: 3.136 ± 0.6
2.727ProPhe: 2.727 ± 0.47
2.59ProGly: 2.59 ± 0.469
1.295ProHis: 1.295 ± 0.311
3.545ProIle: 3.545 ± 0.517
1.909ProLys: 1.909 ± 0.31
5.181ProLeu: 5.181 ± 0.676
1.363ProMet: 1.363 ± 0.294
3.272ProAsn: 3.272 ± 0.419
5.181ProPro: 5.181 ± 1.308
1.704ProGln: 1.704 ± 0.313
2.454ProArg: 2.454 ± 0.554
3.204ProSer: 3.204 ± 0.521
3.885ProThr: 3.885 ± 0.563
2.999ProVal: 2.999 ± 0.387
0.613ProTrp: 0.613 ± 0.225
2.931ProTyr: 2.931 ± 0.439
0.0ProXaa: 0.0 ± 0.0
Gln
2.318GlnAla: 2.318 ± 0.342
0.954GlnCys: 0.954 ± 0.302
2.113GlnAsp: 2.113 ± 0.329
1.568GlnGlu: 1.568 ± 0.388
1.227GlnPhe: 1.227 ± 0.241
2.386GlnGly: 2.386 ± 0.286
0.613GlnHis: 0.613 ± 0.26
2.318GlnIle: 2.318 ± 0.345
2.727GlnLys: 2.727 ± 0.434
3.545GlnLeu: 3.545 ± 0.43
1.091GlnMet: 1.091 ± 0.255
2.727GlnAsn: 2.727 ± 0.384
1.704GlnPro: 1.704 ± 0.393
2.113GlnGln: 2.113 ± 0.361
2.318GlnArg: 2.318 ± 0.489
3.067GlnSer: 3.067 ± 0.558
2.522GlnThr: 2.522 ± 0.393
2.249GlnVal: 2.249 ± 0.43
0.682GlnTrp: 0.682 ± 0.237
1.909GlnTyr: 1.909 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
2.658ArgAla: 2.658 ± 0.389
0.75ArgCys: 0.75 ± 0.222
2.795ArgAsp: 2.795 ± 0.46
2.318ArgGlu: 2.318 ± 0.353
2.727ArgPhe: 2.727 ± 0.45
2.59ArgGly: 2.59 ± 0.52
1.295ArgHis: 1.295 ± 0.328
1.909ArgIle: 1.909 ± 0.368
2.045ArgLys: 2.045 ± 0.427
4.158ArgLeu: 4.158 ± 0.583
1.022ArgMet: 1.022 ± 0.237
2.658ArgAsn: 2.658 ± 0.55
2.795ArgPro: 2.795 ± 0.597
2.318ArgGln: 2.318 ± 0.308
3.749ArgArg: 3.749 ± 0.782
3.545ArgSer: 3.545 ± 0.568
2.181ArgThr: 2.181 ± 0.347
2.795ArgVal: 2.795 ± 0.43
0.613ArgTrp: 0.613 ± 0.209
1.977ArgTyr: 1.977 ± 0.354
0.0ArgXaa: 0.0 ± 0.0
Ser
4.09SerAla: 4.09 ± 0.515
2.045SerCys: 2.045 ± 0.435
3.34SerAsp: 3.34 ± 0.533
4.84SerGlu: 4.84 ± 0.527
4.635SerPhe: 4.635 ± 0.548
5.726SerGly: 5.726 ± 0.766
1.568SerHis: 1.568 ± 0.351
3.34SerIle: 3.34 ± 0.488
3.476SerLys: 3.476 ± 0.616
6.339SerLeu: 6.339 ± 0.724
1.295SerMet: 1.295 ± 0.319
4.09SerAsn: 4.09 ± 0.566
2.931SerPro: 2.931 ± 0.465
3.613SerGln: 3.613 ± 0.428
2.863SerArg: 2.863 ± 0.446
5.249SerSer: 5.249 ± 0.496
4.294SerThr: 4.294 ± 0.441
4.294SerVal: 4.294 ± 0.411
1.5SerTrp: 1.5 ± 0.422
2.795SerTyr: 2.795 ± 0.464
0.0SerXaa: 0.0 ± 0.0
Thr
3.885ThrAla: 3.885 ± 0.518
1.568ThrCys: 1.568 ± 0.433
3.476ThrAsp: 3.476 ± 0.577
4.158ThrGlu: 4.158 ± 0.615
2.522ThrPhe: 2.522 ± 0.342
3.545ThrGly: 3.545 ± 0.382
1.227ThrHis: 1.227 ± 0.379
2.795ThrIle: 2.795 ± 0.43
3.749ThrLys: 3.749 ± 0.558
6.885ThrLeu: 6.885 ± 0.823
2.045ThrMet: 2.045 ± 0.309
4.022ThrAsn: 4.022 ± 0.58
5.249ThrPro: 5.249 ± 0.574
1.84ThrGln: 1.84 ± 0.289
2.522ThrArg: 2.522 ± 0.416
3.954ThrSer: 3.954 ± 0.75
4.022ThrThr: 4.022 ± 0.626
3.749ThrVal: 3.749 ± 0.532
0.545ThrTrp: 0.545 ± 0.192
2.249ThrTyr: 2.249 ± 0.39
0.0ThrXaa: 0.0 ± 0.0
Val
3.272ValAla: 3.272 ± 0.418
1.84ValCys: 1.84 ± 0.419
3.817ValAsp: 3.817 ± 0.525
2.249ValGlu: 2.249 ± 0.499
2.999ValPhe: 2.999 ± 0.462
3.272ValGly: 3.272 ± 0.502
1.091ValHis: 1.091 ± 0.259
4.499ValIle: 4.499 ± 0.48
3.136ValLys: 3.136 ± 0.387
4.022ValLeu: 4.022 ± 0.466
1.295ValMet: 1.295 ± 0.28
3.476ValAsn: 3.476 ± 0.415
3.681ValPro: 3.681 ± 0.594
2.658ValGln: 2.658 ± 0.44
2.863ValArg: 2.863 ± 0.392
4.84ValSer: 4.84 ± 0.535
4.022ValThr: 4.022 ± 0.556
2.727ValVal: 2.727 ± 0.492
0.75ValTrp: 0.75 ± 0.193
2.658ValTyr: 2.658 ± 0.442
0.0ValXaa: 0.0 ± 0.0
Trp
0.75TrpAla: 0.75 ± 0.207
0.341TrpCys: 0.341 ± 0.166
0.886TrpAsp: 0.886 ± 0.309
0.545TrpGlu: 0.545 ± 0.194
0.341TrpPhe: 0.341 ± 0.16
0.886TrpGly: 0.886 ± 0.218
0.477TrpHis: 0.477 ± 0.196
0.613TrpIle: 0.613 ± 0.177
0.613TrpLys: 0.613 ± 0.164
1.295TrpLeu: 1.295 ± 0.311
0.409TrpMet: 0.409 ± 0.157
0.75TrpAsn: 0.75 ± 0.182
0.409TrpPro: 0.409 ± 0.161
0.886TrpGln: 0.886 ± 0.243
0.477TrpArg: 0.477 ± 0.187
0.954TrpSer: 0.954 ± 0.21
0.613TrpThr: 0.613 ± 0.173
0.613TrpVal: 0.613 ± 0.22
0.273TrpTrp: 0.273 ± 0.126
0.409TrpTyr: 0.409 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.931TyrAla: 2.931 ± 0.412
1.636TyrCys: 1.636 ± 0.316
2.863TyrAsp: 2.863 ± 0.634
1.772TyrGlu: 1.772 ± 0.292
2.59TyrPhe: 2.59 ± 0.389
2.931TyrGly: 2.931 ± 0.567
1.568TyrHis: 1.568 ± 0.361
2.658TyrIle: 2.658 ± 0.409
1.84TyrLys: 1.84 ± 0.365
3.272TyrLeu: 3.272 ± 0.523
1.363TyrMet: 1.363 ± 0.233
2.931TyrAsn: 2.931 ± 0.401
2.386TyrPro: 2.386 ± 0.393
2.181TyrGln: 2.181 ± 0.367
2.522TyrArg: 2.522 ± 0.461
2.999TyrSer: 2.999 ± 0.372
2.045TyrThr: 2.045 ± 0.304
2.181TyrVal: 2.181 ± 0.371
0.545TyrTrp: 0.545 ± 0.157
2.454TyrTyr: 2.454 ± 0.33
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (14671 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski