Amino acid dipeptide frequency for Siphoviridae sp. ctvD11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
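As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes one-letter amino acid codes, counts overlapping adjacent pairs within each protein, and normalizes by the total number of pairs; the exact normalization used by the database is not stated on this page, so treat these choices, the function name dipeptide_frequencies, and the toy sequences as hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the proteome.
toy_proteins = ["MKAAL", "AALT"]
freqs = dipeptide_frequencies(toy_proteins)
for pair in sorted(freqs):
    print(f"{pair}: {freqs[pair]:.3f}")
```

Pairs are never counted across protein boundaries, which is why each sequence is scanned separately.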

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
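The comparison against the uniform expectation can be sketched in Python as follows. This is only an illustration: the function names are hypothetical, the labels "over" and "under" stand in for the red and blue markings, and whether the page takes the error estimate into account when colouring cells is not stated. The sample values are copied from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value, expected, tol=1e-9):
    """Label a frequency relative to the uniform expectation."""
    if value > expected + tol:
        return "over"      # would be marked red
    if value < expected - tol:
        return "under"     # would be marked blue
    return "as expected"


expected = expected_per_mille()  # 2.5 per mille, i.e. 0.25%

# A few values from the table (per mille).
sample = {"AlaAla": 7.613, "CysCys": 0.0, "LeuThr": 9.317, "AspPhe": 2.5}
labels = {name: classify(value, expected) for name, value in sample.items()}
print(labels)
```

With Xaa included as a 21st symbol, expected_per_mille(21) gives the corresponding expectation over 441 pairs.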

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
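A short Python sketch of the unit conversion, relating the per mille values in the table to the 0.25% figure quoted above. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


uniform = 2.5  # per mille, the uniform expectation 1/400
print(per_mille_to_fraction(uniform), per_mille_to_percent(uniform))
```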

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.613 ± 1.46
AlaCys: 0.568 ± 0.285
AlaAsp: 4.659 ± 0.886
AlaGlu: 3.863 ± 0.82
AlaPhe: 2.954 ± 0.48
AlaGly: 4.318 ± 1.387
AlaHis: 1.363 ± 0.38
AlaIle: 4.886 ± 0.81
AlaLys: 5.795 ± 1.005
AlaLeu: 8.635 ± 1.012
AlaMet: 3.181 ± 0.626
AlaAsn: 3.409 ± 1.058
AlaPro: 2.386 ± 0.561
AlaGln: 1.704 ± 0.334
AlaArg: 3.863 ± 0.745
AlaSer: 4.999 ± 1.121
AlaThr: 7.954 ± 1.841
AlaVal: 6.022 ± 1.19
AlaTrp: 0.795 ± 0.323
AlaTyr: 2.727 ± 0.547
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.682 ± 0.276
CysCys: 0.0 ± 0.0
CysAsp: 0.795 ± 0.356
CysGlu: 0.682 ± 0.339
CysPhe: 0.568 ± 0.27
CysGly: 0.227 ± 0.176
CysHis: 0.227 ± 0.181
CysIle: 0.341 ± 0.226
CysLys: 1.136 ± 0.385
CysLeu: 0.795 ± 0.279
CysMet: 0.568 ± 0.247
CysAsn: 0.682 ± 0.335
CysPro: 0.795 ± 0.362
CysGln: 0.682 ± 0.27
CysArg: 0.454 ± 0.188
CysSer: 0.227 ± 0.185
CysThr: 1.023 ± 0.339
CysVal: 0.341 ± 0.166
CysTrp: 0.114 ± 0.101
CysTyr: 0.341 ± 0.21
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.999 ± 0.954
AspCys: 0.909 ± 0.295
AspAsp: 2.159 ± 0.439
AspGlu: 2.727 ± 0.491
AspPhe: 2.5 ± 0.545
AspGly: 5.227 ± 0.745
AspHis: 0.909 ± 0.343
AspIle: 3.75 ± 0.654
AspLys: 2.272 ± 0.579
AspLeu: 7.726 ± 0.938
AspMet: 1.136 ± 0.354
AspAsn: 2.045 ± 0.403
AspPro: 2.159 ± 0.562
AspGln: 2.613 ± 0.557
AspArg: 2.954 ± 0.518
AspSer: 1.818 ± 0.662
AspThr: 4.431 ± 0.516
AspVal: 3.636 ± 0.707
AspTrp: 0.909 ± 0.355
AspTyr: 2.841 ± 0.513
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.772 ± 0.917
GluCys: 0.227 ± 0.153
GluAsp: 3.522 ± 0.698
GluGlu: 3.295 ± 0.651
GluPhe: 2.727 ± 0.75
GluGly: 4.09 ± 0.616
GluHis: 1.477 ± 0.471
GluIle: 2.954 ± 0.551
GluLys: 4.772 ± 0.871
GluLeu: 6.136 ± 1.293
GluMet: 1.136 ± 0.434
GluAsn: 2.272 ± 0.655
GluPro: 2.841 ± 0.62
GluGln: 2.386 ± 0.526
GluArg: 3.068 ± 0.754
GluSer: 1.818 ± 0.419
GluThr: 5.227 ± 0.716
GluVal: 4.09 ± 0.759
GluTrp: 0.454 ± 0.187
GluTyr: 2.613 ± 0.764
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.613 ± 0.395
PheCys: 0.114 ± 0.098
PheAsp: 2.841 ± 0.604
PheGlu: 2.954 ± 0.611
PhePhe: 0.795 ± 0.4
PheGly: 2.727 ± 0.53
PheHis: 1.477 ± 0.496
PheIle: 1.932 ± 0.523
PheLys: 1.818 ± 0.451
PheLeu: 2.841 ± 0.621
PheMet: 0.568 ± 0.288
PheAsn: 1.136 ± 0.338
PhePro: 1.25 ± 0.369
PheGln: 1.25 ± 0.351
PheArg: 1.591 ± 0.399
PheSer: 3.068 ± 0.443
PheThr: 3.295 ± 0.659
PheVal: 2.159 ± 0.387
PheTrp: 0.341 ± 0.172
PheTyr: 1.704 ± 0.518
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.477 ± 2.142
GlyCys: 0.795 ± 0.282
GlyAsp: 1.932 ± 0.532
GlyGlu: 3.75 ± 0.775
GlyPhe: 2.386 ± 0.388
GlyGly: 5.795 ± 0.979
GlyHis: 1.477 ± 0.382
GlyIle: 4.431 ± 0.527
GlyLys: 3.409 ± 0.712
GlyLeu: 5.568 ± 0.724
GlyMet: 1.136 ± 0.362
GlyAsn: 3.75 ± 0.732
GlyPro: 1.477 ± 0.346
GlyGln: 1.477 ± 0.447
GlyArg: 2.5 ± 0.639
GlySer: 4.886 ± 0.94
GlyThr: 7.158 ± 1.85
GlyVal: 4.659 ± 1.025
GlyTrp: 1.477 ± 0.503
GlyTyr: 1.477 ± 0.324
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.25 ± 0.351
HisCys: 0.454 ± 0.26
HisAsp: 1.477 ± 0.266
HisGlu: 1.25 ± 0.527
HisPhe: 1.023 ± 0.328
HisGly: 1.363 ± 0.374
HisHis: 0.454 ± 0.216
HisIle: 1.25 ± 0.434
HisLys: 0.795 ± 0.396
HisLeu: 2.272 ± 0.374
HisMet: 0.454 ± 0.241
HisAsn: 1.023 ± 0.363
HisPro: 1.023 ± 0.502
HisGln: 0.454 ± 0.228
HisArg: 0.909 ± 0.406
HisSer: 0.682 ± 0.336
HisThr: 1.818 ± 0.499
HisVal: 1.591 ± 0.582
HisTrp: 0.0 ± 0.0
HisTyr: 0.568 ± 0.268
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.318 ± 0.625
IleCys: 1.136 ± 0.283
IleAsp: 3.409 ± 0.737
IleGlu: 4.204 ± 0.833
IlePhe: 1.818 ± 0.466
IleGly: 3.068 ± 0.594
IleHis: 0.795 ± 0.247
IleIle: 2.613 ± 0.531
IleLys: 3.295 ± 0.627
IleLeu: 5.34 ± 0.939
IleMet: 1.023 ± 0.428
IleAsn: 2.045 ± 0.35
IlePro: 1.818 ± 0.465
IleGln: 3.75 ± 0.675
IleArg: 2.841 ± 0.609
IleSer: 4.318 ± 0.512
IleThr: 5.113 ± 1.007
IleVal: 4.09 ± 0.741
IleTrp: 0.454 ± 0.19
IleTyr: 1.818 ± 0.402
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.113 ± 0.937
LysCys: 0.227 ± 0.164
LysAsp: 4.09 ± 0.904
LysGlu: 4.999 ± 1.156
LysPhe: 1.023 ± 0.407
LysGly: 3.409 ± 0.716
LysHis: 0.909 ± 0.306
LysIle: 4.09 ± 0.609
LysLys: 4.318 ± 1.059
LysLeu: 5.568 ± 0.868
LysMet: 1.704 ± 0.403
LysAsn: 2.159 ± 0.545
LysPro: 2.159 ± 0.439
LysGln: 2.159 ± 0.61
LysArg: 3.068 ± 0.753
LysSer: 3.295 ± 0.723
LysThr: 4.886 ± 0.592
LysVal: 3.75 ± 0.645
LysTrp: 0.568 ± 0.283
LysTyr: 2.841 ± 0.645
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.908 ± 0.866
LeuCys: 1.023 ± 0.47
LeuAsp: 6.363 ± 0.935
LeuGlu: 7.158 ± 1.123
LeuPhe: 3.863 ± 0.853
LeuGly: 5.908 ± 0.849
LeuHis: 1.818 ± 0.501
LeuIle: 4.999 ± 0.874
LeuLys: 6.931 ± 1.042
LeuLeu: 8.522 ± 1.105
LeuMet: 2.045 ± 0.484
LeuAsn: 3.295 ± 0.65
LeuPro: 4.886 ± 0.84
LeuGln: 2.841 ± 0.672
LeuArg: 4.318 ± 0.781
LeuSer: 6.931 ± 1.047
LeuThr: 9.317 ± 0.972
LeuVal: 3.977 ± 0.697
LeuTrp: 1.591 ± 0.484
LeuTyr: 2.841 ± 0.604
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.704 ± 0.358
MetCys: 0.341 ± 0.186
MetAsp: 1.363 ± 0.378
MetGlu: 1.023 ± 0.302
MetPhe: 0.454 ± 0.211
MetGly: 1.136 ± 0.412
MetHis: 0.341 ± 0.207
MetIle: 1.477 ± 0.471
MetLys: 1.591 ± 0.55
MetLeu: 2.613 ± 0.886
MetMet: 0.341 ± 0.186
MetAsn: 0.795 ± 0.298
MetPro: 1.704 ± 0.436
MetGln: 1.25 ± 0.336
MetArg: 1.363 ± 0.385
MetSer: 1.477 ± 0.38
MetThr: 1.704 ± 0.342
MetVal: 1.932 ± 0.568
MetTrp: 0.0 ± 0.0
MetTyr: 0.795 ± 0.286
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.863 ± 0.821
AsnCys: 0.227 ± 0.172
AsnAsp: 2.613 ± 0.677
AsnGlu: 3.068 ± 0.707
AsnPhe: 1.363 ± 0.355
AsnGly: 2.5 ± 0.854
AsnHis: 0.341 ± 0.184
AsnIle: 2.5 ± 0.491
AsnLys: 1.704 ± 0.466
AsnLeu: 3.863 ± 0.603
AsnMet: 0.909 ± 0.37
AsnAsn: 1.932 ± 0.475
AsnPro: 2.5 ± 0.629
AsnGln: 1.023 ± 0.324
AsnArg: 1.363 ± 0.465
AsnSer: 2.954 ± 0.515
AsnThr: 2.954 ± 0.693
AsnVal: 3.068 ± 0.679
AsnTrp: 0.682 ± 0.337
AsnTyr: 1.932 ± 0.534
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.181 ± 0.607
ProCys: 0.568 ± 0.35
ProAsp: 1.818 ± 0.516
ProGlu: 2.841 ± 0.757
ProPhe: 1.023 ± 0.298
ProGly: 2.613 ± 0.504
ProHis: 0.682 ± 0.291
ProIle: 2.613 ± 0.523
ProLys: 2.841 ± 0.774
ProLeu: 2.841 ± 0.679
ProMet: 0.454 ± 0.242
ProAsn: 2.045 ± 0.519
ProPro: 1.818 ± 0.574
ProGln: 1.363 ± 0.471
ProArg: 1.932 ± 0.619
ProSer: 3.863 ± 0.588
ProThr: 3.75 ± 0.775
ProVal: 2.954 ± 0.739
ProTrp: 0.568 ± 0.251
ProTyr: 1.136 ± 0.413
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.977 ± 0.749
GlnCys: 0.454 ± 0.234
GlnAsp: 2.045 ± 0.451
GlnGlu: 1.932 ± 0.586
GlnPhe: 1.591 ± 0.396
GlnGly: 2.386 ± 0.816
GlnHis: 1.023 ± 0.379
GlnIle: 1.591 ± 0.341
GlnLys: 1.818 ± 0.342
GlnLeu: 2.613 ± 0.699
GlnMet: 1.023 ± 0.307
GlnAsn: 1.136 ± 0.361
GlnPro: 1.477 ± 0.416
GlnGln: 1.363 ± 0.439
GlnArg: 1.363 ± 0.396
GlnSer: 2.386 ± 0.398
GlnThr: 2.841 ± 0.496
GlnVal: 1.704 ± 0.428
GlnTrp: 0.227 ± 0.157
GlnTyr: 1.591 ± 0.618
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.295 ± 0.665
ArgCys: 0.682 ± 0.296
ArgAsp: 2.613 ± 0.493
ArgGlu: 1.704 ± 0.469
ArgPhe: 2.159 ± 0.504
ArgGly: 2.159 ± 0.563
ArgHis: 1.136 ± 0.283
ArgIle: 2.159 ± 0.566
ArgLys: 2.954 ± 0.741
ArgLeu: 4.659 ± 0.922
ArgMet: 0.795 ± 0.319
ArgAsn: 1.932 ± 0.464
ArgPro: 1.136 ± 0.353
ArgGln: 1.363 ± 0.529
ArgArg: 2.727 ± 0.641
ArgSer: 3.75 ± 0.691
ArgThr: 4.318 ± 0.578
ArgVal: 3.863 ± 0.679
ArgTrp: 0.341 ± 0.189
ArgTyr: 0.909 ± 0.446
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.477 ± 1.055
SerCys: 0.568 ± 0.25
SerAsp: 4.431 ± 0.629
SerGlu: 2.727 ± 0.493
SerPhe: 2.727 ± 0.553
SerGly: 4.431 ± 1.134
SerHis: 1.136 ± 0.453
SerIle: 3.181 ± 0.562
SerLys: 2.954 ± 0.764
SerLeu: 5.908 ± 0.861
SerMet: 2.727 ± 0.537
SerAsn: 2.045 ± 0.78
SerPro: 2.386 ± 0.642
SerGln: 1.591 ± 0.554
SerArg: 2.5 ± 0.552
SerSer: 4.659 ± 0.798
SerThr: 5.227 ± 0.925
SerVal: 5.454 ± 0.787
SerTrp: 1.477 ± 0.423
SerTyr: 2.386 ± 0.461
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.499 ± 1.708
ThrCys: 0.909 ± 0.293
ThrAsp: 4.999 ± 0.529
ThrGlu: 4.318 ± 0.655
ThrPhe: 2.954 ± 0.626
ThrGly: 8.522 ± 2.151
ThrHis: 2.159 ± 0.521
ThrIle: 5.227 ± 0.802
ThrLys: 5.454 ± 0.612
ThrLeu: 8.863 ± 1.081
ThrMet: 1.591 ± 0.486
ThrAsn: 2.954 ± 0.602
ThrPro: 4.09 ± 0.679
ThrGln: 3.068 ± 0.536
ThrArg: 2.386 ± 0.715
ThrSer: 4.999 ± 1.217
ThrThr: 6.704 ± 1.409
ThrVal: 6.477 ± 1.328
ThrTrp: 1.363 ± 0.342
ThrTyr: 3.522 ± 0.657
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.772 ± 0.836
ValCys: 0.568 ± 0.24
ValAsp: 4.09 ± 0.604
ValGlu: 3.522 ± 0.844
ValPhe: 2.727 ± 0.519
ValGly: 3.409 ± 0.677
ValHis: 1.477 ± 0.334
ValIle: 4.659 ± 0.655
ValLys: 3.863 ± 0.55
ValLeu: 6.249 ± 0.696
ValMet: 1.818 ± 0.435
ValAsn: 3.068 ± 0.919
ValPro: 3.522 ± 0.766
ValGln: 2.159 ± 0.492
ValArg: 2.613 ± 0.522
ValSer: 5.454 ± 0.836
ValThr: 6.022 ± 0.904
ValVal: 4.431 ± 0.588
ValTrp: 0.909 ± 0.417
ValTyr: 2.272 ± 0.491
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.795 ± 0.307
TrpCys: 0.0 ± 0.0
TrpAsp: 0.568 ± 0.307
TrpGlu: 1.023 ± 0.288
TrpPhe: 0.341 ± 0.198
TrpGly: 1.023 ± 0.309
TrpHis: 0.0 ± 0.0
TrpIle: 1.023 ± 0.417
TrpLys: 0.227 ± 0.18
TrpLeu: 1.023 ± 0.338
TrpMet: 0.227 ± 0.152
TrpAsn: 1.591 ± 0.34
TrpPro: 0.454 ± 0.297
TrpGln: 0.795 ± 0.302
TrpArg: 0.341 ± 0.197
TrpSer: 1.023 ± 0.332
TrpThr: 0.909 ± 0.296
TrpVal: 0.909 ± 0.207
TrpTrp: 0.341 ± 0.316
TrpTyr: 0.568 ± 0.417
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.386 ± 0.565
TyrCys: 0.909 ± 0.347
TyrAsp: 1.932 ± 0.473
TyrGlu: 2.841 ± 0.73
TyrPhe: 1.477 ± 0.371
TyrGly: 1.704 ± 0.426
TyrHis: 0.909 ± 0.326
TyrIle: 1.591 ± 0.364
TyrLys: 2.386 ± 0.492
TyrLeu: 2.5 ± 0.549
TyrMet: 0.454 ± 0.221
TyrAsn: 2.045 ± 0.576
TyrPro: 1.136 ± 0.352
TyrGln: 1.25 ± 0.421
TyrArg: 2.5 ± 0.622
TyrSer: 2.386 ± 0.546
TyrThr: 3.409 ± 0.681
TyrVal: 2.5 ± 0.659
TyrTrp: 0.568 ± 0.239
TyrTyr: 1.704 ± 0.408
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (8802 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
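A protein-level bootstrap of this kind could be sketched in Python as below. This is a hedged illustration, not the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the reported error is the standard deviation across replicates, and that frequencies are normalized by the number of overlapping pairs. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy_proteins = ["MKAAL", "AALT", "MAAAK", "GLTV"]
mean, err = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling at the protein level rather than the residue level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.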

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski