Amino acid dipeptide frequency for Gammapapillomavirus 24

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
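As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_frequencies`, the one-letter sequence encoding and the toy sequences are assumptions made for the example, and pairs are counted within each protein separately, which may differ from how the values on this page were computed.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: sequences are assumed to use one-letter codes.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein only
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the Gammapapillomavirus 24 proteome
toy_proteome = ["MAAL", "MALK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because the pairs are ordered, AL and LA are counted as different dipeptides, which matches the separate AlaLeu and LeuAla entries in the table.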

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
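The arithmetic behind the expected frequency can be sketched as follows. The page does not state exactly which baseline or tolerance is used for the colouring, so the plain comparison against 1/400 in the hypothetical `classify` function is an assumption; the three observed values are taken from the table.

```python
STANDARD_AMINO_ACIDS = 20
WITH_XAA = 21

# Expected frequency if every dipeptide were equally likely, in per mille
expected_400 = 1000 / STANDARD_AMINO_ACIDS ** 2  # 2.5 per mille = 0.25%
expected_441 = 1000 / WITH_XAA ** 2              # about 2.268 per mille


def classify(observed_permille, expected_permille=expected_400):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over-represented"
    if observed_permille < expected_permille:
        return "under-represented"
    return "as expected"


# Observed values in per mille, copied from the table
observed = {"AlaAla": 5.721, "AlaGly": 1.226, "LysArg": 8.173}
labels = {pair: classify(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(pair, observed[pair], label)
```

This comparison ignores the bootstrap error; a stricter rule could require the difference from the expectation to exceed the reported error.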

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
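For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are illustrative only.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


# AlaAla is listed as 5.721 per mille in the table
print(permille_to_fraction(5.721))
print(permille_to_percent(5.721))
```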

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.721AlaAla: 5.721 ± 1.579
1.635AlaCys: 1.635 ± 0.886
3.678AlaAsp: 3.678 ± 0.911
3.269AlaGlu: 3.269 ± 1.012
4.087AlaPhe: 4.087 ± 0.859
1.226AlaGly: 1.226 ± 0.734
0.817AlaHis: 0.817 ± 0.751
2.452AlaIle: 2.452 ± 0.816
2.452AlaLys: 2.452 ± 1.15
5.313AlaLeu: 5.313 ± 1.254
0.0AlaMet: 0.0 ± 0.0
1.635AlaAsn: 1.635 ± 0.99
2.861AlaPro: 2.861 ± 0.677
4.495AlaGln: 4.495 ± 0.93
2.452AlaArg: 2.452 ± 0.916
2.452AlaSer: 2.452 ± 0.787
3.678AlaThr: 3.678 ± 0.876
4.495AlaVal: 4.495 ± 1.547
0.817AlaTrp: 0.817 ± 0.55
2.452AlaTyr: 2.452 ± 0.887
0.0AlaXaa: 0.0 ± 0.0
Cys
0.409CysAla: 0.409 ± 0.333
0.817CysCys: 0.817 ± 0.665
2.043CysAsp: 2.043 ± 0.971
0.817CysGlu: 0.817 ± 0.367
0.817CysPhe: 0.817 ± 0.599
1.635CysGly: 1.635 ± 0.661
0.0CysHis: 0.0 ± 0.0
1.226CysIle: 1.226 ± 1.043
2.452CysLys: 2.452 ± 0.728
2.452CysLeu: 2.452 ± 1.649
0.409CysMet: 0.409 ± 0.474
1.226CysAsn: 1.226 ± 0.485
1.635CysPro: 1.635 ± 0.586
0.0CysGln: 0.0 ± 0.0
0.817CysArg: 0.817 ± 0.674
1.226CysSer: 1.226 ± 1.058
1.226CysThr: 1.226 ± 0.527
1.635CysVal: 1.635 ± 0.885
1.226CysTrp: 1.226 ± 0.412
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.313AspAla: 5.313 ± 1.353
2.452AspCys: 2.452 ± 1.625
5.313AspAsp: 5.313 ± 1.961
4.495AspGlu: 4.495 ± 1.427
2.861AspPhe: 2.861 ± 0.946
3.269AspGly: 3.269 ± 0.56
0.817AspHis: 0.817 ± 0.367
5.313AspIle: 5.313 ± 1.472
2.452AspLys: 2.452 ± 0.806
4.087AspLeu: 4.087 ± 1.484
0.409AspMet: 0.409 ± 0.375
2.861AspAsn: 2.861 ± 0.881
3.678AspPro: 3.678 ± 1.069
3.269AspGln: 3.269 ± 1.245
3.678AspArg: 3.678 ± 0.91
7.356AspSer: 7.356 ± 1.357
6.13AspThr: 6.13 ± 1.143
4.087AspVal: 4.087 ± 2.53
0.409AspTrp: 0.409 ± 0.333
1.635AspTyr: 1.635 ± 0.544
0.0AspXaa: 0.0 ± 0.0
Glu
3.269GluAla: 3.269 ± 1.47
1.635GluCys: 1.635 ± 0.941
3.678GluAsp: 3.678 ± 0.819
8.582GluGlu: 8.582 ± 3.949
2.043GluPhe: 2.043 ± 0.643
2.861GluGly: 2.861 ± 0.438
0.409GluHis: 0.409 ± 0.474
2.452GluIle: 2.452 ± 1.071
2.452GluLys: 2.452 ± 0.854
4.087GluLeu: 4.087 ± 1.037
0.409GluMet: 0.409 ± 0.333
4.904GluAsn: 4.904 ± 1.488
3.269GluPro: 3.269 ± 1.421
5.313GluGln: 5.313 ± 1.046
1.635GluArg: 1.635 ± 0.687
4.087GluSer: 4.087 ± 0.89
2.861GluThr: 2.861 ± 0.79
4.087GluVal: 4.087 ± 1.661
1.635GluTrp: 1.635 ± 0.279
2.861GluTyr: 2.861 ± 0.997
0.0GluXaa: 0.0 ± 0.0
Phe
1.635PheAla: 1.635 ± 0.505
1.226PheCys: 1.226 ± 0.485
2.452PheAsp: 2.452 ± 0.768
3.678PheGlu: 3.678 ± 1.051
2.861PhePhe: 2.861 ± 0.763
1.635PheGly: 1.635 ± 0.734
0.409PheHis: 0.409 ± 0.49
2.452PheIle: 2.452 ± 0.49
4.495PheLys: 4.495 ± 1.653
3.678PheLeu: 3.678 ± 0.832
0.409PheMet: 0.409 ± 0.375
2.452PheAsn: 2.452 ± 0.818
2.043PhePro: 2.043 ± 0.659
3.269PheGln: 3.269 ± 1.405
0.817PheArg: 0.817 ± 0.35
1.226PheSer: 1.226 ± 0.734
2.452PheThr: 2.452 ± 0.888
2.452PheVal: 2.452 ± 0.982
0.817PheTrp: 0.817 ± 0.367
2.043PheTyr: 2.043 ± 1.077
0.0PheXaa: 0.0 ± 0.0
Gly
3.269GlyAla: 3.269 ± 0.918
1.226GlyCys: 1.226 ± 0.603
4.495GlyAsp: 4.495 ± 1.76
4.904GlyGlu: 4.904 ± 1.873
0.409GlyPhe: 0.409 ± 0.375
5.313GlyGly: 5.313 ± 2.808
0.817GlyHis: 0.817 ± 0.751
2.452GlyIle: 2.452 ± 0.835
2.452GlyLys: 2.452 ± 0.405
4.495GlyLeu: 4.495 ± 1.183
1.226GlyMet: 1.226 ± 0.412
2.861GlyAsn: 2.861 ± 0.832
2.452GlyPro: 2.452 ± 0.539
3.269GlyGln: 3.269 ± 1.009
5.721GlyArg: 5.721 ± 1.616
3.678GlySer: 3.678 ± 1.411
4.087GlyThr: 4.087 ± 1.098
3.678GlyVal: 3.678 ± 0.373
0.0GlyTrp: 0.0 ± 0.0
0.817GlyTyr: 0.817 ± 0.35
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.635HisCys: 1.635 ± 0.885
0.409HisAsp: 0.409 ± 0.333
1.226HisGlu: 1.226 ± 0.591
1.635HisPhe: 1.635 ± 0.604
0.817HisGly: 0.817 ± 0.979
0.409HisHis: 0.409 ± 0.385
1.226HisIle: 1.226 ± 0.771
0.817HisLys: 0.817 ± 0.525
1.226HisLeu: 1.226 ± 0.551
0.409HisMet: 0.409 ± 0.487
0.409HisAsn: 0.409 ± 0.333
2.043HisPro: 2.043 ± 0.679
0.817HisGln: 0.817 ± 0.578
1.226HisArg: 1.226 ± 0.408
1.226HisSer: 1.226 ± 0.408
0.817HisThr: 0.817 ± 0.35
1.226HisVal: 1.226 ± 0.664
0.409HisTrp: 0.409 ± 0.375
0.409HisTyr: 0.409 ± 0.375
0.0HisXaa: 0.0 ± 0.0
Ile
2.861IleAla: 2.861 ± 0.912
0.817IleCys: 0.817 ± 0.751
2.452IleAsp: 2.452 ± 1.296
4.904IleGlu: 4.904 ± 1.118
2.452IlePhe: 2.452 ± 0.802
4.495IleGly: 4.495 ± 1.7
1.226IleHis: 1.226 ± 0.565
4.087IleIle: 4.087 ± 0.975
0.817IleLys: 0.817 ± 0.452
4.495IleLeu: 4.495 ± 1.207
0.409IleMet: 0.409 ± 0.482
3.269IleAsn: 3.269 ± 0.557
2.043IlePro: 2.043 ± 1.121
2.452IleGln: 2.452 ± 0.808
3.678IleArg: 3.678 ± 1.524
2.452IleSer: 2.452 ± 0.479
2.861IleThr: 2.861 ± 1.1
4.087IleVal: 4.087 ± 0.89
0.0IleTrp: 0.0 ± 0.0
2.861IleTyr: 2.861 ± 0.438
0.0IleXaa: 0.0 ± 0.0
Lys
2.452LysAla: 2.452 ± 0.623
1.635LysCys: 1.635 ± 0.505
3.678LysAsp: 3.678 ± 0.757
2.861LysGlu: 2.861 ± 0.979
2.861LysPhe: 2.861 ± 1.031
1.635LysGly: 1.635 ± 1.1
1.635LysHis: 1.635 ± 0.99
1.226LysIle: 1.226 ± 0.619
2.043LysLys: 2.043 ± 1.002
6.13LysLeu: 6.13 ± 2.351
1.226LysMet: 1.226 ± 0.632
1.226LysAsn: 1.226 ± 0.758
0.409LysPro: 0.409 ± 0.333
4.087LysGln: 4.087 ± 1.299
8.173LysArg: 8.173 ± 1.064
2.452LysSer: 2.452 ± 1.146
4.087LysThr: 4.087 ± 1.004
3.269LysVal: 3.269 ± 0.848
0.409LysTrp: 0.409 ± 0.385
2.043LysTyr: 2.043 ± 0.779
0.0LysXaa: 0.0 ± 0.0
Leu
2.861LeuAla: 2.861 ± 0.438
1.635LeuCys: 1.635 ± 0.654
6.13LeuAsp: 6.13 ± 0.727
4.087LeuGlu: 4.087 ± 0.937
6.13LeuPhe: 6.13 ± 2.289
4.904LeuGly: 4.904 ± 1.77
2.861LeuHis: 2.861 ± 1.204
5.313LeuIle: 5.313 ± 2.028
4.904LeuLys: 4.904 ± 1.387
8.173LeuLeu: 8.173 ± 1.284
0.817LeuMet: 0.817 ± 0.424
2.861LeuAsn: 2.861 ± 1.027
5.313LeuPro: 5.313 ± 0.754
7.356LeuGln: 7.356 ± 1.3
2.861LeuArg: 2.861 ± 0.886
5.313LeuSer: 5.313 ± 1.854
5.721LeuThr: 5.721 ± 0.666
5.313LeuVal: 5.313 ± 1.01
0.817LeuTrp: 0.817 ± 0.412
5.313LeuTyr: 5.313 ± 0.925
0.0LeuXaa: 0.0 ± 0.0
Met
2.452MetAla: 2.452 ± 0.928
0.817MetCys: 0.817 ± 0.524
0.409MetAsp: 0.409 ± 0.482
1.635MetGlu: 1.635 ± 0.591
0.0MetPhe: 0.0 ± 0.0
0.817MetGly: 0.817 ± 0.367
0.817MetHis: 0.817 ± 0.799
0.409MetIle: 0.409 ± 0.333
1.226MetLys: 1.226 ± 0.758
0.817MetLeu: 0.817 ± 0.452
0.0MetMet: 0.0 ± 0.0
0.817MetAsn: 0.817 ± 0.367
0.409MetPro: 0.409 ± 0.333
0.0MetGln: 0.0 ± 0.0
0.817MetArg: 0.817 ± 0.367
0.0MetSer: 0.0 ± 0.0
0.817MetThr: 0.817 ± 0.367
0.817MetVal: 0.817 ± 0.367
0.409MetTrp: 0.409 ± 0.385
1.226MetTyr: 1.226 ± 0.412
0.0MetXaa: 0.0 ± 0.0
Asn
2.452AsnAla: 2.452 ± 0.802
0.817AsnCys: 0.817 ± 0.572
2.452AsnAsp: 2.452 ± 1.146
1.635AsnGlu: 1.635 ± 0.903
1.635AsnPhe: 1.635 ± 0.824
2.043AsnGly: 2.043 ± 0.973
0.0AsnHis: 0.0 ± 0.0
2.043AsnIle: 2.043 ± 0.569
2.452AsnLys: 2.452 ± 0.658
4.087AsnLeu: 4.087 ± 0.566
0.817AsnMet: 0.817 ± 0.65
2.861AsnAsn: 2.861 ± 0.711
3.678AsnPro: 3.678 ± 1.44
2.861AsnGln: 2.861 ± 1.085
2.452AsnArg: 2.452 ± 1.087
3.269AsnSer: 3.269 ± 1.2
3.678AsnThr: 3.678 ± 1.418
2.452AsnVal: 2.452 ± 0.768
0.817AsnTrp: 0.817 ± 0.452
1.635AsnTyr: 1.635 ± 0.645
0.0AsnXaa: 0.0 ± 0.0
Pro
2.861ProAla: 2.861 ± 1.155
0.409ProCys: 0.409 ± 0.474
5.313ProAsp: 5.313 ± 1.621
2.861ProGlu: 2.861 ± 0.917
0.817ProPhe: 0.817 ± 0.452
1.635ProGly: 1.635 ± 0.554
0.0ProHis: 0.0 ± 0.0
3.678ProIle: 3.678 ± 1.401
4.904ProLys: 4.904 ± 1.054
8.173ProLeu: 8.173 ± 1.303
0.0ProMet: 0.0 ± 0.0
2.043ProAsn: 2.043 ± 1.378
7.356ProPro: 7.356 ± 2.723
1.226ProGln: 1.226 ± 0.591
3.269ProArg: 3.269 ± 0.709
3.678ProSer: 3.678 ± 1.646
4.495ProThr: 4.495 ± 1.735
3.678ProVal: 3.678 ± 1.406
0.0ProTrp: 0.0 ± 0.0
3.269ProTyr: 3.269 ± 1.192
0.0ProXaa: 0.0 ± 0.0
Gln
3.678GlnAla: 3.678 ± 0.594
2.043GlnCys: 2.043 ± 0.845
2.452GlnAsp: 2.452 ± 0.672
2.043GlnGlu: 2.043 ± 0.659
3.678GlnPhe: 3.678 ± 0.554
2.452GlnGly: 2.452 ± 0.912
1.635GlnHis: 1.635 ± 0.993
2.452GlnIle: 2.452 ± 1.027
2.452GlnLys: 2.452 ± 0.691
6.539GlnLeu: 6.539 ± 1.971
2.452GlnMet: 2.452 ± 1.455
2.861GlnAsn: 2.861 ± 1.128
3.269GlnPro: 3.269 ± 0.613
2.043GlnGln: 2.043 ± 1.456
1.226GlnArg: 1.226 ± 0.618
4.087GlnSer: 4.087 ± 1.564
2.861GlnThr: 2.861 ± 0.703
2.043GlnVal: 2.043 ± 0.775
0.817GlnTrp: 0.817 ± 0.665
2.452GlnTyr: 2.452 ± 1.199
0.0GlnXaa: 0.0 ± 0.0
Arg
3.269ArgAla: 3.269 ± 1.292
0.817ArgCys: 0.817 ± 0.949
4.087ArgAsp: 4.087 ± 1.618
1.635ArgGlu: 1.635 ± 0.707
2.043ArgPhe: 2.043 ± 0.717
4.904ArgGly: 4.904 ± 1.15
2.861ArgHis: 2.861 ± 0.723
0.409ArgIle: 0.409 ± 0.375
4.495ArgLys: 4.495 ± 0.659
4.495ArgLeu: 4.495 ± 1.023
0.817ArgMet: 0.817 ± 0.452
2.861ArgAsn: 2.861 ± 0.541
3.678ArgPro: 3.678 ± 0.911
2.452ArgGln: 2.452 ± 1.033
6.13ArgArg: 6.13 ± 2.144
3.269ArgSer: 3.269 ± 0.66
3.269ArgThr: 3.269 ± 0.685
5.721ArgVal: 5.721 ± 1.267
0.0ArgTrp: 0.0 ± 0.0
1.635ArgTyr: 1.635 ± 0.591
0.0ArgXaa: 0.0 ± 0.0
Ser
3.269SerAla: 3.269 ± 0.839
0.817SerCys: 0.817 ± 0.55
4.904SerAsp: 4.904 ± 1.447
2.861SerGlu: 2.861 ± 1.187
1.635SerPhe: 1.635 ± 0.521
5.313SerGly: 5.313 ± 1.627
1.226SerHis: 1.226 ± 0.304
4.087SerIle: 4.087 ± 1.873
2.043SerLys: 2.043 ± 1.203
4.904SerLeu: 4.904 ± 0.877
1.226SerMet: 1.226 ± 0.591
2.043SerAsn: 2.043 ± 1.044
3.678SerPro: 3.678 ± 1.256
2.861SerGln: 2.861 ± 0.806
3.678SerArg: 3.678 ± 0.731
7.765SerSer: 7.765 ± 2.529
6.539SerThr: 6.539 ± 1.503
5.313SerVal: 5.313 ± 0.986
1.226SerTrp: 1.226 ± 0.617
0.817SerTyr: 0.817 ± 0.426
0.0SerXaa: 0.0 ± 0.0
Thr
2.452ThrAla: 2.452 ± 0.691
0.0ThrCys: 0.0 ± 0.0
6.947ThrAsp: 6.947 ± 0.832
4.904ThrGlu: 4.904 ± 0.926
1.226ThrPhe: 1.226 ± 0.617
6.13ThrGly: 6.13 ± 1.763
0.817ThrHis: 0.817 ± 0.543
6.13ThrIle: 6.13 ± 1.737
2.861ThrLys: 2.861 ± 1.226
6.13ThrLeu: 6.13 ± 2.018
1.226ThrMet: 1.226 ± 0.591
2.861ThrAsn: 2.861 ± 0.775
4.495ThrPro: 4.495 ± 1.183
3.269ThrGln: 3.269 ± 0.697
2.452ThrArg: 2.452 ± 0.539
3.678ThrSer: 3.678 ± 0.951
4.087ThrThr: 4.087 ± 1.445
3.678ThrVal: 3.678 ± 1.069
0.409ThrTrp: 0.409 ± 0.385
2.452ThrTyr: 2.452 ± 0.887
0.0ThrXaa: 0.0 ± 0.0
Val
4.087ValAla: 4.087 ± 0.616
1.226ValCys: 1.226 ± 0.713
6.947ValAsp: 6.947 ± 1.675
5.313ValGlu: 5.313 ± 1.538
2.452ValPhe: 2.452 ± 1.001
3.269ValGly: 3.269 ± 1.63
1.226ValHis: 1.226 ± 0.408
3.269ValIle: 3.269 ± 1.337
2.861ValLys: 2.861 ± 1.151
4.495ValLeu: 4.495 ± 0.846
0.409ValMet: 0.409 ± 0.375
1.226ValAsn: 1.226 ± 0.619
4.495ValPro: 4.495 ± 1.375
2.861ValGln: 2.861 ± 0.984
4.087ValArg: 4.087 ± 1.258
6.539ValSer: 6.539 ± 0.84
3.678ValThr: 3.678 ± 1.893
0.409ValVal: 0.409 ± 0.482
1.226ValTrp: 1.226 ± 0.591
1.635ValTyr: 1.635 ± 0.739
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.817TrpAsp: 0.817 ± 0.48
0.0TrpGlu: 0.0 ± 0.0
0.409TrpPhe: 0.409 ± 0.385
1.226TrpGly: 1.226 ± 0.485
0.409TrpHis: 0.409 ± 0.385
0.817TrpIle: 0.817 ± 0.665
1.635TrpLys: 1.635 ± 1.058
1.226TrpLeu: 1.226 ± 0.412
0.409TrpMet: 0.409 ± 0.276
0.817TrpAsn: 0.817 ± 0.48
0.0TrpPro: 0.0 ± 0.0
0.409TrpGln: 0.409 ± 0.375
1.226TrpArg: 1.226 ± 0.591
0.409TrpSer: 0.409 ± 0.289
1.226TrpThr: 1.226 ± 0.785
0.817TrpVal: 0.817 ± 0.452
0.0TrpTrp: 0.0 ± 0.0
0.409TrpTyr: 0.409 ± 0.333
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.678TyrAla: 3.678 ± 1.2
0.409TyrCys: 0.409 ± 0.474
1.226TyrAsp: 1.226 ± 0.619
0.817TyrGlu: 0.817 ± 0.665
2.452TyrPhe: 2.452 ± 0.649
2.043TyrGly: 2.043 ± 0.52
0.0TyrHis: 0.0 ± 0.0
1.226TyrIle: 1.226 ± 0.643
3.269TyrLys: 3.269 ± 0.926
3.678TyrLeu: 3.678 ± 0.98
1.226TyrMet: 1.226 ± 0.771
2.452TyrAsn: 2.452 ± 0.62
2.861TyrPro: 2.861 ± 0.851
1.226TyrGln: 1.226 ± 0.412
2.452TyrArg: 2.452 ± 1.071
2.043TyrSer: 2.043 ± 0.817
1.635TyrThr: 1.635 ± 0.503
2.452TyrVal: 2.452 ± 0.852
0.817TyrTrp: 0.817 ± 0.48
2.861TyrTyr: 2.861 ± 0.955
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2448 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
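Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. This is only a sketch under stated assumptions. The function names, the fixed seed, the use of the sample standard deviation and the toy sequences are all illustrative; the page does not describe the exact procedure beyond "x100 at the protein level".

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not the real proteome
toy_proteome = ["MAALK", "MKLLA", "MAAAK", "MLKAL"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {dipeptide_permille(toy_proteome, 'AA'):.3f} ± {error:.3f}")
```

A dipeptide that never occurs in any protein gives 0.0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.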

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski