Amino acid dipeptide frequency for Zalophus californianus papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
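As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues in a list of one-letter protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice not to count pairs that span two proteins are assumptions made for this example; the page does not describe the exact procedure used by the database.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs within one protein only (assumption:
        # pairs are not counted across protein boundaries).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    for dp, freq in sorted(dipeptide_frequencies(["MAAC", "CAB"]).items()):
        print(dp, round(freq, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 21 x 21 table would fill those cells with 0.0.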

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
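The expected value quoted above follows directly from the number of possible pairs. The short sketch below works through that arithmetic and labels a value as over- or underrepresented relative to it. The helper names are hypothetical, and using the uniform expectation as the exact threshold for the red and blue marking is an assumption; the page only says "more than expected" and "underrepresented".

```python
def expected_per_mille(n_amino_acids=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_amino_acids ** 2


def classify(observed_per_mille, expected=2.5):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"   # would be marked red under this assumption
    if observed_per_mille < expected:
        return "under"  # would be marked blue under this assumption
    return "expected"


if __name__ == "__main__":
    print(expected_per_mille(20))            # 20 x 20 = 400 dipeptides
    print(round(expected_per_mille(21), 3))  # 21 x 21 = 441 with Xaa
    print(classify(11.199))                  # ProPro value from the table
    print(classify(0.0))                     # TrpTrp value from the table
```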

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.295AlaAla: 8.295 ± 1.975
0.0AlaCys: 0.0 ± 0.0
3.733AlaAsp: 3.733 ± 0.935
5.392AlaGlu: 5.392 ± 1.031
3.733AlaPhe: 3.733 ± 1.252
5.392AlaGly: 5.392 ± 1.531
1.659AlaHis: 1.659 ± 0.922
3.318AlaIle: 3.318 ± 1.845
1.659AlaLys: 1.659 ± 0.945
6.221AlaLeu: 6.221 ± 1.065
2.074AlaMet: 2.074 ± 1.049
2.074AlaAsn: 2.074 ± 0.385
7.881AlaPro: 7.881 ± 2.788
4.148AlaGln: 4.148 ± 0.989
6.636AlaArg: 6.636 ± 2.35
2.489AlaSer: 2.489 ± 0.893
5.807AlaThr: 5.807 ± 1.785
6.221AlaVal: 6.221 ± 1.567
1.244AlaTrp: 1.244 ± 1.007
3.318AlaTyr: 3.318 ± 0.949
0.0AlaXaa: 0.0 ± 0.0
Cys
2.903CysAla: 2.903 ± 0.957
1.244CysCys: 1.244 ± 0.72
0.415CysAsp: 0.415 ± 0.487
0.415CysGlu: 0.415 ± 0.323
0.83CysPhe: 0.83 ± 0.647
2.074CysGly: 2.074 ± 1.425
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.659CysLys: 1.659 ± 0.676
1.659CysLeu: 1.659 ± 1.314
0.0CysMet: 0.0 ± 0.0
0.83CysAsn: 0.83 ± 0.407
2.489CysPro: 2.489 ± 0.776
0.0CysGln: 0.0 ± 0.0
2.903CysArg: 2.903 ± 1.558
0.0CysSer: 0.0 ± 0.0
1.659CysThr: 1.659 ± 0.832
0.415CysVal: 0.415 ± 0.323
0.415CysTrp: 0.415 ± 0.358
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.392AspAla: 5.392 ± 1.574
0.83AspCys: 0.83 ± 0.417
2.489AspAsp: 2.489 ± 1.095
2.903AspGlu: 2.903 ± 1.057
1.244AspPhe: 1.244 ± 0.645
4.562AspGly: 4.562 ± 1.602
0.415AspHis: 0.415 ± 0.323
3.318AspIle: 3.318 ± 1.309
1.659AspLys: 1.659 ± 0.727
5.807AspLeu: 5.807 ± 1.07
0.83AspMet: 0.83 ± 0.715
2.489AspAsn: 2.489 ± 0.878
3.318AspPro: 3.318 ± 0.768
1.244AspGln: 1.244 ± 0.6
1.659AspArg: 1.659 ± 0.808
6.636AspSer: 6.636 ± 1.017
4.977AspThr: 4.977 ± 0.835
4.977AspVal: 4.977 ± 2.427
1.659AspTrp: 1.659 ± 0.832
2.489AspTyr: 2.489 ± 0.907
0.0AspXaa: 0.0 ± 0.0
Glu
7.051GluAla: 7.051 ± 1.594
1.244GluCys: 1.244 ± 0.631
3.318GluAsp: 3.318 ± 1.089
6.221GluGlu: 6.221 ± 1.931
0.415GluPhe: 0.415 ± 0.323
5.392GluGly: 5.392 ± 1.469
0.415GluHis: 0.415 ± 0.358
1.244GluIle: 1.244 ± 0.645
0.83GluLys: 0.83 ± 0.548
5.392GluLeu: 5.392 ± 0.843
0.415GluMet: 0.415 ± 0.378
1.659GluAsn: 1.659 ± 1.02
3.318GluPro: 3.318 ± 0.768
4.562GluGln: 4.562 ± 1.328
3.318GluArg: 3.318 ± 1.393
2.489GluSer: 2.489 ± 0.81
2.489GluThr: 2.489 ± 0.92
4.148GluVal: 4.148 ± 1.796
1.244GluTrp: 1.244 ± 0.618
3.318GluTyr: 3.318 ± 0.875
0.0GluXaa: 0.0 ± 0.0
Phe
2.903PheAla: 2.903 ± 0.683
0.83PheCys: 0.83 ± 1.308
4.148PheAsp: 4.148 ± 1.575
0.83PheGlu: 0.83 ± 0.373
2.903PhePhe: 2.903 ± 0.978
1.659PheGly: 1.659 ± 0.525
0.83PheHis: 0.83 ± 0.373
1.659PheIle: 1.659 ± 0.525
2.074PheLys: 2.074 ± 1.252
2.074PheLeu: 2.074 ± 0.493
1.244PheMet: 1.244 ± 0.62
0.83PheAsn: 0.83 ± 0.417
1.244PhePro: 1.244 ± 0.358
0.415PheGln: 0.415 ± 0.358
2.489PheArg: 2.489 ± 0.506
2.074PheSer: 2.074 ± 0.959
1.244PheThr: 1.244 ± 0.588
1.244PheVal: 1.244 ± 0.703
1.659PheTrp: 1.659 ± 1.039
0.415PheTyr: 0.415 ± 0.377
0.0PheXaa: 0.0 ± 0.0
Gly
7.881GlyAla: 7.881 ± 0.789
1.244GlyCys: 1.244 ± 0.723
6.636GlyAsp: 6.636 ± 2.096
4.562GlyGlu: 4.562 ± 1.255
0.83GlyPhe: 0.83 ± 0.417
9.125GlyGly: 9.125 ± 2.2
1.244GlyHis: 1.244 ± 0.703
2.903GlyIle: 2.903 ± 0.974
1.244GlyLys: 1.244 ± 0.621
4.148GlyLeu: 4.148 ± 1.672
0.415GlyMet: 0.415 ± 0.323
2.903GlyAsn: 2.903 ± 0.855
8.71GlyPro: 8.71 ± 2.166
3.733GlyGln: 3.733 ± 1.066
6.636GlyArg: 6.636 ± 2.945
6.636GlySer: 6.636 ± 1.527
6.636GlyThr: 6.636 ± 0.815
4.977GlyVal: 4.977 ± 1.251
1.244GlyTrp: 1.244 ± 0.655
2.489GlyTyr: 2.489 ± 1.174
0.0GlyXaa: 0.0 ± 0.0
His
2.489HisAla: 2.489 ± 0.725
0.83HisCys: 0.83 ± 0.407
0.415HisAsp: 0.415 ± 0.377
1.659HisGlu: 1.659 ± 0.635
0.83HisPhe: 0.83 ± 0.417
1.659HisGly: 1.659 ± 1.377
1.244HisHis: 1.244 ± 0.388
0.83HisIle: 0.83 ± 0.647
0.415HisLys: 0.415 ± 0.323
2.903HisLeu: 2.903 ± 0.556
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
2.489HisPro: 2.489 ± 1.035
0.83HisGln: 0.83 ± 0.647
2.074HisArg: 2.074 ± 0.702
0.415HisSer: 0.415 ± 0.323
0.83HisThr: 0.83 ± 0.511
1.244HisVal: 1.244 ± 0.817
0.83HisTrp: 0.83 ± 0.432
1.244HisTyr: 1.244 ± 0.854
0.0HisXaa: 0.0 ± 0.0
Ile
1.244IleAla: 1.244 ± 0.631
0.415IleCys: 0.415 ± 0.323
2.903IleAsp: 2.903 ± 1.289
2.074IleGlu: 2.074 ± 0.545
1.244IlePhe: 1.244 ± 0.733
2.074IleGly: 2.074 ± 0.888
0.415IleHis: 0.415 ± 0.323
0.83IleIle: 0.83 ± 0.373
0.415IleLys: 0.415 ± 0.323
4.562IleLeu: 4.562 ± 1.133
0.83IleMet: 0.83 ± 0.647
1.244IleAsn: 1.244 ± 0.358
2.074IlePro: 2.074 ± 1.175
1.244IleGln: 1.244 ± 0.685
2.489IleArg: 2.489 ± 0.705
2.074IleSer: 2.074 ± 1.054
0.83IleThr: 0.83 ± 0.417
2.903IleVal: 2.903 ± 1.019
0.0IleTrp: 0.0 ± 0.0
1.659IleTyr: 1.659 ± 0.782
0.0IleXaa: 0.0 ± 0.0
Lys
4.148LysAla: 4.148 ± 1.573
1.659LysCys: 1.659 ± 0.917
2.489LysAsp: 2.489 ± 0.9
2.074LysGlu: 2.074 ± 1.117
1.244LysPhe: 1.244 ± 0.655
1.244LysGly: 1.244 ± 0.631
2.489LysHis: 2.489 ± 1.292
0.415LysIle: 0.415 ± 0.377
0.83LysLys: 0.83 ± 0.432
2.074LysLeu: 2.074 ± 0.976
0.415LysMet: 0.415 ± 0.358
0.83LysAsn: 0.83 ± 0.647
0.83LysPro: 0.83 ± 0.715
0.83LysGln: 0.83 ± 0.558
5.392LysArg: 5.392 ± 0.77
2.489LysSer: 2.489 ± 1.565
1.244LysThr: 1.244 ± 0.618
1.244LysVal: 1.244 ± 0.862
0.0LysTrp: 0.0 ± 0.0
1.244LysTyr: 1.244 ± 0.655
0.0LysXaa: 0.0 ± 0.0
Leu
2.903LeuAla: 2.903 ± 1.31
1.244LeuCys: 1.244 ± 0.685
5.807LeuAsp: 5.807 ± 1.079
4.148LeuGlu: 4.148 ± 1.107
2.903LeuPhe: 2.903 ± 1.284
8.295LeuGly: 8.295 ± 2.459
2.903LeuHis: 2.903 ± 0.901
1.244LeuIle: 1.244 ± 0.655
4.977LeuLys: 4.977 ± 1.438
7.881LeuLeu: 7.881 ± 2.453
1.659LeuMet: 1.659 ± 0.782
2.903LeuAsn: 2.903 ± 0.529
7.051LeuPro: 7.051 ± 2.983
4.977LeuGln: 4.977 ± 1.117
7.466LeuArg: 7.466 ± 1.086
6.636LeuSer: 6.636 ± 1.791
4.148LeuThr: 4.148 ± 2.145
4.977LeuVal: 4.977 ± 1.228
2.489LeuTrp: 2.489 ± 1.217
2.903LeuTyr: 2.903 ± 0.866
0.0LeuXaa: 0.0 ± 0.0
Met
2.903MetAla: 2.903 ± 0.859
0.0MetCys: 0.0 ± 0.0
1.244MetAsp: 1.244 ± 0.388
0.415MetGlu: 0.415 ± 0.378
0.0MetPhe: 0.0 ± 0.0
1.659MetGly: 1.659 ± 1.039
0.83MetHis: 0.83 ± 0.407
0.415MetIle: 0.415 ± 0.487
0.0MetLys: 0.0 ± 0.0
1.659MetLeu: 1.659 ± 1.093
0.0MetMet: 0.0 ± 0.0
0.415MetAsn: 0.415 ± 0.378
0.0MetPro: 0.0 ± 0.0
0.415MetGln: 0.415 ± 0.323
1.244MetArg: 1.244 ± 0.388
1.659MetSer: 1.659 ± 0.816
0.0MetThr: 0.0 ± 0.0
1.244MetVal: 1.244 ± 0.807
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.903AsnAla: 2.903 ± 1.092
1.244AsnCys: 1.244 ± 0.716
2.489AsnAsp: 2.489 ± 0.881
0.83AsnGlu: 0.83 ± 0.407
0.415AsnPhe: 0.415 ± 0.475
2.903AsnGly: 2.903 ± 1.112
0.415AsnHis: 0.415 ± 0.378
0.415AsnIle: 0.415 ± 0.323
0.415AsnLys: 0.415 ± 0.358
1.659AsnLeu: 1.659 ± 0.286
0.0AsnMet: 0.0 ± 0.0
0.83AsnAsn: 0.83 ± 0.602
2.903AsnPro: 2.903 ± 1.305
1.244AsnGln: 1.244 ± 1.073
2.903AsnArg: 2.903 ± 0.919
1.244AsnSer: 1.244 ± 0.388
2.903AsnThr: 2.903 ± 1.168
1.659AsnVal: 1.659 ± 1.039
0.415AsnTrp: 0.415 ± 0.323
0.83AsnTyr: 0.83 ± 0.436
0.0AsnXaa: 0.0 ± 0.0
Pro
7.466ProAla: 7.466 ± 3.002
1.244ProCys: 1.244 ± 0.919
4.977ProAsp: 4.977 ± 1.658
5.392ProGlu: 5.392 ± 1.066
1.244ProPhe: 1.244 ± 0.589
7.051ProGly: 7.051 ± 2.166
1.244ProHis: 1.244 ± 0.676
2.903ProIle: 2.903 ± 1.109
1.244ProLys: 1.244 ± 0.358
8.295ProLeu: 8.295 ± 2.237
0.0ProMet: 0.0 ± 0.0
2.074ProAsn: 2.074 ± 1.107
11.199ProPro: 11.199 ± 2.394
1.244ProGln: 1.244 ± 0.565
6.636ProArg: 6.636 ± 1.274
4.977ProSer: 4.977 ± 1.598
5.807ProThr: 5.807 ± 1.936
4.977ProVal: 4.977 ± 2.676
0.415ProTrp: 0.415 ± 0.378
2.074ProTyr: 2.074 ± 0.882
0.0ProXaa: 0.0 ± 0.0
Gln
4.148GlnAla: 4.148 ± 1.122
0.0GlnCys: 0.0 ± 0.0
2.074GlnAsp: 2.074 ± 0.738
4.148GlnGlu: 4.148 ± 1.368
2.903GlnPhe: 2.903 ± 0.527
2.489GlnGly: 2.489 ± 1.299
0.415GlnHis: 0.415 ± 0.323
1.659GlnIle: 1.659 ± 0.525
2.489GlnLys: 2.489 ± 1.248
3.318GlnLeu: 3.318 ± 1.005
0.415GlnMet: 0.415 ± 0.427
0.83GlnAsn: 0.83 ± 0.432
4.148GlnPro: 4.148 ± 1.594
1.659GlnGln: 1.659 ± 0.834
3.318GlnArg: 3.318 ± 0.533
2.489GlnSer: 2.489 ± 1.303
3.318GlnThr: 3.318 ± 1.427
2.489GlnVal: 2.489 ± 0.745
0.83GlnTrp: 0.83 ± 0.689
1.659GlnTyr: 1.659 ± 1.02
0.0GlnXaa: 0.0 ± 0.0
Arg
6.221ArgAla: 6.221 ± 0.958
2.903ArgCys: 2.903 ± 1.981
1.659ArgAsp: 1.659 ± 0.687
3.318ArgGlu: 3.318 ± 1.203
3.318ArgPhe: 3.318 ± 1.161
8.295ArgGly: 8.295 ± 1.448
3.318ArgHis: 3.318 ± 1.493
0.83ArgIle: 0.83 ± 0.373
4.562ArgLys: 4.562 ± 1.302
8.295ArgLeu: 8.295 ± 0.875
0.0ArgMet: 0.0 ± 0.336
2.074ArgAsn: 2.074 ± 0.711
5.392ArgPro: 5.392 ± 2.447
3.318ArgGln: 3.318 ± 1.416
8.295ArgArg: 8.295 ± 2.857
3.733ArgSer: 3.733 ± 0.769
2.903ArgThr: 2.903 ± 1.761
5.807ArgVal: 5.807 ± 1.203
1.244ArgTrp: 1.244 ± 0.72
3.733ArgTyr: 3.733 ± 1.437
0.0ArgXaa: 0.0 ± 0.0
Ser
2.903SerAla: 2.903 ± 1.098
0.0SerCys: 0.0 ± 0.0
3.318SerAsp: 3.318 ± 1.069
2.489SerGlu: 2.489 ± 1.097
2.074SerPhe: 2.074 ± 0.638
6.636SerGly: 6.636 ± 2.755
1.659SerHis: 1.659 ± 0.569
2.903SerIle: 2.903 ± 1.402
0.83SerLys: 0.83 ± 0.694
4.977SerLeu: 4.977 ± 0.953
1.244SerMet: 1.244 ± 0.541
2.489SerAsn: 2.489 ± 0.907
4.977SerPro: 4.977 ± 0.938
4.562SerGln: 4.562 ± 0.979
5.392SerArg: 5.392 ± 1.346
7.466SerSer: 7.466 ± 1.974
5.807SerThr: 5.807 ± 1.139
3.733SerVal: 3.733 ± 1.872
0.415SerTrp: 0.415 ± 0.378
1.244SerTyr: 1.244 ± 0.569
0.0SerXaa: 0.0 ± 0.0
Thr
2.903ThrAla: 2.903 ± 0.92
1.244ThrCys: 1.244 ± 0.676
6.636ThrAsp: 6.636 ± 1.234
2.074ThrGlu: 2.074 ± 1.211
2.489ThrPhe: 2.489 ± 0.624
5.807ThrGly: 5.807 ± 1.248
2.074ThrHis: 2.074 ± 0.388
1.244ThrIle: 1.244 ± 0.854
1.244ThrLys: 1.244 ± 0.358
7.051ThrLeu: 7.051 ± 1.991
2.903ThrMet: 2.903 ± 1.589
1.659ThrAsn: 1.659 ± 0.945
5.392ThrPro: 5.392 ± 0.982
3.318ThrGln: 3.318 ± 2.588
3.318ThrArg: 3.318 ± 0.677
5.392ThrSer: 5.392 ± 1.893
2.074ThrThr: 2.074 ± 1.586
3.318ThrVal: 3.318 ± 0.677
1.244ThrTrp: 1.244 ± 0.716
0.83ThrTyr: 0.83 ± 0.715
0.0ThrXaa: 0.0 ± 0.0
Val
4.148ValAla: 4.148 ± 1.5
1.659ValCys: 1.659 ± 0.818
2.903ValAsp: 2.903 ± 0.995
4.148ValGlu: 4.148 ± 1.544
2.903ValPhe: 2.903 ± 1.234
3.318ValGly: 3.318 ± 1.45
0.83ValHis: 0.83 ± 0.577
3.318ValIle: 3.318 ± 2.114
2.489ValLys: 2.489 ± 0.911
4.562ValLeu: 4.562 ± 0.51
0.0ValMet: 0.0 ± 0.0
0.415ValAsn: 0.415 ± 0.377
5.392ValPro: 5.392 ± 1.313
4.562ValGln: 4.562 ± 0.991
4.562ValArg: 4.562 ± 1.66
4.148ValSer: 4.148 ± 2.228
6.221ValThr: 6.221 ± 1.521
3.733ValVal: 3.733 ± 1.576
0.415ValTrp: 0.415 ± 0.358
2.074ValTyr: 2.074 ± 0.621
0.0ValXaa: 0.0 ± 0.0
Trp
0.83TrpAla: 0.83 ± 0.417
1.244TrpCys: 1.244 ± 0.919
0.83TrpAsp: 0.83 ± 0.407
1.659TrpGlu: 1.659 ± 0.922
0.83TrpPhe: 0.83 ± 0.407
1.244TrpGly: 1.244 ± 0.807
0.415TrpHis: 0.415 ± 0.358
1.244TrpIle: 1.244 ± 0.388
2.074TrpLys: 2.074 ± 1.25
1.244TrpLeu: 1.244 ± 0.588
0.415TrpMet: 0.415 ± 0.323
0.415TrpAsn: 0.415 ± 0.358
0.0TrpPro: 0.0 ± 0.0
0.415TrpGln: 0.415 ± 0.378
0.83TrpArg: 0.83 ± 0.647
0.415TrpSer: 0.415 ± 0.358
1.659TrpThr: 1.659 ± 1.096
1.244TrpVal: 1.244 ± 0.97
0.0TrpTrp: 0.0 ± 0.0
0.83TrpTyr: 0.83 ± 0.407
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.659TyrAla: 1.659 ± 0.286
0.83TyrCys: 0.83 ± 0.975
0.415TyrAsp: 0.415 ± 0.475
3.733TyrGlu: 3.733 ± 0.988
0.415TyrPhe: 0.415 ± 0.654
3.318TyrGly: 3.318 ± 0.583
0.415TyrHis: 0.415 ± 0.358
0.83TyrIle: 0.83 ± 0.417
2.074TyrLys: 2.074 ± 0.502
3.733TyrLeu: 3.733 ± 0.888
0.83TyrMet: 0.83 ± 0.432
1.659TyrAsn: 1.659 ± 1.175
1.659TyrPro: 1.659 ± 0.788
2.074TyrGln: 2.074 ± 0.738
2.074TyrArg: 2.074 ± 0.493
1.659TyrSer: 1.659 ± 0.782
1.659TyrThr: 1.659 ± 0.549
1.244TyrVal: 1.244 ± 0.475
2.074TyrTrp: 2.074 ± 0.711
2.489TyrTyr: 2.489 ± 1.233
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2412 amino acids)
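
Each table entry above has the form "8.295AlaAla: 8.295 ± 1.975", that is, the per mille value, the dipeptide name, then the value again with its error. For readers who want to work with these entries directly, the following is a minimal parsing sketch; the function name `parse_entry` and the returned field names are hypothetical, and the pattern only assumes the layout seen in the entries of this page.

```python
import re

# value, first residue, second residue, ": ", value, " ± ", error
ENTRY = re.compile(
    r"^[\d.]+([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)$"
)


def parse_entry(line):
    """Parse one table entry; return None for row labels such as 'Ala'."""
    match = ENTRY.match(line.strip())
    if match is None:
        return None
    first, second, value, error = match.groups()
    return {
        "first": first,
        "second": second,
        "per_mille": float(value),
        "error_per_mille": float(error),
        # Per mille to fraction: multiply by 10^-3.
        "fraction": float(value) * 1e-3,
    }


if __name__ == "__main__":
    print(parse_entry("8.295AlaAla: 8.295 ± 1.975"))
    print(parse_entry("Ala"))
```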

Note: The error was estimated by bootstrapping (×100) at the protein level.
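
To make the idea of protein-level bootstrapping concrete, here is a minimal sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is taken as the error. Reporting the standard deviation of the replicates, the function names, and the fixed random seed are all assumptions for illustration; the note above states only that bootstrapping with 100 replicates at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    toy = ["MAAPPL", "MPPGAA", "MKRRLL", "MAAGG"]
    print(round(dipeptide_per_mille(toy, "AA"), 3))
    print(round(bootstrap_error(toy, "AA"), 3))
```

With only a handful of proteins, as in a small viral proteome, each resample can differ a lot from the original set, which is one plausible reason the errors in the table are large relative to the values.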

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski