Amino acid dipeptide frequency for Fulmarus glacialis papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard residues equally likely, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
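As an illustration of the counting described above, here is a minimal Python sketch. It assumes overlapping dipeptides counted within each protein and normalised by the total number of dipeptides; the page does not state the exact denominator used by Proteome-pI, and the function name and toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes in the table's residue order; "X" stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return the frequency (in per mille) of every possible dipeptide."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs: residues i and i+1 of the same protein.
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard residues maps to X.
            first = first if first in ALPHABET else "X"
            second = second if second in ALPHABET else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET + "X", repeat=2)}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET + "X", repeat=2)
    }

toy_proteome = ["MSSLLA", "MKRSS"]  # hypothetical sequences, not the real proteome
freqs = dipeptide_per_mille(toy_proteome)
print(len(freqs))              # 441 possible dipeptides, including Xaa
print(round(freqs["SS"], 1))   # 2 of 9 dipeptides -> 222.2
print(1000.0 / 400)            # uniform expectation: 2.5 per mille (0.25%)
```

With a real proteome the values would be compared against the 2.5‰ uniform expectation to decide which dipeptides are over- or underrepresented.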

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
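The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and the comparison against 2.5‰ assumes the uniform expectation of 1/400 for the standard dipeptides; the SerSer value is taken from the table.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE

ser_ser = 8.449  # SerSer value from the table, in per mille
print(per_mille_to_fraction(ser_ser))      # fraction of all dipeptides
print(round(fold_vs_uniform(ser_ser), 2))  # 3.38
```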

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.84 ± 1.283
AlaCys: 0.384 ± 0.307
AlaAsp: 1.92 ± 0.909
AlaGlu: 4.224 ± 0.6
AlaPhe: 2.688 ± 0.795
AlaGly: 3.072 ± 0.97
AlaHis: 0.768 ± 0.462
AlaIle: 1.92 ± 0.827
AlaLys: 3.84 ± 1.126
AlaLeu: 4.992 ± 1.736
AlaMet: 2.688 ± 1.241
AlaAsn: 1.92 ± 1.001
AlaPro: 4.608 ± 0.763
AlaGln: 1.92 ± 0.98
AlaArg: 4.608 ± 1.034
AlaSer: 4.224 ± 1.134
AlaThr: 3.456 ± 1.308
AlaVal: 2.304 ± 0.691
AlaTrp: 0.384 ± 0.523
AlaTyr: 3.072 ± 0.797
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.384 ± 0.307
CysCys: 0.0 ± 0.0
CysAsp: 0.768 ± 0.365
CysGlu: 1.152 ± 0.552
CysPhe: 1.536 ± 0.594
CysGly: 1.92 ± 0.721
CysHis: 0.384 ± 0.36
CysIle: 1.536 ± 0.657
CysLys: 1.536 ± 0.832
CysLeu: 1.92 ± 1.137
CysMet: 1.152 ± 0.633
CysAsn: 0.384 ± 0.307
CysPro: 2.688 ± 0.547
CysGln: 1.152 ± 0.444
CysArg: 1.536 ± 0.863
CysSer: 2.304 ± 0.994
CysThr: 1.92 ± 0.96
CysVal: 2.304 ± 0.671
CysTrp: 0.384 ± 0.307
CysTyr: 1.152 ± 0.56
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.072 ± 1.203
AspCys: 2.688 ± 0.529
AspAsp: 3.072 ± 0.786
AspGlu: 4.992 ± 0.91
AspPhe: 2.304 ± 0.975
AspGly: 6.912 ± 1.948
AspHis: 1.536 ± 0.517
AspIle: 3.84 ± 1.401
AspLys: 1.536 ± 0.23
AspLeu: 3.84 ± 0.99
AspMet: 2.304 ± 0.848
AspAsn: 1.152 ± 0.56
AspPro: 4.224 ± 1.139
AspGln: 0.384 ± 0.306
AspArg: 3.072 ± 0.855
AspSer: 1.536 ± 0.544
AspThr: 4.224 ± 1.216
AspVal: 3.456 ± 0.958
AspTrp: 0.768 ± 0.614
AspTyr: 0.768 ± 0.715
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.84 ± 1.169
GluCys: 0.0 ± 0.0
GluAsp: 5.376 ± 0.697
GluGlu: 4.992 ± 1.62
GluPhe: 2.304 ± 0.785
GluGly: 4.992 ± 2.313
GluHis: 0.768 ± 0.584
GluIle: 1.92 ± 0.8
GluLys: 2.688 ± 1.257
GluLeu: 5.376 ± 1.324
GluMet: 1.536 ± 0.624
GluAsn: 0.384 ± 0.307
GluPro: 1.92 ± 0.374
GluGln: 3.84 ± 0.918
GluArg: 1.536 ± 0.531
GluSer: 2.688 ± 1.114
GluThr: 5.376 ± 1.02
GluVal: 3.072 ± 0.84
GluTrp: 0.384 ± 0.357
GluTyr: 1.92 ± 1.078
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.688 ± 0.68
PheCys: 1.536 ± 0.77
PheAsp: 1.536 ± 0.658
PheGlu: 1.92 ± 0.709
PhePhe: 3.84 ± 2.138
PheGly: 2.304 ± 0.884
PheHis: 0.0 ± 0.0
PheIle: 1.92 ± 0.691
PheLys: 3.072 ± 1.045
PheLeu: 4.224 ± 1.496
PheMet: 0.384 ± 0.306
PheAsn: 2.304 ± 0.773
PhePro: 0.768 ± 0.531
PheGln: 1.92 ± 0.737
PheArg: 3.456 ± 1.22
PheSer: 3.072 ± 0.817
PheThr: 2.304 ± 0.725
PheVal: 1.92 ± 1.047
PheTrp: 1.536 ± 0.855
PheTyr: 0.384 ± 0.384
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.072 ± 0.842
GlyCys: 1.152 ± 0.645
GlyAsp: 3.072 ± 0.767
GlyGlu: 2.688 ± 0.871
GlyPhe: 2.304 ± 0.889
GlyGly: 7.68 ± 1.918
GlyHis: 3.456 ± 0.857
GlyIle: 3.456 ± 0.699
GlyLys: 1.536 ± 0.66
GlyLeu: 5.376 ± 0.599
GlyMet: 1.152 ± 0.33
GlyAsn: 3.84 ± 1.259
GlyPro: 4.608 ± 1.452
GlyGln: 3.072 ± 0.941
GlyArg: 3.072 ± 0.995
GlySer: 5.76 ± 1.387
GlyThr: 6.528 ± 1.383
GlyVal: 4.608 ± 1.492
GlyTrp: 1.152 ± 0.608
GlyTyr: 3.84 ± 0.949
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.688 ± 0.864
HisCys: 0.384 ± 0.384
HisAsp: 1.152 ± 0.608
HisGlu: 0.768 ± 0.614
HisPhe: 1.536 ± 0.66
HisGly: 0.384 ± 0.357
HisHis: 0.384 ± 0.357
HisIle: 0.384 ± 0.357
HisLys: 1.152 ± 0.709
HisLeu: 1.536 ± 0.565
HisMet: 1.152 ± 0.527
HisAsn: 0.0 ± 0.0
HisPro: 0.768 ± 0.365
HisGln: 1.152 ± 0.762
HisArg: 1.92 ± 0.543
HisSer: 1.92 ± 0.962
HisThr: 1.536 ± 0.636
HisVal: 0.768 ± 0.541
HisTrp: 0.768 ± 0.426
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.84 ± 0.617
IleCys: 2.304 ± 1.002
IleAsp: 1.92 ± 0.927
IleGlu: 2.688 ± 0.725
IlePhe: 1.152 ± 0.507
IleGly: 2.688 ± 1.196
IleHis: 1.152 ± 0.641
IleIle: 1.92 ± 0.927
IleLys: 2.304 ± 1.228
IleLeu: 4.992 ± 0.991
IleMet: 1.152 ± 0.553
IleAsn: 2.304 ± 0.88
IlePro: 2.688 ± 0.844
IleGln: 1.152 ± 0.563
IleArg: 3.84 ± 1.216
IleSer: 3.84 ± 1.101
IleThr: 2.688 ± 0.79
IleVal: 4.224 ± 1.329
IleTrp: 0.384 ± 0.357
IleTyr: 1.536 ± 0.572
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.456 ± 1.353
LysCys: 1.536 ± 0.67
LysAsp: 3.072 ± 1.28
LysGlu: 2.304 ± 1.062
LysPhe: 1.152 ± 0.629
LysGly: 1.536 ± 0.504
LysHis: 0.768 ± 0.545
LysIle: 2.304 ± 0.466
LysLys: 4.224 ± 1.2
LysLeu: 4.224 ± 1.177
LysMet: 1.536 ± 0.792
LysAsn: 1.536 ± 0.758
LysPro: 2.304 ± 0.869
LysGln: 2.304 ± 0.758
LysArg: 6.912 ± 0.762
LysSer: 3.456 ± 1.703
LysThr: 2.688 ± 1.228
LysVal: 2.688 ± 0.602
LysTrp: 0.0 ± 0.0
LysTyr: 2.304 ± 0.838
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.224 ± 1.529
LeuCys: 3.072 ± 1.071
LeuAsp: 5.76 ± 1.662
LeuGlu: 4.608 ± 1.12
LeuPhe: 3.84 ± 1.482
LeuGly: 2.688 ± 0.443
LeuHis: 3.456 ± 0.627
LeuIle: 5.76 ± 1.156
LeuLys: 6.528 ± 1.991
LeuLeu: 6.912 ± 2.276
LeuMet: 1.92 ± 0.809
LeuAsn: 4.608 ± 1.462
LeuPro: 2.304 ± 1.006
LeuGln: 5.76 ± 1.894
LeuArg: 5.76 ± 1.301
LeuSer: 6.912 ± 1.089
LeuThr: 6.528 ± 1.26
LeuVal: 2.688 ± 1.272
LeuTrp: 2.688 ± 0.697
LeuTyr: 1.536 ± 0.706
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.768 ± 0.537
MetCys: 1.536 ± 0.67
MetAsp: 0.768 ± 0.475
MetGlu: 1.152 ± 0.608
MetPhe: 1.152 ± 0.581
MetGly: 3.072 ± 1.39
MetHis: 0.384 ± 0.307
MetIle: 1.92 ± 0.533
MetLys: 0.768 ± 0.489
MetLeu: 1.152 ± 0.616
MetMet: 0.768 ± 0.463
MetAsn: 0.768 ± 0.948
MetPro: 0.768 ± 0.396
MetGln: 1.152 ± 0.33
MetArg: 1.152 ± 0.843
MetSer: 2.304 ± 0.724
MetThr: 2.304 ± 0.93
MetVal: 0.768 ± 0.614
MetTrp: 0.0 ± 0.0
MetTyr: 1.536 ± 0.808
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.072 ± 1.107
AsnCys: 0.768 ± 0.489
AsnAsp: 1.92 ± 0.691
AsnGlu: 1.536 ± 0.716
AsnPhe: 1.152 ± 0.685
AsnGly: 2.688 ± 0.603
AsnHis: 0.0 ± 0.0
AsnIle: 4.224 ± 0.797
AsnLys: 0.768 ± 0.396
AsnLeu: 2.304 ± 1.27
AsnMet: 0.0 ± 0.0
AsnAsn: 1.536 ± 0.5
AsnPro: 2.688 ± 1.215
AsnGln: 0.384 ± 0.307
AsnArg: 3.072 ± 1.304
AsnSer: 3.456 ± 1.292
AsnThr: 4.608 ± 1.586
AsnVal: 2.304 ± 0.787
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.768 ± 0.462
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.688 ± 1.038
ProCys: 1.92 ± 1.121
ProAsp: 3.456 ± 1.172
ProGlu: 5.376 ± 1.667
ProPhe: 1.92 ± 0.661
ProGly: 3.072 ± 1.441
ProHis: 1.152 ± 0.558
ProIle: 2.688 ± 0.586
ProLys: 2.688 ± 0.792
ProLeu: 6.912 ± 1.058
ProMet: 1.152 ± 0.492
ProAsn: 1.536 ± 0.399
ProPro: 6.144 ± 1.339
ProGln: 3.072 ± 1.347
ProArg: 3.84 ± 0.825
ProSer: 4.608 ± 1.571
ProThr: 1.152 ± 0.608
ProVal: 6.528 ± 1.227
ProTrp: 1.152 ± 0.608
ProTyr: 2.304 ± 0.953
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.92 ± 0.74
GlnCys: 1.152 ± 0.785
GlnAsp: 3.072 ± 0.603
GlnGlu: 1.152 ± 0.602
GlnPhe: 0.768 ± 0.426
GlnGly: 4.992 ± 1.041
GlnHis: 1.152 ± 0.709
GlnIle: 2.304 ± 0.988
GlnLys: 0.768 ± 0.715
GlnLeu: 3.456 ± 0.782
GlnMet: 1.536 ± 0.399
GlnAsn: 1.536 ± 0.718
GlnPro: 1.92 ± 0.625
GlnGln: 2.688 ± 1.496
GlnArg: 2.304 ± 0.872
GlnSer: 3.072 ± 0.938
GlnThr: 3.456 ± 1.272
GlnVal: 0.768 ± 0.547
GlnTrp: 1.152 ± 0.379
GlnTyr: 1.536 ± 0.518
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.224 ± 0.969
ArgCys: 2.304 ± 0.686
ArgAsp: 2.304 ± 0.62
ArgGlu: 3.072 ± 0.751
ArgPhe: 3.072 ± 0.943
ArgGly: 3.84 ± 1.605
ArgHis: 1.92 ± 1.109
ArgIle: 3.84 ± 1.469
ArgLys: 2.688 ± 1.245
ArgLeu: 4.224 ± 0.875
ArgMet: 1.92 ± 1.025
ArgAsn: 5.376 ± 1.504
ArgPro: 2.688 ± 0.783
ArgGln: 2.304 ± 0.757
ArgArg: 4.608 ± 1.997
ArgSer: 3.072 ± 1.141
ArgThr: 6.912 ± 1.679
ArgVal: 4.224 ± 1.289
ArgTrp: 0.768 ± 0.567
ArgTyr: 2.688 ± 0.569
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.76 ± 1.441
SerCys: 2.304 ± 0.915
SerAsp: 5.76 ± 1.094
SerGlu: 3.456 ± 2.126
SerPhe: 3.456 ± 1.497
SerGly: 6.912 ± 1.096
SerHis: 1.536 ± 0.618
SerIle: 1.152 ± 0.724
SerLys: 4.608 ± 1.517
SerLeu: 6.912 ± 1.51
SerMet: 0.384 ± 0.307
SerAsn: 2.688 ± 0.951
SerPro: 6.912 ± 1.849
SerGln: 1.92 ± 0.604
SerArg: 3.072 ± 1.582
SerSer: 8.449 ± 2.742
SerThr: 3.84 ± 0.771
SerVal: 3.84 ± 1.047
SerTrp: 0.768 ± 0.518
SerTyr: 0.768 ± 0.462
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.84 ± 0.978
ThrCys: 0.768 ± 0.473
ThrAsp: 2.688 ± 0.777
ThrGlu: 4.608 ± 1.059
ThrPhe: 3.072 ± 0.902
ThrGly: 4.608 ± 2.102
ThrHis: 0.768 ± 0.489
ThrIle: 3.456 ± 1.253
ThrLys: 2.304 ± 1.066
ThrLeu: 6.912 ± 0.669
ThrMet: 1.152 ± 0.592
ThrAsn: 1.536 ± 0.542
ThrPro: 5.76 ± 1.158
ThrGln: 2.304 ± 0.883
ThrArg: 4.608 ± 1.357
ThrSer: 5.376 ± 1.488
ThrThr: 3.456 ± 1.564
ThrVal: 8.449 ± 2.253
ThrTrp: 1.92 ± 0.789
ThrTyr: 3.456 ± 1.127
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.152 ± 0.494
ValCys: 0.768 ± 0.616
ValAsp: 3.456 ± 0.799
ValGlu: 2.304 ± 1.356
ValPhe: 2.688 ± 0.872
ValGly: 4.992 ± 1.229
ValHis: 0.0 ± 0.0
ValIle: 1.92 ± 0.57
ValLys: 4.608 ± 1.357
ValLeu: 6.528 ± 1.52
ValMet: 0.768 ± 0.799
ValAsn: 3.456 ± 0.951
ValPro: 7.68 ± 2.682
ValGln: 2.688 ± 0.981
ValArg: 3.84 ± 1.218
ValSer: 6.528 ± 1.547
ValThr: 3.84 ± 0.908
ValVal: 3.456 ± 0.972
ValTrp: 0.384 ± 0.306
ValTyr: 0.384 ± 0.306
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.384 ± 0.307
TrpCys: 0.384 ± 0.442
TrpAsp: 2.688 ± 0.844
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.768 ± 0.396
TrpGly: 1.152 ± 0.637
TrpHis: 0.384 ± 0.357
TrpIle: 0.384 ± 0.449
TrpLys: 1.536 ± 0.732
TrpLeu: 1.92 ± 0.611
TrpMet: 0.384 ± 0.306
TrpAsn: 0.0 ± 0.0
TrpPro: 0.768 ± 0.613
TrpGln: 1.152 ± 0.603
TrpArg: 0.768 ± 0.613
TrpSer: 0.384 ± 0.357
TrpThr: 1.536 ± 1.04
TrpVal: 1.536 ± 0.701
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.384 ± 0.36
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.304 ± 1.072
TyrCys: 1.152 ± 0.61
TyrAsp: 1.92 ± 0.625
TyrGlu: 1.92 ± 0.906
TyrPhe: 0.384 ± 0.357
TyrGly: 1.92 ± 0.374
TyrHis: 0.384 ± 0.307
TyrIle: 1.92 ± 0.595
TyrLys: 1.152 ± 0.581
TyrLeu: 3.84 ± 1.263
TyrMet: 0.768 ± 0.426
TyrAsn: 0.384 ± 0.36
TyrPro: 1.152 ± 0.685
TyrGln: 0.384 ± 0.342
TyrArg: 3.072 ± 0.811
TyrSer: 1.536 ± 0.399
TyrThr: 2.688 ± 0.85
TyrVal: 1.536 ± 0.894
TyrTrp: 1.92 ± 0.692
TyrTyr: 1.536 ± 0.798
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (2605 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
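A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation is reported as the error. The page does not say which spread measure Proteome-pI uses, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping dipeptides."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)

toy = ["MSSLLA", "MKRSS", "MAAGSSV"]  # hypothetical sequences
err = bootstrap_error(toy, "SS")
print(f"SS: {dipeptide_per_mille(toy, 'SS'):.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a frequency depends on which proteins happen to be in the proteome.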

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski