Amino acid dipepetide frequency for Telfairia golden mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.465AlaAla: 2.465 ± 1.471
0.616AlaCys: 0.616 ± 0.641
1.848AlaAsp: 1.848 ± 0.809
2.465AlaGlu: 2.465 ± 1.506
1.848AlaPhe: 1.848 ± 0.75
0.616AlaGly: 0.616 ± 0.583
1.848AlaHis: 1.848 ± 1.054
2.465AlaIle: 2.465 ± 1.242
5.545AlaLys: 5.545 ± 1.159
6.778AlaLeu: 6.778 ± 1.92
0.616AlaMet: 0.616 ± 0.547
1.232AlaAsn: 1.232 ± 0.755
3.081AlaPro: 3.081 ± 1.357
3.697AlaGln: 3.697 ± 1.131
3.697AlaArg: 3.697 ± 1.513
5.545AlaSer: 5.545 ± 1.112
3.697AlaThr: 3.697 ± 1.484
4.313AlaVal: 4.313 ± 1.697
1.848AlaTrp: 1.848 ± 1.043
1.232AlaTyr: 1.232 ± 0.677
0.0AlaXaa: 0.0 ± 0.0
Cys
1.232CysAla: 1.232 ± 0.879
1.232CysCys: 1.232 ± 1.122
0.616CysAsp: 0.616 ± 0.583
0.616CysGlu: 0.616 ± 0.641
1.232CysPhe: 1.232 ± 0.861
1.848CysGly: 1.848 ± 1.208
0.0CysHis: 0.0 ± 0.0
0.616CysIle: 0.616 ± 0.641
1.232CysLys: 1.232 ± 1.281
1.232CysLeu: 1.232 ± 0.869
1.232CysMet: 1.232 ± 0.664
1.232CysAsn: 1.232 ± 0.728
1.232CysPro: 1.232 ± 1.122
0.0CysGln: 0.0 ± 0.0
0.616CysArg: 0.616 ± 0.517
2.465CysSer: 2.465 ± 1.322
1.232CysThr: 1.232 ± 0.679
1.848CysVal: 1.848 ± 0.824
0.616CysTrp: 0.616 ± 0.517
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.232AspAla: 1.232 ± 0.728
1.848AspCys: 1.848 ± 1.208
1.848AspAsp: 1.848 ± 1.208
1.848AspGlu: 1.848 ± 0.675
3.081AspPhe: 3.081 ± 0.895
3.081AspGly: 3.081 ± 1.007
2.465AspHis: 2.465 ± 1.144
2.465AspIle: 2.465 ± 1.616
2.465AspLys: 2.465 ± 0.826
4.929AspLeu: 4.929 ± 1.65
0.616AspMet: 0.616 ± 0.802
1.232AspAsn: 1.232 ± 0.785
1.232AspPro: 1.232 ± 0.677
2.465AspGln: 2.465 ± 0.84
3.081AspArg: 3.081 ± 1.239
4.929AspSer: 4.929 ± 1.502
2.465AspThr: 2.465 ± 0.912
8.01AspVal: 8.01 ± 2.509
1.232AspTrp: 1.232 ± 0.661
2.465AspTyr: 2.465 ± 1.288
0.0AspXaa: 0.0 ± 0.0
Glu
4.929GluAla: 4.929 ± 1.865
0.616GluCys: 0.616 ± 0.547
1.232GluAsp: 1.232 ± 1.095
3.697GluGlu: 3.697 ± 1.789
2.465GluPhe: 2.465 ± 1.006
6.161GluGly: 6.161 ± 0.94
0.616GluHis: 0.616 ± 0.583
1.232GluIle: 1.232 ± 0.829
0.616GluLys: 0.616 ± 0.517
5.545GluLeu: 5.545 ± 1.618
0.0GluMet: 0.0 ± 0.0
3.081GluAsn: 3.081 ± 1.664
3.081GluPro: 3.081 ± 0.948
1.232GluGln: 1.232 ± 0.756
2.465GluArg: 2.465 ± 1.144
2.465GluSer: 2.465 ± 1.353
1.232GluThr: 1.232 ± 0.728
1.232GluVal: 1.232 ± 0.828
1.232GluTrp: 1.232 ± 1.033
1.232GluTyr: 1.232 ± 1.095
0.0GluXaa: 0.0 ± 0.0
Phe
1.232PheAla: 1.232 ± 0.879
0.616PheCys: 0.616 ± 0.641
3.081PheAsp: 3.081 ± 1.914
1.232PheGlu: 1.232 ± 0.728
1.848PhePhe: 1.848 ± 0.707
1.232PheGly: 1.232 ± 0.785
2.465PheHis: 2.465 ± 1.354
1.232PheIle: 1.232 ± 1.167
4.313PheLys: 4.313 ± 2.128
4.313PheLeu: 4.313 ± 2.269
0.616PheMet: 0.616 ± 0.517
1.848PheAsn: 1.848 ± 0.94
2.465PhePro: 2.465 ± 1.278
1.848PheGln: 1.848 ± 1.023
5.545PheArg: 5.545 ± 1.557
4.929PheSer: 4.929 ± 2.109
1.232PheThr: 1.232 ± 0.669
1.848PheVal: 1.848 ± 0.809
1.848PheTrp: 1.848 ± 0.94
1.232PheTyr: 1.232 ± 0.888
0.0PheXaa: 0.0 ± 0.0
Gly
3.697GlyAla: 3.697 ± 1.019
1.848GlyCys: 1.848 ± 1.404
5.545GlyAsp: 5.545 ± 1.541
3.697GlyGlu: 3.697 ± 1.048
1.848GlyPhe: 1.848 ± 1.004
3.697GlyGly: 3.697 ± 1.756
1.232GlyHis: 1.232 ± 0.677
1.232GlyIle: 1.232 ± 0.847
3.697GlyLys: 3.697 ± 2.045
1.848GlyLeu: 1.848 ± 0.992
1.232GlyMet: 1.232 ± 0.665
2.465GlyAsn: 2.465 ± 1.398
3.081GlyPro: 3.081 ± 0.983
1.848GlyGln: 1.848 ± 1.023
2.465GlyArg: 2.465 ± 1.13
4.313GlySer: 4.313 ± 1.343
4.313GlyThr: 4.313 ± 2.165
1.232GlyVal: 1.232 ± 0.861
0.616GlyTrp: 0.616 ± 0.547
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
3.081HisAla: 3.081 ± 1.389
1.848HisCys: 1.848 ± 1.395
2.465HisAsp: 2.465 ± 1.613
0.616HisGlu: 0.616 ± 0.63
1.232HisPhe: 1.232 ± 1.033
1.848HisGly: 1.848 ± 1.355
1.232HisHis: 1.232 ± 0.888
1.232HisIle: 1.232 ± 0.879
1.232HisLys: 1.232 ± 0.924
3.081HisLeu: 3.081 ± 1.147
1.848HisMet: 1.848 ± 0.883
3.081HisAsn: 3.081 ± 1.797
2.465HisPro: 2.465 ± 1.504
1.848HisGln: 1.848 ± 0.764
1.848HisArg: 1.848 ± 0.93
1.848HisSer: 1.848 ± 0.91
1.848HisThr: 1.848 ± 1.922
1.848HisVal: 1.848 ± 0.937
0.0HisTrp: 0.0 ± 0.0
1.848HisTyr: 1.848 ± 0.824
0.0HisXaa: 0.0 ± 0.0
Ile
1.232IleAla: 1.232 ± 0.661
1.232IleCys: 1.232 ± 0.677
4.929IleAsp: 4.929 ± 2.744
1.848IleGlu: 1.848 ± 1.18
1.848IlePhe: 1.848 ± 1.55
0.616IleGly: 0.616 ± 0.583
1.848IleHis: 1.848 ± 1.686
2.465IleIle: 2.465 ± 2.02
4.929IleLys: 4.929 ± 0.899
0.616IleLeu: 0.616 ± 0.583
1.848IleMet: 1.848 ± 0.919
1.232IleAsn: 1.232 ± 1.137
0.616IlePro: 0.616 ± 0.517
3.697IleGln: 3.697 ± 1.912
6.161IleArg: 6.161 ± 1.243
4.313IleSer: 4.313 ± 2.045
2.465IleThr: 2.465 ± 1.684
3.697IleVal: 3.697 ± 1.541
1.232IleTrp: 1.232 ± 0.828
1.232IleTyr: 1.232 ± 0.77
0.0IleXaa: 0.0 ± 0.0
Lys
4.313LysAla: 4.313 ± 1.628
1.232LysCys: 1.232 ± 0.847
3.697LysAsp: 3.697 ± 1.705
3.081LysGlu: 3.081 ± 1.03
0.616LysPhe: 0.616 ± 0.802
1.848LysGly: 1.848 ± 1.18
1.848LysHis: 1.848 ± 0.902
3.081LysIle: 3.081 ± 1.191
1.848LysLys: 1.848 ± 1.421
4.929LysLeu: 4.929 ± 2.018
1.232LysMet: 1.232 ± 1.167
3.081LysAsn: 3.081 ± 1.674
3.081LysPro: 3.081 ± 1.224
1.232LysGln: 1.232 ± 0.857
5.545LysArg: 5.545 ± 2.972
5.545LysSer: 5.545 ± 1.427
0.616LysThr: 0.616 ± 0.517
4.313LysVal: 4.313 ± 1.818
0.0LysTrp: 0.0 ± 0.0
3.081LysTyr: 3.081 ± 1.051
0.0LysXaa: 0.0 ± 0.0
Leu
2.465LeuAla: 2.465 ± 1.149
1.232LeuCys: 1.232 ± 1.033
6.161LeuAsp: 6.161 ± 2.712
3.081LeuGlu: 3.081 ± 1.128
3.081LeuPhe: 3.081 ± 1.551
3.697LeuGly: 3.697 ± 0.596
3.081LeuHis: 3.081 ± 1.515
3.697LeuIle: 3.697 ± 1.333
5.545LeuLys: 5.545 ± 1.116
9.242LeuLeu: 9.242 ± 1.801
1.848LeuMet: 1.848 ± 0.979
6.778LeuAsn: 6.778 ± 1.208
3.697LeuPro: 3.697 ± 1.749
1.848LeuGln: 1.848 ± 1.395
6.778LeuArg: 6.778 ± 2.419
4.929LeuSer: 4.929 ± 1.293
5.545LeuThr: 5.545 ± 1.257
1.848LeuVal: 1.848 ± 0.93
0.0LeuTrp: 0.0 ± 0.0
2.465LeuTyr: 2.465 ± 1.7
0.0LeuXaa: 0.0 ± 0.0
Met
1.232MetAla: 1.232 ± 0.785
0.616MetCys: 0.616 ± 0.583
3.081MetAsp: 3.081 ± 0.903
0.616MetGlu: 0.616 ± 0.547
1.848MetPhe: 1.848 ± 1.185
1.848MetGly: 1.848 ± 0.835
0.616MetHis: 0.616 ± 0.561
0.616MetIle: 0.616 ± 0.547
0.616MetLys: 0.616 ± 0.547
0.616MetLeu: 0.616 ± 0.561
1.232MetMet: 1.232 ± 0.787
0.616MetAsn: 0.616 ± 0.641
0.0MetPro: 0.0 ± 0.0
0.616MetGln: 0.616 ± 0.63
2.465MetArg: 2.465 ± 0.781
1.848MetSer: 1.848 ± 0.707
0.0MetThr: 0.0 ± 0.0
1.848MetVal: 1.848 ± 1.227
0.616MetTrp: 0.616 ± 0.561
2.465MetTyr: 2.465 ± 1.905
0.0MetXaa: 0.0 ± 0.0
Asn
4.313AsnAla: 4.313 ± 1.504
0.616AsnCys: 0.616 ± 0.547
1.848AsnAsp: 1.848 ± 0.675
3.697AsnGlu: 3.697 ± 1.475
1.848AsnPhe: 1.848 ± 1.708
2.465AsnGly: 2.465 ± 0.825
3.081AsnHis: 3.081 ± 2.66
2.465AsnIle: 2.465 ± 0.912
0.0AsnLys: 0.0 ± 0.0
3.697AsnLeu: 3.697 ± 1.4
1.848AsnMet: 1.848 ± 1.262
2.465AsnAsn: 2.465 ± 0.825
3.697AsnPro: 3.697 ± 1.093
1.232AsnGln: 1.232 ± 0.829
4.929AsnArg: 4.929 ± 1.005
4.313AsnSer: 4.313 ± 1.123
3.697AsnThr: 3.697 ± 1.263
5.545AsnVal: 5.545 ± 2.225
0.0AsnTrp: 0.0 ± 0.0
2.465AsnTyr: 2.465 ± 1.288
0.0AsnXaa: 0.0 ± 0.0
Pro
0.616ProAla: 0.616 ± 0.517
1.232ProCys: 1.232 ± 0.77
1.848ProAsp: 1.848 ± 1.226
3.081ProGlu: 3.081 ± 0.971
2.465ProPhe: 2.465 ± 1.176
1.848ProGly: 1.848 ± 0.675
3.081ProHis: 3.081 ± 1.519
4.313ProIle: 4.313 ± 0.93
4.313ProLys: 4.313 ± 1.959
2.465ProLeu: 2.465 ± 0.907
0.0ProMet: 0.0 ± 0.0
4.929ProAsn: 4.929 ± 2.022
1.848ProPro: 1.848 ± 1.054
1.848ProGln: 1.848 ± 0.93
4.313ProArg: 4.313 ± 1.452
4.313ProSer: 4.313 ± 1.799
6.161ProThr: 6.161 ± 2.154
4.313ProVal: 4.313 ± 2.022
0.616ProTrp: 0.616 ± 0.547
1.848ProTyr: 1.848 ± 1.301
0.0ProXaa: 0.0 ± 0.0
Gln
3.081GlnAla: 3.081 ± 1.39
0.0GlnCys: 0.0 ± 0.0
1.848GlnAsp: 1.848 ± 1.185
2.465GlnGlu: 2.465 ± 1.096
0.616GlnPhe: 0.616 ± 0.517
3.081GlnGly: 3.081 ± 1.444
0.616GlnHis: 0.616 ± 0.583
2.465GlnIle: 2.465 ± 1.209
0.616GlnLys: 0.616 ± 0.561
2.465GlnLeu: 2.465 ± 1.08
0.0GlnMet: 0.0 ± 0.0
1.848GlnAsn: 1.848 ± 1.054
2.465GlnPro: 2.465 ± 1.678
0.616GlnGln: 0.616 ± 0.517
2.465GlnArg: 2.465 ± 0.84
3.697GlnSer: 3.697 ± 1.496
1.848GlnThr: 1.848 ± 1.131
4.929GlnVal: 4.929 ± 2.173
0.0GlnTrp: 0.0 ± 0.0
0.616GlnTyr: 0.616 ± 0.583
0.0GlnXaa: 0.0 ± 0.0
Arg
3.697ArgAla: 3.697 ± 1.662
1.232ArgCys: 1.232 ± 0.664
4.929ArgAsp: 4.929 ± 1.62
1.848ArgGlu: 1.848 ± 1.043
4.313ArgPhe: 4.313 ± 1.685
3.697ArgGly: 3.697 ± 1.625
3.081ArgHis: 3.081 ± 1.387
3.081ArgIle: 3.081 ± 0.913
2.465ArgLys: 2.465 ± 1.189
6.161ArgLeu: 6.161 ± 1.377
1.848ArgMet: 1.848 ± 1.922
3.081ArgAsn: 3.081 ± 1.709
7.394ArgPro: 7.394 ± 1.469
1.848ArgGln: 1.848 ± 0.848
9.242ArgArg: 9.242 ± 3.951
8.01ArgSer: 8.01 ± 1.901
4.929ArgThr: 4.929 ± 1.326
3.081ArgVal: 3.081 ± 1.224
0.0ArgTrp: 0.0 ± 0.0
4.313ArgTyr: 4.313 ± 2.285
0.0ArgXaa: 0.0 ± 0.0
Ser
6.161SerAla: 6.161 ± 1.851
0.616SerCys: 0.616 ± 0.517
3.081SerAsp: 3.081 ± 0.594
1.848SerGlu: 1.848 ± 0.991
6.778SerPhe: 6.778 ± 1.321
2.465SerGly: 2.465 ± 1.752
1.848SerHis: 1.848 ± 1.306
3.697SerIle: 3.697 ± 1.677
6.161SerLys: 6.161 ± 1.589
4.929SerLeu: 4.929 ± 1.084
3.081SerMet: 3.081 ± 1.633
6.161SerAsn: 6.161 ± 1.78
7.394SerPro: 7.394 ± 1.693
1.848SerGln: 1.848 ± 1.395
3.697SerArg: 3.697 ± 1.916
11.091SerSer: 11.091 ± 3.554
6.161SerThr: 6.161 ± 1.944
6.161SerVal: 6.161 ± 2.257
0.616SerTrp: 0.616 ± 0.641
4.313SerTyr: 4.313 ± 1.709
0.0SerXaa: 0.0 ± 0.0
Thr
2.465ThrAla: 2.465 ± 1.189
3.081ThrCys: 3.081 ± 1.073
1.232ThrAsp: 1.232 ± 0.702
2.465ThrGlu: 2.465 ± 0.84
3.081ThrPhe: 3.081 ± 1.776
6.161ThrGly: 6.161 ± 2.076
4.929ThrHis: 4.929 ± 1.496
4.313ThrIle: 4.313 ± 1.624
0.616ThrLys: 0.616 ± 0.517
3.081ThrLeu: 3.081 ± 1.233
0.616ThrMet: 0.616 ± 0.547
4.313ThrAsn: 4.313 ± 1.772
2.465ThrPro: 2.465 ± 0.84
2.465ThrGln: 2.465 ± 1.648
4.929ThrArg: 4.929 ± 1.588
3.697ThrSer: 3.697 ± 1.349
4.929ThrThr: 4.929 ± 1.576
3.081ThrVal: 3.081 ± 1.242
1.232ThrTrp: 1.232 ± 0.756
3.697ThrTyr: 3.697 ± 1.218
0.0ThrXaa: 0.0 ± 0.0
Val
2.465ValAla: 2.465 ± 1.101
0.616ValCys: 0.616 ± 0.583
2.465ValAsp: 2.465 ± 1.131
3.697ValGlu: 3.697 ± 1.94
1.848ValPhe: 1.848 ± 1.273
1.232ValGly: 1.232 ± 0.881
2.465ValHis: 2.465 ± 0.688
5.545ValIle: 5.545 ± 1.715
5.545ValLys: 5.545 ± 0.883
4.929ValLeu: 4.929 ± 2.307
0.616ValMet: 0.616 ± 0.641
3.697ValAsn: 3.697 ± 1.855
4.313ValPro: 4.313 ± 1.565
3.697ValGln: 3.697 ± 1.138
4.313ValArg: 4.313 ± 1.827
6.161ValSer: 6.161 ± 1.724
3.697ValThr: 3.697 ± 2.369
3.081ValVal: 3.081 ± 1.478
1.232ValTrp: 1.232 ± 1.01
4.313ValTyr: 4.313 ± 0.878
0.0ValXaa: 0.0 ± 0.0
Trp
2.465TrpAla: 2.465 ± 1.08
0.0TrpCys: 0.0 ± 0.0
0.616TrpAsp: 0.616 ± 0.561
0.616TrpGlu: 0.616 ± 0.547
0.0TrpPhe: 0.0 ± 0.0
0.616TrpGly: 0.616 ± 0.517
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.616TrpLeu: 0.616 ± 0.641
0.616TrpMet: 0.616 ± 0.649
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.616TrpGln: 0.616 ± 0.517
0.0TrpArg: 0.0 ± 0.0
1.848TrpSer: 1.848 ± 1.004
2.465TrpThr: 2.465 ± 1.77
1.232TrpVal: 1.232 ± 0.669
0.0TrpTrp: 0.0 ± 0.0
0.616TrpTyr: 0.616 ± 0.517
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.081TyrAla: 3.081 ± 0.966
0.0TyrCys: 0.0 ± 0.0
0.616TyrAsp: 0.616 ± 0.641
1.848TyrGlu: 1.848 ± 0.938
3.081TyrPhe: 3.081 ± 1.869
1.848TyrGly: 1.848 ± 0.712
0.0TyrHis: 0.0 ± 0.0
1.848TyrIle: 1.848 ± 0.75
2.465TyrLys: 2.465 ± 1.688
5.545TyrLeu: 5.545 ± 1.692
1.848TyrMet: 1.848 ± 0.895
1.848TyrAsn: 1.848 ± 0.722
1.848TyrPro: 1.848 ± 1.143
1.232TyrGln: 1.232 ± 0.77
3.697TyrArg: 3.697 ± 2.353
1.848TyrSer: 1.848 ± 0.824
4.313TyrThr: 4.313 ± 1.738
2.465TyrVal: 2.465 ± 1.331
0.0TyrTrp: 0.0 ± 0.0
0.616TyrTyr: 0.616 ± 0.547
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1624 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski