Amino acid dipeptide frequency for Cotton leaf curl Allahabad virus [India:Karnal:OY81B:2005]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
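A minimal sketch of how such frequencies could be computed. It assumes one-letter protein sequences, overlapping windows of two residues counted within each protein, and the mapping of any non-standard letter to X (Xaa). The function name `dipeptide_frequencies` and the toy sequences are illustrative only and are not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping residue pairs.

    Assumptions (not confirmed by the source): pairs are counted within
    each protein only, and any non-standard letter is treated as 'X'.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_EXPECTATION = 1 / len(AMINO_ACIDS) ** 2

# Toy input: two short hypothetical sequences, one with a non-standard residue.
freqs = dipeptide_frequencies(["MAAK", "AAZ"])
```

Here `freqs["AA"]` can be compared with `UNIFORM_EXPECTATION` to decide whether AlaAla is over- or underrepresented in the toy input.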

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions; the 0.25% expectation above corresponds to 2.5‰.
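The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and the 2.5‰ threshold (1/400) is an assumption about how the red/blue marking is decided; the source does not state the exact rule.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected_per_mille=2.5):
    """Label a dipeptide relative to the assumed uniform expectation of 2.5 per mille."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "expected"


# Two values taken from the table: SerSer (9.67) and AlaAsp (0.806).
serser_fraction = per_mille_to_fraction(9.67)
serser_label = classify(9.67)
alaasp_label = classify(0.806)
```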

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.029AlaAla: 4.029 ± 1.959
1.612AlaCys: 1.612 ± 0.717
0.806AlaAsp: 0.806 ± 0.769
0.806AlaGlu: 0.806 ± 0.602
0.0AlaPhe: 0.0 ± 0.0
2.417AlaGly: 2.417 ± 1.078
1.612AlaHis: 1.612 ± 1.291
0.806AlaIle: 0.806 ± 0.602
4.029AlaLys: 4.029 ± 1.728
5.641AlaLeu: 5.641 ± 1.834
0.0AlaMet: 0.0 ± 0.0
0.806AlaAsn: 0.806 ± 0.602
3.223AlaPro: 3.223 ± 1.239
4.029AlaGln: 4.029 ± 2.308
2.417AlaArg: 2.417 ± 1.404
4.835AlaSer: 4.835 ± 2.267
3.223AlaThr: 3.223 ± 2.088
0.806AlaVal: 0.806 ± 0.909
1.612AlaTrp: 1.612 ± 0.717
2.417AlaTyr: 2.417 ± 0.904
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.612CysCys: 1.612 ± 1.639
0.0CysAsp: 0.0 ± 0.0
1.612CysGlu: 1.612 ± 0.717
0.806CysPhe: 0.806 ± 0.909
1.612CysGly: 1.612 ± 0.913
0.806CysHis: 0.806 ± 0.888
3.223CysIle: 3.223 ± 1.085
0.806CysLys: 0.806 ± 0.769
0.806CysLeu: 0.806 ± 0.819
0.806CysMet: 0.806 ± 0.819
1.612CysAsn: 1.612 ± 0.913
3.223CysPro: 3.223 ± 1.958
0.806CysGln: 0.806 ± 0.602
1.612CysArg: 1.612 ± 1.151
2.417CysSer: 2.417 ± 1.834
0.806CysThr: 0.806 ± 0.769
0.806CysVal: 0.806 ± 0.769
0.0CysTrp: 0.0 ± 0.0
0.806CysTyr: 0.806 ± 0.82
0.0CysXaa: 0.0 ± 0.0
Asp
2.417AspAla: 2.417 ± 1.807
0.0AspCys: 0.0 ± 0.0
1.612AspAsp: 1.612 ± 0.913
2.417AspGlu: 2.417 ± 0.882
1.612AspPhe: 1.612 ± 0.717
2.417AspGly: 2.417 ± 1.807
0.806AspHis: 0.806 ± 0.602
4.835AspIle: 4.835 ± 1.604
2.417AspLys: 2.417 ± 1.078
3.223AspLeu: 3.223 ± 1.737
0.0AspMet: 0.0 ± 0.0
1.612AspAsn: 1.612 ± 1.043
2.417AspPro: 2.417 ± 1.095
2.417AspGln: 2.417 ± 1.137
2.417AspArg: 2.417 ± 1.359
2.417AspSer: 2.417 ± 0.804
0.0AspThr: 0.0 ± 0.0
6.446AspVal: 6.446 ± 1.818
1.612AspTrp: 1.612 ± 0.913
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.223GluAla: 3.223 ± 0.922
0.806GluCys: 0.806 ± 0.888
2.417GluAsp: 2.417 ± 0.986
4.029GluGlu: 4.029 ± 2.443
2.417GluPhe: 2.417 ± 1.367
1.612GluGly: 1.612 ± 0.717
1.612GluHis: 1.612 ± 1.291
0.0GluIle: 0.0 ± 0.0
2.417GluLys: 2.417 ± 1.253
4.835GluLeu: 4.835 ± 2.653
0.0GluMet: 0.0 ± 0.0
3.223GluAsn: 3.223 ± 2.088
1.612GluPro: 1.612 ± 0.717
2.417GluGln: 2.417 ± 0.904
0.806GluArg: 0.806 ± 0.909
3.223GluSer: 3.223 ± 0.922
3.223GluThr: 3.223 ± 1.764
2.417GluVal: 2.417 ± 1.281
1.612GluTrp: 1.612 ± 0.913
0.806GluTyr: 0.806 ± 0.82
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
1.612PheCys: 1.612 ± 0.717
2.417PheAsp: 2.417 ± 1.078
0.806PheGlu: 0.806 ± 0.769
0.806PhePhe: 0.806 ± 0.769
1.612PheGly: 1.612 ± 1.537
1.612PheHis: 1.612 ± 1.205
2.417PheIle: 2.417 ± 1.115
3.223PheLys: 3.223 ± 1.815
8.864PheLeu: 8.864 ± 2.616
0.806PheMet: 0.806 ± 0.602
3.223PheAsn: 3.223 ± 1.981
0.806PhePro: 0.806 ± 0.602
3.223PheGln: 3.223 ± 1.762
1.612PheArg: 1.612 ± 0.908
4.835PheSer: 4.835 ± 2.48
3.223PheThr: 3.223 ± 2.032
2.417PheVal: 2.417 ± 0.904
0.0PheTrp: 0.0 ± 0.0
1.612PheTyr: 1.612 ± 0.985
0.0PheXaa: 0.0 ± 0.0
Gly
1.612GlyAla: 1.612 ± 1.205
4.029GlyCys: 4.029 ± 0.966
1.612GlyAsp: 1.612 ± 1.205
1.612GlyGlu: 1.612 ± 1.043
1.612GlyPhe: 1.612 ± 1.109
3.223GlyGly: 3.223 ± 1.053
1.612GlyHis: 1.612 ± 0.913
1.612GlyIle: 1.612 ± 0.913
5.641GlyLys: 5.641 ± 2.728
4.029GlyLeu: 4.029 ± 2.025
1.612GlyMet: 1.612 ± 1.639
0.0GlyAsn: 0.0 ± 0.0
4.029GlyPro: 4.029 ± 1.668
0.806GlyGln: 0.806 ± 0.769
2.417GlyArg: 2.417 ± 1.253
5.641GlySer: 5.641 ± 1.674
2.417GlyThr: 2.417 ± 1.083
4.029GlyVal: 4.029 ± 2.422
0.0GlyTrp: 0.0 ± 0.0
0.806GlyTyr: 0.806 ± 0.82
0.0GlyXaa: 0.0 ± 0.0
His
2.417HisAla: 2.417 ± 1.359
1.612HisCys: 1.612 ± 1.109
2.417HisAsp: 2.417 ± 1.312
0.0HisGlu: 0.0 ± 0.0
3.223HisPhe: 3.223 ± 1.926
2.417HisGly: 2.417 ± 1.698
2.417HisHis: 2.417 ± 1.926
3.223HisIle: 3.223 ± 1.474
0.806HisLys: 0.806 ± 0.909
1.612HisLeu: 1.612 ± 0.87
0.806HisMet: 0.806 ± 0.82
3.223HisAsn: 3.223 ± 1.827
2.417HisPro: 2.417 ± 1.244
2.417HisGln: 2.417 ± 1.155
4.835HisArg: 4.835 ± 2.87
2.417HisSer: 2.417 ± 1.362
2.417HisThr: 2.417 ± 1.359
1.612HisVal: 1.612 ± 1.234
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
2.417IleCys: 2.417 ± 1.823
0.806IleAsp: 0.806 ± 0.602
0.806IleGlu: 0.806 ± 0.602
1.612IlePhe: 1.612 ± 1.205
1.612IleGly: 1.612 ± 1.537
2.417IleHis: 2.417 ± 1.095
1.612IleIle: 1.612 ± 1.234
6.446IleLys: 6.446 ± 2.7
5.641IleLeu: 5.641 ± 3.189
0.806IleMet: 0.806 ± 0.82
3.223IleAsn: 3.223 ± 1.325
3.223IlePro: 3.223 ± 1.444
4.029IleGln: 4.029 ± 1.762
4.029IleArg: 4.029 ± 1.608
5.641IleSer: 5.641 ± 1.746
2.417IleThr: 2.417 ± 2.006
4.029IleVal: 4.029 ± 1.327
2.417IleTrp: 2.417 ± 1.799
0.806IleTyr: 0.806 ± 0.769
0.0IleXaa: 0.0 ± 0.0
Lys
3.223LysAla: 3.223 ± 1.815
1.612LysCys: 1.612 ± 0.908
0.806LysAsp: 0.806 ± 0.602
4.029LysGlu: 4.029 ± 1.464
2.417LysPhe: 2.417 ± 0.804
3.223LysGly: 3.223 ± 1.377
4.029LysHis: 4.029 ± 2.414
3.223LysIle: 3.223 ± 1.661
0.806LysLys: 0.806 ± 0.769
3.223LysLeu: 3.223 ± 1.84
0.0LysMet: 0.0 ± 0.0
4.029LysAsn: 4.029 ± 1.729
4.029LysPro: 4.029 ± 2.025
1.612LysGln: 1.612 ± 0.985
3.223LysArg: 3.223 ± 2.164
5.641LysSer: 5.641 ± 1.449
4.835LysThr: 4.835 ± 1.117
5.641LysVal: 5.641 ± 1.88
0.806LysTrp: 0.806 ± 0.769
4.029LysTyr: 4.029 ± 1.023
0.0LysXaa: 0.0 ± 0.0
Leu
0.806LeuAla: 0.806 ± 0.602
2.417LeuCys: 2.417 ± 1.367
5.641LeuAsp: 5.641 ± 2.74
3.223LeuGlu: 3.223 ± 1.094
3.223LeuPhe: 3.223 ± 1.308
6.446LeuGly: 6.446 ± 1.665
1.612LeuHis: 1.612 ± 0.908
4.029LeuIle: 4.029 ± 2.185
5.641LeuLys: 5.641 ± 1.745
4.029LeuLeu: 4.029 ± 2.472
0.806LeuMet: 0.806 ± 0.769
5.641LeuAsn: 5.641 ± 2.177
1.612LeuPro: 1.612 ± 1.026
2.417LeuGln: 2.417 ± 1.095
6.446LeuArg: 6.446 ± 2.371
5.641LeuSer: 5.641 ± 0.831
7.252LeuThr: 7.252 ± 1.765
8.058LeuVal: 8.058 ± 4.275
0.806LeuTrp: 0.806 ± 0.819
4.835LeuTyr: 4.835 ± 2.508
0.0LeuXaa: 0.0 ± 0.0
Met
0.806MetAla: 0.806 ± 0.769
0.806MetCys: 0.806 ± 0.769
1.612MetAsp: 1.612 ± 1.817
2.417MetGlu: 2.417 ± 1.672
1.612MetPhe: 1.612 ± 1.537
2.417MetGly: 2.417 ± 1.183
2.417MetHis: 2.417 ± 1.323
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.612MetLeu: 1.612 ± 0.717
0.806MetMet: 0.806 ± 0.708
0.806MetAsn: 0.806 ± 0.769
2.417MetPro: 2.417 ± 1.183
0.806MetGln: 0.806 ± 0.888
0.806MetArg: 0.806 ± 0.909
2.417MetSer: 2.417 ± 1.786
0.0MetThr: 0.0 ± 0.0
1.612MetVal: 1.612 ± 1.151
1.612MetTrp: 1.612 ± 0.979
2.417MetTyr: 2.417 ± 1.591
0.0MetXaa: 0.0 ± 0.0
Asn
4.029AsnAla: 4.029 ± 2.154
0.806AsnCys: 0.806 ± 0.888
1.612AsnAsp: 1.612 ± 1.205
3.223AsnGlu: 3.223 ± 1.802
1.612AsnPhe: 1.612 ± 0.717
2.417AsnGly: 2.417 ± 1.887
2.417AsnHis: 2.417 ± 1.654
3.223AsnIle: 3.223 ± 1.022
1.612AsnLys: 1.612 ± 1.638
7.252AsnLeu: 7.252 ± 4.192
3.223AsnMet: 3.223 ± 2.196
4.029AsnAsn: 4.029 ± 1.045
6.446AsnPro: 6.446 ± 1.372
3.223AsnGln: 3.223 ± 0.912
3.223AsnArg: 3.223 ± 1.72
4.835AsnSer: 4.835 ± 2.091
0.806AsnThr: 0.806 ± 0.602
3.223AsnVal: 3.223 ± 1.414
0.0AsnTrp: 0.0 ± 0.0
3.223AsnTyr: 3.223 ± 0.939
0.0AsnXaa: 0.0 ± 0.0
Pro
3.223ProAla: 3.223 ± 1.085
1.612ProCys: 1.612 ± 0.985
2.417ProAsp: 2.417 ± 1.207
0.806ProGlu: 0.806 ± 0.602
2.417ProPhe: 2.417 ± 1.115
0.806ProGly: 0.806 ± 0.602
3.223ProHis: 3.223 ± 1.926
4.029ProIle: 4.029 ± 1.632
3.223ProLys: 3.223 ± 1.886
4.029ProLeu: 4.029 ± 1.406
3.223ProMet: 3.223 ± 1.168
5.641ProAsn: 5.641 ± 1.787
3.223ProPro: 3.223 ± 1.763
3.223ProGln: 3.223 ± 1.618
6.446ProArg: 6.446 ± 1.301
6.446ProSer: 6.446 ± 3.179
4.029ProThr: 4.029 ± 1.685
3.223ProVal: 3.223 ± 0.922
0.0ProTrp: 0.0 ± 0.0
1.612ProTyr: 1.612 ± 0.87
0.0ProXaa: 0.0 ± 0.0
Gln
3.223GlnAla: 3.223 ± 1.362
0.0GlnCys: 0.0 ± 0.0
1.612GlnAsp: 1.612 ± 0.717
4.029GlnGlu: 4.029 ± 1.639
3.223GlnPhe: 3.223 ± 1.731
2.417GlnGly: 2.417 ± 1.367
2.417GlnHis: 2.417 ± 1.926
1.612GlnIle: 1.612 ± 0.913
0.806GlnLys: 0.806 ± 0.82
2.417GlnLeu: 2.417 ± 1.164
0.806GlnMet: 0.806 ± 0.602
2.417GlnAsn: 2.417 ± 1.834
4.029GlnPro: 4.029 ± 2.532
2.417GlnGln: 2.417 ± 1.654
3.223GlnArg: 3.223 ± 1.747
4.835GlnSer: 4.835 ± 1.416
2.417GlnThr: 2.417 ± 1.137
4.029GlnVal: 4.029 ± 0.965
0.0GlnTrp: 0.0 ± 0.0
1.612GlnTyr: 1.612 ± 1.043
0.0GlnXaa: 0.0 ± 0.0
Arg
2.417ArgAla: 2.417 ± 1.654
0.806ArgCys: 0.806 ± 0.888
4.029ArgAsp: 4.029 ± 1.41
2.417ArgGlu: 2.417 ± 0.986
1.612ArgPhe: 1.612 ± 0.985
3.223ArgGly: 3.223 ± 1.379
2.417ArgHis: 2.417 ± 0.904
8.058ArgIle: 8.058 ± 3.698
3.223ArgLys: 3.223 ± 1.366
2.417ArgLeu: 2.417 ± 1.245
2.417ArgMet: 2.417 ± 1.601
2.417ArgAsn: 2.417 ± 1.404
5.641ArgPro: 5.641 ± 1.232
2.417ArgGln: 2.417 ± 1.164
4.029ArgArg: 4.029 ± 2.321
4.835ArgSer: 4.835 ± 0.955
2.417ArgThr: 2.417 ± 1.115
5.641ArgVal: 5.641 ± 1.16
0.0ArgTrp: 0.0 ± 0.0
3.223ArgTyr: 3.223 ± 1.569
0.0ArgXaa: 0.0 ± 0.0
Ser
4.029SerAla: 4.029 ± 1.472
2.417SerCys: 2.417 ± 1.155
4.029SerAsp: 4.029 ± 0.965
3.223SerGlu: 3.223 ± 1.094
4.835SerPhe: 4.835 ± 2.394
3.223SerGly: 3.223 ± 1.377
2.417SerHis: 2.417 ± 1.519
4.029SerIle: 4.029 ± 1.729
8.864SerLys: 8.864 ± 3.771
5.641SerLeu: 5.641 ± 2.577
2.417SerMet: 2.417 ± 3.228
7.252SerAsn: 7.252 ± 1.599
5.641SerPro: 5.641 ± 2.295
0.806SerGln: 0.806 ± 0.888
5.641SerArg: 5.641 ± 1.142
9.67SerSer: 9.67 ± 3.494
7.252SerThr: 7.252 ± 0.568
2.417SerVal: 2.417 ± 2.306
0.806SerTrp: 0.806 ± 0.602
1.612SerTyr: 1.612 ± 0.913
0.0SerXaa: 0.0 ± 0.0
Thr
2.417ThrAla: 2.417 ± 0.804
0.0ThrCys: 0.0 ± 0.0
1.612ThrAsp: 1.612 ± 1.537
1.612ThrGlu: 1.612 ± 1.104
2.417ThrPhe: 2.417 ± 1.244
5.641ThrGly: 5.641 ± 2.069
4.029ThrHis: 4.029 ± 1.396
0.806ThrIle: 0.806 ± 0.602
3.223ThrLys: 3.223 ± 1.33
3.223ThrLeu: 3.223 ± 1.346
4.029ThrMet: 4.029 ± 1.983
4.835ThrAsn: 4.835 ± 1.756
4.029ThrPro: 4.029 ± 1.708
5.641ThrGln: 5.641 ± 1.842
1.612ThrArg: 1.612 ± 1.013
5.641ThrSer: 5.641 ± 2.179
3.223ThrThr: 3.223 ± 2.054
1.612ThrVal: 1.612 ± 1.537
0.806ThrTrp: 0.806 ± 0.82
0.806ThrTyr: 0.806 ± 0.602
0.0ThrXaa: 0.0 ± 0.0
Val
1.612ValAla: 1.612 ± 0.953
0.0ValCys: 0.0 ± 0.0
4.029ValAsp: 4.029 ± 1.7
2.417ValGlu: 2.417 ± 1.838
5.641ValPhe: 5.641 ± 1.51
0.806ValGly: 0.806 ± 0.769
1.612ValHis: 1.612 ± 1.043
7.252ValIle: 7.252 ± 2.582
5.641ValLys: 5.641 ± 2.295
6.446ValLeu: 6.446 ± 2.657
1.612ValMet: 1.612 ± 1.537
3.223ValAsn: 3.223 ± 1.774
4.835ValPro: 4.835 ± 1.038
3.223ValGln: 3.223 ± 0.922
4.835ValArg: 4.835 ± 2.943
0.806ValSer: 0.806 ± 0.602
4.835ValThr: 4.835 ± 2.735
3.223ValVal: 3.223 ± 1.33
0.0ValTrp: 0.0 ± 0.0
3.223ValTyr: 3.223 ± 2.088
0.0ValXaa: 0.0 ± 0.0
Trp
2.417TrpAla: 2.417 ± 1.807
0.0TrpCys: 0.0 ± 0.0
0.806TrpAsp: 0.806 ± 0.82
0.806TrpGlu: 0.806 ± 0.909
0.806TrpPhe: 0.806 ± 0.819
0.0TrpGly: 0.0 ± 0.0
0.806TrpHis: 0.806 ± 0.769
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.806TrpGln: 0.806 ± 0.602
1.612TrpArg: 1.612 ± 1.026
1.612TrpSer: 1.612 ± 1.082
1.612TrpThr: 1.612 ± 1.043
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.806TrpTyr: 0.806 ± 0.602
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.223TyrAla: 3.223 ± 1.362
0.0TyrCys: 0.0 ± 0.0
0.806TyrAsp: 0.806 ± 0.769
2.417TyrGlu: 2.417 ± 1.641
3.223TyrPhe: 3.223 ± 0.912
0.806TyrGly: 0.806 ± 0.602
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
1.612TyrLys: 1.612 ± 0.979
5.641TyrLeu: 5.641 ± 2.28
2.417TyrMet: 2.417 ± 0.728
3.223TyrAsn: 3.223 ± 1.559
0.0TyrPro: 0.0 ± 0.0
0.806TyrGln: 0.806 ± 0.769
2.417TyrArg: 2.417 ± 1.565
3.223TyrSer: 3.223 ± 1.414
0.806TyrThr: 0.806 ± 0.769
4.029TyrVal: 4.029 ± 1.262
0.0TyrTrp: 0.0 ± 0.0
0.806TyrTyr: 0.806 ± 0.888
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1242 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
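As an illustration of what protein-level bootstrapping can look like, the sketch below resamples whole proteins with replacement and reports the spread of the resulting estimates. It is only one plausible reading of the note: the function names, the use of the sample standard deviation as the error, and the toy sequences are assumptions, not the documented Proteome-pI procedure.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Standard deviation of the estimate over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return stdev(estimates)


# Toy proteome of three short hypothetical sequences.
toy_proteome = ["MAAK", "AAGG", "MKKA"]
error = bootstrap_error(toy_proteome, "AA")
```

With only 7 proteins in this proteome, each resample contains few distinct proteins, which is consistent with the relatively large errors shown in the table.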

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski