Amino acid dipeptide frequency for Tomato leaf curl Rajasthan virus - [India:Rajasthan:2005]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
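The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `classify` are hypothetical, the choice to count overlapping pairs within each protein (never across protein boundaries) is an assumption, and so is the rule of comparing each value against the uniform 1/400 expectation.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
EXPECTED_PER_MILLE = 1000.0 / 400   # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each protein only,
    and frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X so it lands in the Xaa row/column.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq > expected:
        return "over"    # would be marked red
    if freq < expected:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAAC", "ACB"])  # toy sequences, not real data
    for pair in ("AA", "AC", "CX", "WW"):
        print(pair, freqs[pair], classify(freqs[pair]))
```

The toy input is far too small to be meaningful; it only shows that the function returns one entry for each of the 441 dipeptides, including those that never occur.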

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
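As a worked example of the unit conversion, take the AlaAla value from the table (5.613‰) and compare it with the uniform expectations for 400 and 441 dipeptides. The arithmetic below is only an illustration of the scaling; it makes no claim about statistical significance.

```latex
\[
5.613\,\text{‰} = 5.613 \times 10^{-3} = 0.005613 = 0.5613\,\%
\]
\[
\frac{1}{400} = 0.0025 = 2.5\,\text{‰},
\qquad
\frac{1}{441} \approx 0.00227 = 2.27\,\text{‰}
\]
```

On this scale, any table entry above 2.5‰ is more frequent than the uniform expectation for the 400 standard dipeptides, and any entry below it is less frequent.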

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency in ‰ ± its error.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.613 ± 1.88 | 0.802 ± 0.662 | 2.406 ± 1.087 | 1.604 ± 1.026 | 0.802 ± 0.846 | 0.802 ± 0.662 | 2.406 ± 1.056 | 1.604 ± 1.176 | 5.613 ± 1.483 | 5.613 ± 1.51 | 0.0 ± 0.0 | 1.604 ± 0.67 | 3.208 ± 1.595 | 1.604 ± 0.957 | 4.812 ± 1.545 | 5.613 ± 1.888 | 3.208 ± 2.103 | 3.208 ± 1.073 | 1.604 ± 0.67 | 2.406 ± 1.188 | 0.0 ± 0.0
Cys | 0.802 ± 0.588 | 1.604 ± 1.788 | 0.0 ± 0.0 | 1.604 ± 1.063 | 2.406 ± 1.71 | 2.406 ± 1.507 | 0.802 ± 0.75 | 0.802 ± 0.662 | 1.604 ± 0.67 | 0.802 ± 0.588 | 0.802 ± 0.894 | 0.802 ± 0.588 | 2.406 ± 2.681 | 0.0 ± 0.0 | 0.802 ± 0.588 | 1.604 ± 0.864 | 0.802 ± 0.662 | 0.802 ± 0.662 | 0.802 ± 0.846 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 3.208 ± 1.672 | 0.0 ± 0.0 | 1.604 ± 1.176 | 1.604 ± 0.67 | 1.604 ± 0.908 | 3.208 ± 1.785 | 0.802 ± 0.75 | 4.812 ± 2.709 | 1.604 ± 0.67 | 5.613 ± 1.961 | 0.0 ± 0.0 | 2.406 ± 1.087 | 1.604 ± 0.957 | 0.0 ± 0.0 | 4.812 ± 1.785 | 5.613 ± 1.983 | 2.406 ± 1.82 | 8.019 ± 1.647 | 2.406 ± 1.273 | 2.406 ± 1.588 | 0.0 ± 0.0
Glu | 3.208 ± 1.178 | 0.0 ± 0.0 | 0.802 ± 0.588 | 5.613 ± 4.115 | 1.604 ± 0.957 | 3.208 ± 1.178 | 1.604 ± 0.847 | 0.802 ± 0.902 | 1.604 ± 0.957 | 3.208 ± 1.72 | 0.0 ± 0.0 | 3.208 ± 1.813 | 2.406 ± 0.956 | 2.406 ± 1.849 | 0.802 ± 0.902 | 3.208 ± 1.762 | 4.01 ± 1.312 | 2.406 ± 0.784 | 1.604 ± 0.864 | 1.604 ± 0.946 | 0.0 ± 0.0
Phe | 0.802 ± 0.846 | 1.604 ± 0.973 | 3.208 ± 1.341 | 3.208 ± 0.91 | 1.604 ± 0.67 | 1.604 ± 0.973 | 2.406 ± 1.313 | 1.604 ± 1.176 | 3.208 ± 1.892 | 4.812 ± 2.614 | 2.406 ± 1.169 | 1.604 ± 1.024 | 2.406 ± 1.403 | 3.208 ± 1.41 | 2.406 ± 1.245 | 1.604 ± 0.864 | 0.802 ± 0.75 | 0.802 ± 0.662 | 0.0 ± 0.0 | 0.802 ± 0.662 | 0.0 ± 0.0
Gly | 1.604 ± 1.176 | 1.604 ± 0.908 | 2.406 ± 1.188 | 2.406 ± 1.267 | 1.604 ± 1.026 | 3.208 ± 0.987 | 0.802 ± 0.588 | 2.406 ± 1.752 | 5.613 ± 2.471 | 6.415 ± 2.776 | 0.0 ± 0.0 | 3.208 ± 1.183 | 3.208 ± 0.987 | 3.208 ± 0.991 | 2.406 ± 1.188 | 2.406 ± 1.0 | 2.406 ± 1.099 | 3.208 ± 1.905 | 0.0 ± 0.0 | 0.802 ± 0.894 | 0.0 ± 0.0
His | 2.406 ± 1.529 | 1.604 ± 1.026 | 4.812 ± 2.131 | 1.604 ± 0.957 | 2.406 ± 1.313 | 2.406 ± 1.215 | 0.802 ± 0.75 | 5.613 ± 2.704 | 0.802 ± 0.902 | 2.406 ± 1.588 | 0.0 ± 0.0 | 4.01 ± 1.234 | 3.208 ± 2.4 | 0.802 ± 0.894 | 3.208 ± 1.609 | 3.208 ± 1.552 | 1.604 ± 1.324 | 1.604 ± 0.946 | 0.0 ± 0.0 | 0.802 ± 0.588 | 0.0 ± 0.0
Ile | 0.802 ± 0.588 | 1.604 ± 0.847 | 3.208 ± 1.77 | 1.604 ± 0.847 | 2.406 ± 1.292 | 2.406 ± 1.813 | 1.604 ± 0.847 | 2.406 ± 1.47 | 4.812 ± 1.953 | 2.406 ± 1.71 | 0.0 ± 0.0 | 3.208 ± 1.238 | 1.604 ± 0.864 | 3.208 ± 1.892 | 7.217 ± 2.607 | 8.019 ± 2.447 | 4.812 ± 2.039 | 2.406 ± 0.784 | 2.406 ± 1.087 | 3.208 ± 1.957 | 0.0 ± 0.0
Lys | 4.812 ± 1.293 | 0.802 ± 0.902 | 2.406 ± 1.764 | 4.01 ± 2.152 | 1.604 ± 0.67 | 1.604 ± 0.865 | 4.01 ± 1.65 | 5.613 ± 1.812 | 5.613 ± 2.356 | 0.802 ± 0.902 | 0.802 ± 0.824 | 7.217 ± 2.12 | 2.406 ± 1.196 | 0.802 ± 0.588 | 2.406 ± 1.433 | 4.812 ± 1.82 | 4.01 ± 0.879 | 4.812 ± 1.651 | 0.802 ± 0.662 | 4.01 ± 1.155 | 0.0 ± 0.0
Leu | 2.406 ± 1.756 | 0.802 ± 0.588 | 7.217 ± 2.586 | 4.01 ± 1.814 | 0.0 ± 0.0 | 5.613 ± 1.469 | 2.406 ± 0.944 | 5.613 ± 2.646 | 6.415 ± 1.938 | 0.802 ± 0.894 | 1.604 ± 0.823 | 3.208 ± 1.531 | 4.01 ± 2.686 | 2.406 ± 1.313 | 6.415 ± 2.861 | 3.208 ± 1.695 | 7.217 ± 2.078 | 4.01 ± 1.188 | 0.802 ± 0.846 | 2.406 ± 0.893 | 0.0 ± 0.0
Met | 1.604 ± 0.67 | 1.604 ± 0.67 | 1.604 ± 0.973 | 0.802 ± 0.824 | 1.604 ± 1.324 | 4.01 ± 1.389 | 0.802 ± 0.662 | 1.604 ± 0.946 | 0.0 ± 0.0 | 1.604 ± 1.063 | 0.0 ± 0.0 | 0.802 ± 0.662 | 0.802 ± 0.846 | 0.802 ± 0.75 | 0.0 ± 0.0 | 0.802 ± 0.662 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.604 ± 0.957 | 1.604 ± 1.324 | 0.0 ± 0.0
Asn | 3.208 ± 1.595 | 0.802 ± 0.75 | 2.406 ± 1.188 | 1.604 ± 1.063 | 3.208 ± 1.615 | 1.604 ± 0.847 | 4.01 ± 1.61 | 2.406 ± 1.073 | 1.604 ± 0.847 | 3.208 ± 1.41 | 3.208 ± 1.767 | 3.208 ± 1.595 | 1.604 ± 1.024 | 1.604 ± 0.908 | 5.613 ± 1.586 | 4.01 ± 1.045 | 4.01 ± 1.676 | 4.01 ± 1.691 | 0.0 ± 0.0 | 4.01 ± 1.794 | 0.0 ± 0.0
Pro | 2.406 ± 1.401 | 3.208 ± 1.194 | 1.604 ± 1.063 | 2.406 ± 1.313 | 2.406 ± 1.292 | 1.604 ± 1.077 | 3.208 ± 1.795 | 1.604 ± 0.67 | 1.604 ± 1.176 | 3.208 ± 1.073 | 1.604 ± 0.967 | 4.01 ± 2.095 | 3.208 ± 1.795 | 5.613 ± 2.234 | 3.208 ± 1.337 | 4.812 ± 0.819 | 1.604 ± 1.176 | 6.415 ± 1.708 | 0.0 ± 0.0 | 1.604 ± 0.973 | 0.0 ± 0.0
Gln | 2.406 ± 1.099 | 0.802 ± 0.588 | 3.208 ± 1.786 | 1.604 ± 0.67 | 2.406 ± 1.292 | 0.802 ± 0.588 | 2.406 ± 1.356 | 3.208 ± 2.621 | 2.406 ± 1.772 | 2.406 ± 1.756 | 0.0 ± 0.0 | 1.604 ± 1.077 | 4.01 ± 1.479 | 2.406 ± 0.821 | 0.802 ± 0.588 | 3.208 ± 0.91 | 1.604 ± 1.176 | 3.208 ± 1.478 | 0.802 ± 0.588 | 0.802 ± 0.662 | 0.0 ± 0.0
Arg | 3.208 ± 1.25 | 3.208 ± 1.914 | 4.01 ± 1.253 | 3.208 ± 1.323 | 3.208 ± 1.178 | 4.01 ± 1.148 | 4.01 ± 1.519 | 3.208 ± 1.025 | 2.406 ± 1.47 | 4.01 ± 2.396 | 1.604 ± 1.324 | 0.802 ± 0.902 | 6.415 ± 1.721 | 1.604 ± 1.026 | 4.812 ± 2.723 | 3.208 ± 0.987 | 5.613 ± 2.889 | 5.613 ± 1.561 | 0.0 ± 0.0 | 1.604 ± 1.063 | 0.0 ± 0.0
Ser | 4.01 ± 1.624 | 0.0 ± 0.0 | 3.208 ± 0.849 | 0.802 ± 0.588 | 1.604 ± 0.67 | 3.208 ± 1.523 | 0.802 ± 0.75 | 4.812 ± 2.367 | 8.019 ± 2.365 | 3.208 ± 1.552 | 2.406 ± 1.082 | 5.613 ± 1.816 | 7.217 ± 1.978 | 3.208 ± 1.286 | 7.217 ± 1.735 | 9.623 ± 3.788 | 5.613 ± 3.114 | 6.415 ± 3.562 | 0.0 ± 0.0 | 3.208 ± 1.726 | 0.0 ± 0.0
Thr | 5.613 ± 2.1 | 0.0 ± 0.0 | 2.406 ± 1.356 | 1.604 ± 1.024 | 2.406 ± 1.886 | 4.01 ± 1.829 | 3.208 ± 1.465 | 3.208 ± 1.523 | 4.812 ± 1.215 | 4.01 ± 1.017 | 0.802 ± 0.588 | 3.208 ± 1.946 | 3.208 ± 0.91 | 1.604 ± 1.803 | 1.604 ± 0.908 | 8.019 ± 3.067 | 0.0 ± 0.0 | 3.208 ± 1.497 | 2.406 ± 1.356 | 1.604 ± 0.847 | 0.0 ± 0.0
Val | 0.802 ± 0.75 | 1.604 ± 0.957 | 3.208 ± 1.102 | 2.406 ± 1.82 | 4.812 ± 1.626 | 1.604 ± 1.044 | 5.613 ± 3.266 | 4.812 ± 1.659 | 4.812 ± 1.764 | 8.821 ± 2.141 | 2.406 ± 1.196 | 1.604 ± 0.973 | 1.604 ± 0.67 | 3.208 ± 1.786 | 4.01 ± 2.626 | 4.812 ± 1.344 | 4.01 ± 2.626 | 3.208 ± 1.178 | 0.802 ± 0.902 | 5.613 ± 2.075 | 0.0 ± 0.0
Trp | 3.208 ± 0.987 | 0.0 ± 0.0 | 1.604 ± 1.026 | 0.802 ± 0.902 | 0.802 ± 0.824 | 0.802 ± 0.588 | 0.802 ± 0.662 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.802 ± 0.846 | 0.802 ± 0.662 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.802 ± 0.588 | 0.802 ± 0.75 | 0.802 ± 0.846 | 2.406 ± 1.793 | 0.802 ± 0.588 | 0.0 ± 0.0 | 0.802 ± 0.588 | 0.0 ± 0.0
Tyr | 2.406 ± 1.196 | 0.0 ± 0.0 | 2.406 ± 1.267 | 0.802 ± 0.662 | 2.406 ± 0.893 | 0.802 ± 0.588 | 0.802 ± 0.902 | 3.208 ± 1.523 | 0.802 ± 0.588 | 6.415 ± 1.558 | 2.406 ± 1.06 | 4.01 ± 1.155 | 0.802 ± 0.588 | 1.604 ± 0.67 | 2.406 ± 1.401 | 1.604 ± 0.957 | 0.802 ± 0.902 | 5.613 ± 1.656 | 0.0 ± 0.0 | 0.802 ± 0.75 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 7 proteins (1248 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
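A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the code behind the table: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation of the resampled frequencies (the source does not say which spread measure was used).

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs
    within each protein (assumed counting scheme)."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement and
    recomputes the frequency; the spread of the replicates is returned.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    replicates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.pstdev(replicates)


if __name__ == "__main__":
    toy = ["MKKLA", "AAKLS", "SSRKK"]  # toy sequences, not the real proteome
    print(dipeptide_frequency(toy, "KK"), "+/-", bootstrap_error(toy, "KK"))
```

One consequence of resampling whole proteins is that a dipeptide absent from every protein has a frequency of 0 in every replicate, so its error is exactly 0, which matches the 0.0 ± 0.0 entries in the table.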

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski