Amino acid dipeptide frequency for Dianke virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
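As a rough illustration of how such a frequency can be obtained, the sketch below counts overlapping residue pairs inside each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter sequences and the toy input are assumptions made for this example; it is not the code used to build this page.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Pairs are counted with a sliding window of width 2 inside each protein,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the Dianke virus proteome).
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Counting within each protein is a design choice of this sketch: it avoids creating artificial pairs from the last residue of one protein and the first residue of the next.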

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
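The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the sketch assumes that "expected" means the uniform value (1 divided by the number of possible dipeptides); the exact threshold used for the colouring on this page is not stated beyond that.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Frequency of each dipeptide, in per mille, if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"

print(expected_uniform_per_mille(20))  # 400 dipeptides: 2.5 per mille, i.e. 0.25%
print(expected_uniform_per_mille(21))  # 441 dipeptides when Xaa is included
print(classify(9.615))                 # LeuLeu value taken from the table
print(classify(0.169))                 # CysCys value taken from the table
```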

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
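A minimal sketch of the unit conversion is given below. The second helper goes one step further and estimates the underlying count; it assumes that dipeptides are counted inside each protein, so that a proteome of N residues in P proteins contains N - P dipeptides. That counting scheme is an assumption of this example, and the totals (3 proteins, 5929 amino acids) are the ones reported under the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def approx_count(value_per_mille, n_residues, n_proteins):
    """Estimate how many times a dipeptide occurs.

    Assumes one dipeptide per pair of adjacent residues within a protein,
    i.e. n_residues - n_proteins dipeptides in total (an assumption).
    """
    n_dipeptides = n_residues - n_proteins
    return round(per_mille_to_fraction(value_per_mille) * n_dipeptides)

print(per_mille_to_fraction(1.687))   # AlaAla as a fraction
print(approx_count(1.687, 5929, 3))   # estimated number of AlaAla pairs
print(approx_count(0.169, 5929, 3))   # estimated number of CysCys pairs
```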

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue (order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa)
Ala
AlaAla: 1.687 ± 0.759
AlaCys: 1.012 ± 0.683
AlaAsp: 2.699 ± 0.245
AlaGlu: 2.868 ± 0.777
AlaPhe: 2.53 ± 0.614
AlaGly: 1.181 ± 0.211
AlaHis: 1.687 ± 0.186
AlaIle: 5.904 ± 0.09
AlaLys: 3.88 ± 0.443
AlaLeu: 5.735 ± 1.121
AlaMet: 1.012 ± 0.106
AlaAsn: 3.036 ± 0.341
AlaPro: 1.518 ± 0.238
AlaGln: 2.024 ± 0.212
AlaArg: 2.362 ± 1.005
AlaSer: 2.362 ± 0.395
AlaThr: 3.205 ± 0.589
AlaVal: 2.362 ± 0.348
AlaTrp: 0.506 ± 0.167
AlaTyr: 3.88 ± 1.044
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.675 ± 0.177
CysCys: 0.169 ± 0.117
CysAsp: 1.012 ± 0.347
CysGlu: 1.012 ± 0.266
CysPhe: 0.675 ± 0.177
CysGly: 1.35 ± 0.324
CysHis: 0.675 ± 0.282
CysIle: 1.518 ± 0.409
CysLys: 2.024 ± 0.532
CysLeu: 1.518 ± 0.238
CysMet: 0.675 ± 0.177
CysAsn: 1.518 ± 0.359
CysPro: 1.181 ± 0.567
CysGln: 0.169 ± 0.117
CysArg: 0.675 ± 0.45
CysSer: 0.675 ± 0.152
CysThr: 1.856 ± 0.595
CysVal: 1.687 ± 0.246
CysTrp: 0.169 ± 0.114
CysTyr: 1.35 ± 0.355
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.362 ± 0.036
AspCys: 1.181 ± 0.454
AspAsp: 3.205 ± 0.589
AspGlu: 2.024 ± 0.264
AspPhe: 3.036 ± 0.762
AspGly: 1.687 ± 0.438
AspHis: 1.181 ± 0.211
AspIle: 4.217 ± 0.111
AspLys: 2.024 ± 0.847
AspLeu: 6.41 ± 0.461
AspMet: 0.506 ± 0.259
AspAsn: 3.88 ± 1.021
AspPro: 2.699 ± 0.841
AspGln: 1.687 ± 0.499
AspArg: 2.362 ± 0.348
AspSer: 2.699 ± 0.943
AspThr: 4.892 ± 1.009
AspVal: 3.88 ± 0.26
AspTrp: 0.337 ± 0.089
AspTyr: 3.88 ± 0.545
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.518 ± 0.238
GluCys: 1.181 ± 0.216
GluAsp: 2.193 ± 0.753
GluGlu: 1.687 ± 0.499
GluPhe: 4.049 ± 0.529
GluGly: 0.675 ± 0.347
GluHis: 2.868 ± 0.124
GluIle: 4.049 ± 0.519
GluLys: 2.868 ± 0.457
GluLeu: 6.073 ± 0.793
GluMet: 1.35 ± 0.123
GluAsn: 3.205 ± 0.226
GluPro: 2.868 ± 0.362
GluGln: 2.193 ± 0.357
GluArg: 1.012 ± 0.106
GluSer: 2.868 ± 0.542
GluThr: 2.868 ± 0.124
GluVal: 1.012 ± 0.347
GluTrp: 0.0 ± 0.0
GluTyr: 1.687 ± 0.443
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.362 ± 0.645
PheCys: 1.012 ± 0.374
PheAsp: 3.374 ± 0.684
PheGlu: 2.024 ± 0.579
PhePhe: 0.169 ± 0.114
PheGly: 2.53 ± 0.372
PheHis: 0.169 ± 0.117
PheIle: 2.53 ± 0.814
PheLys: 2.024 ± 0.802
PheLeu: 3.036 ± 0.717
PheMet: 1.35 ± 0.407
PheAsn: 2.868 ± 0.982
PhePro: 1.012 ± 0.317
PheGln: 1.181 ± 0.289
PheArg: 1.012 ± 0.374
PheSer: 2.699 ± 0.608
PheThr: 2.868 ± 0.361
PheVal: 3.711 ± 1.607
PheTrp: 0.169 ± 0.114
PheTyr: 2.024 ± 0.486
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.012 ± 0.266
GlyCys: 0.506 ± 0.167
GlyAsp: 1.518 ± 0.44
GlyGlu: 2.024 ± 0.661
GlyPhe: 1.687 ± 0.265
GlyGly: 1.518 ± 0.44
GlyHis: 0.506 ± 0.167
GlyIle: 2.193 ± 0.357
GlyLys: 2.868 ± 0.362
GlyLeu: 2.868 ± 0.757
GlyMet: 0.843 ± 0.249
GlyAsn: 1.687 ± 0.246
GlyPro: 1.181 ± 0.437
GlyGln: 0.843 ± 0.438
GlyArg: 1.181 ± 0.454
GlySer: 1.687 ± 0.441
GlyThr: 2.699 ± 0.427
GlyVal: 1.35 ± 0.355
GlyTrp: 0.506 ± 0.259
GlyTyr: 1.687 ± 0.351
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.362 ± 0.211
HisCys: 0.843 ± 0.249
HisAsp: 1.687 ± 0.351
HisGlu: 1.518 ± 0.44
HisPhe: 1.35 ± 0.231
HisGly: 0.843 ± 0.071
HisHis: 1.518 ± 0.439
HisIle: 2.699 ± 0.427
HisLys: 2.699 ± 0.427
HisLeu: 4.049 ± 0.808
HisMet: 1.012 ± 0.347
HisAsn: 3.374 ± 0.302
HisPro: 2.699 ± 0.191
HisGln: 2.024 ± 0.968
HisArg: 0.675 ± 0.282
HisSer: 1.35 ± 0.231
HisThr: 3.205 ± 0.417
HisVal: 1.518 ± 0.106
HisTrp: 0.337 ± 0.234
HisTyr: 1.856 ± 0.39
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.217 ± 1.272
IleCys: 1.687 ± 0.441
IleAsp: 4.386 ± 1.22
IleGlu: 3.88 ± 0.174
IlePhe: 2.53 ± 0.69
IleGly: 1.856 ± 0.39
IleHis: 2.193 ± 0.597
IleIle: 5.061 ± 0.466
IleLys: 4.217 ± 0.884
IleLeu: 7.422 ± 0.585
IleMet: 2.53 ± 0.474
IleAsn: 7.085 ± 1.679
IlePro: 4.217 ± 0.897
IleGln: 3.205 ± 0.441
IleArg: 3.374 ± 0.666
IleSer: 3.543 ± 0.918
IleThr: 4.386 ± 0.568
IleVal: 4.555 ± 0.561
IleTrp: 1.012 ± 0.106
IleTyr: 4.217 ± 0.816
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.699 ± 0.443
LysCys: 0.506 ± 0.167
LysAsp: 4.217 ± 0.537
LysGlu: 1.687 ± 0.556
LysPhe: 1.687 ± 0.644
LysGly: 1.181 ± 0.324
LysHis: 2.868 ± 0.618
LysIle: 4.723 ± 1.104
LysLys: 1.181 ± 0.614
LysLeu: 6.242 ± 0.271
LysMet: 0.675 ± 0.212
LysAsn: 3.88 ± 0.536
LysPro: 5.567 ± 0.582
LysGln: 2.868 ± 0.957
LysArg: 2.699 ± 0.495
LysSer: 3.543 ± 0.054
LysThr: 4.892 ± 0.163
LysVal: 3.205 ± 0.352
LysTrp: 0.506 ± 0.16
LysTyr: 4.892 ± 0.463
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.229 ± 0.803
LeuCys: 1.856 ± 0.473
LeuAsp: 5.735 ± 1.447
LeuGlu: 5.061 ± 1.121
LeuPhe: 4.049 ± 0.53
LeuGly: 2.699 ± 0.248
LeuHis: 3.543 ± 0.634
LeuIle: 7.591 ± 0.775
LeuLys: 9.109 ± 0.55
LeuLeu: 9.615 ± 0.386
LeuMet: 2.868 ± 0.681
LeuAsn: 8.266 ± 0.529
LeuPro: 3.036 ± 0.546
LeuGln: 6.073 ± 0.884
LeuArg: 5.229 ± 0.09
LeuSer: 7.254 ± 0.424
LeuThr: 7.254 ± 0.778
LeuVal: 4.555 ± 0.576
LeuTrp: 0.843 ± 0.219
LeuTyr: 6.748 ± 1.416
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.181 ± 0.332
MetCys: 0.675 ± 0.162
MetAsp: 1.35 ± 0.355
MetGlu: 1.518 ± 0.239
MetPhe: 1.012 ± 0.114
MetGly: 0.843 ± 0.249
MetHis: 0.843 ± 0.23
MetIle: 1.012 ± 0.394
MetLys: 1.35 ± 0.325
MetLeu: 3.88 ± 0.743
MetMet: 0.337 ± 0.089
MetAsn: 1.518 ± 0.238
MetPro: 0.506 ± 0.342
MetGln: 1.518 ± 0.513
MetArg: 1.181 ± 0.332
MetSer: 0.675 ± 0.47
MetThr: 1.181 ± 0.332
MetVal: 1.012 ± 0.114
MetTrp: 0.0 ± 0.0
MetTyr: 0.675 ± 0.152
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.555 ± 1.837
AsnCys: 1.181 ± 0.454
AsnAsp: 3.374 ± 1.222
AsnGlu: 3.205 ± 0.98
AsnPhe: 4.049 ± 0.835
AsnGly: 2.024 ± 1.173
AsnHis: 2.193 ± 0.357
AsnIle: 4.723 ± 0.378
AsnLys: 4.217 ± 0.311
AsnLeu: 7.085 ± 0.796
AsnMet: 1.687 ± 0.626
AsnAsn: 5.229 ± 0.876
AsnPro: 3.543 ± 0.933
AsnGln: 3.036 ± 0.123
AsnArg: 1.687 ± 0.183
AsnSer: 3.88 ± 1.616
AsnThr: 6.579 ± 0.508
AsnVal: 4.723 ± 0.82
AsnTrp: 0.843 ± 0.242
AsnTyr: 4.723 ± 0.234
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.205 ± 0.83
ProCys: 0.337 ± 0.235
ProAsp: 1.518 ± 0.417
ProGlu: 3.036 ± 0.52
ProPhe: 0.843 ± 0.373
ProGly: 1.856 ± 0.532
ProHis: 1.35 ± 0.123
ProIle: 3.711 ± 0.855
ProLys: 1.856 ± 0.465
ProLeu: 7.254 ± 0.87
ProMet: 0.675 ± 0.152
ProAsn: 2.53 ± 0.658
ProPro: 1.856 ± 0.152
ProGln: 1.856 ± 0.672
ProArg: 2.193 ± 0.518
ProSer: 3.88 ± 0.953
ProThr: 3.543 ± 0.437
ProVal: 2.53 ± 0.131
ProTrp: 0.337 ± 0.089
ProTyr: 2.193 ± 0.1
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.699 ± 1.046
GlnCys: 1.181 ± 0.216
GlnAsp: 1.856 ± 0.593
GlnGlu: 1.856 ± 0.344
GlnPhe: 1.012 ± 0.317
GlnGly: 1.35 ± 0.536
GlnHis: 2.868 ± 0.2
GlnIle: 2.193 ± 0.326
GlnLys: 2.024 ± 0.053
GlnLeu: 5.904 ± 0.778
GlnMet: 0.337 ± 0.225
GlnAsn: 3.036 ± 0.442
GlnPro: 2.53 ± 0.461
GlnGln: 3.205 ± 1.638
GlnArg: 1.518 ± 0.238
GlnSer: 2.699 ± 0.348
GlnThr: 2.362 ± 0.285
GlnVal: 1.35 ± 0.407
GlnTrp: 0.337 ± 0.235
GlnTyr: 2.362 ± 0.348
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.856 ± 0.573
ArgCys: 0.506 ± 0.173
ArgAsp: 1.856 ± 0.152
ArgGlu: 2.53 ± 0.434
ArgPhe: 1.181 ± 0.211
ArgGly: 0.506 ± 0.167
ArgHis: 1.35 ± 0.325
ArgIle: 3.711 ± 0.863
ArgLys: 3.374 ± 0.968
ArgLeu: 2.53 ± 0.136
ArgMet: 0.675 ± 0.152
ArgAsn: 3.374 ± 0.607
ArgPro: 1.012 ± 0.511
ArgGln: 1.856 ± 0.503
ArgArg: 2.024 ± 0.264
ArgSer: 1.35 ± 0.528
ArgThr: 2.699 ± 0.248
ArgVal: 2.024 ± 0.638
ArgTrp: 0.337 ± 0.225
ArgTyr: 4.049 ± 1.382
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.205 ± 0.605
SerCys: 1.181 ± 0.289
SerAsp: 3.374 ± 0.887
SerGlu: 2.868 ± 0.235
SerPhe: 1.687 ± 0.246
SerGly: 1.687 ± 0.556
SerHis: 2.53 ± 1.369
SerIle: 4.217 ± 1.255
SerLys: 3.374 ± 0.302
SerLeu: 5.229 ± 2.185
SerMet: 1.35 ± 0.598
SerAsn: 3.711 ± 1.329
SerPro: 2.868 ± 0.654
SerGln: 2.699 ± 0.544
SerArg: 1.35 ± 0.325
SerSer: 4.892 ± 2.812
SerThr: 3.711 ± 1.102
SerVal: 2.53 ± 0.333
SerTrp: 0.0 ± 0.0
SerTyr: 4.386 ± 1.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.555 ± 0.713
ThrCys: 2.193 ± 0.284
ThrAsp: 3.711 ± 0.149
ThrGlu: 2.868 ± 0.849
ThrPhe: 1.856 ± 0.286
ThrGly: 3.036 ± 0.213
ThrHis: 2.699 ± 0.208
ThrIle: 6.242 ± 1.3
ThrLys: 5.229 ± 0.876
ThrLeu: 9.109 ± 0.466
ThrMet: 1.687 ± 0.265
ThrAsn: 3.711 ± 0.747
ThrPro: 4.217 ± 0.34
ThrGln: 2.53 ± 0.727
ThrArg: 3.036 ± 0.318
ThrSer: 4.892 ± 0.763
ThrThr: 9.784 ± 2.716
ThrVal: 3.88 ± 0.764
ThrTrp: 0.337 ± 0.234
ThrTyr: 3.711 ± 0.141
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.362 ± 0.618
ValCys: 1.687 ± 0.186
ValAsp: 3.036 ± 0.546
ValGlu: 1.856 ± 0.709
ValPhe: 3.036 ± 0.213
ValGly: 0.506 ± 0.173
ValHis: 3.036 ± 0.475
ValIle: 3.543 ± 0.416
ValLys: 2.024 ± 0.589
ValLeu: 6.579 ± 0.553
ValMet: 1.012 ± 0.106
ValAsn: 4.217 ± 1.462
ValPro: 2.53 ± 0.136
ValGln: 1.012 ± 0.317
ValArg: 2.024 ± 0.404
ValSer: 2.868 ± 0.752
ValThr: 4.386 ± 0.465
ValVal: 2.53 ± 0.966
ValTrp: 0.0 ± 0.0
ValTyr: 3.543 ± 0.437
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.337 ± 0.235
TrpCys: 0.0 ± 0.0
TrpAsp: 0.843 ± 0.471
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.169 ± 0.117
TrpHis: 0.675 ± 0.45
TrpIle: 0.843 ± 0.242
TrpLys: 0.0 ± 0.0
TrpLeu: 0.843 ± 0.23
TrpMet: 0.169 ± 0.117
TrpAsn: 0.506 ± 0.351
TrpPro: 0.169 ± 0.248
TrpGln: 0.337 ± 0.089
TrpArg: 0.337 ± 0.228
TrpSer: 0.169 ± 0.117
TrpThr: 0.506 ± 0.167
TrpVal: 0.169 ± 0.114
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.843 ± 0.219
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.374 ± 0.666
TyrCys: 1.687 ± 0.142
TyrAsp: 3.205 ± 0.226
TyrGlu: 2.699 ± 0.433
TyrPhe: 1.687 ± 0.246
TyrGly: 2.868 ± 0.199
TyrHis: 2.868 ± 0.2
TyrIle: 5.061 ± 0.272
TyrLys: 3.374 ± 0.711
TyrLeu: 5.229 ± 0.769
TyrMet: 1.35 ± 0.355
TyrAsn: 5.904 ± 1.72
TyrPro: 1.35 ± 0.407
TyrGln: 2.362 ± 0.601
TyrArg: 2.868 ± 0.514
TyrSer: 2.868 ± 0.235
TyrThr: 6.41 ± 0.672
TyrVal: 3.205 ± 0.651
TyrTrp: 0.337 ± 0.496
TyrTyr: 3.205 ± 0.55
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5929 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
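One way to read this note is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. The function names, the use of the population standard deviation as the spread measure, and the fixed random seed are assumptions of this example, not a description of the exact procedure used for this page.

```python
import random
import statistics

def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide over a list of one-letter sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency under protein-level resampling.

    Each round draws len(sequences) proteins with replacement, so a protein
    can appear several times or not at all in a given round.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

# Toy input (hypothetical sequences, not the Dianke virus proteome).
toy = ["MKLLA", "ALLK", "LLLLM"]
print(pair_frequency(toy, "LL"), bootstrap_error(toy, "LL"))
```

With only a few proteins, as here, the bootstrap samples differ mostly in which proteins are duplicated or left out, so the error mainly reflects how much the dipeptide content varies between proteins.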

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski