Amino acid dipeptide frequency for Suakwa aphid-borne yellows virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
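As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent residue pairs in one-letter protein sequences and pools them over a proteome. The function names are hypothetical, and counting pairs only within each protein is an assumption; this page does not state how protein boundaries are handled.

```python
from collections import Counter


def dipeptide_counts(sequence):
    """Count overlapping adjacent residue pairs in a one-letter sequence."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))


def dipeptide_frequencies_permille(sequences):
    """Pool dipeptide counts over all sequences; return per mille frequencies.

    Assumption: pairs are counted within each sequence only, so no pair
    spans the boundary between two proteins.
    """
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n_pairs = sum(total.values())
    if n_pairs == 0:
        return {}
    return {pair: 1000.0 * count / n_pairs for pair, count in total.items()}
```

For example, `dipeptide_frequencies_permille(["AAC", "AC"])` pools three pairs (one `AA`, two `AC`), so `AC` accounts for two thirds of them.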

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, or 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
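The uniform expectation and the over/under comparison can be sketched as follows. The helper names are hypothetical, and using the 20-residue baseline (2.5‰) as the threshold is an assumption; the page does not say whether the colouring uses 400 or 441 possibilities.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(value_permille, expected):
    """Label a table value relative to the uniform expectation."""
    if value_permille > expected:
        return "over"   # would be marked red
    if value_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


baseline = expected_permille()        # 20 x 20 = 400 possibilities
print(classify(12.741, baseline))     # SerSer value from the table
print(classify(0.0, baseline))        # AlaHis value from the table
```

With 21 residue types (including Xaa) the same function gives 1000/441, roughly 2.27‰.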

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
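The unit conversion is simple arithmetic; a minimal sketch with hypothetical helper names:

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0
```

For instance, the SerSer entry of 12.741‰ corresponds to a fraction of about 0.012741, or about 1.27%.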

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.96 ± 0.72
AlaCys: 2.287 ± 0.873
AlaAsp: 3.267 ± 0.694
AlaGlu: 5.88 ± 1.005
AlaPhe: 3.594 ± 0.698
AlaGly: 4.247 ± 0.425
AlaHis: 0.0 ± 0.0
AlaIle: 4.574 ± 0.837
AlaLys: 2.94 ± 1.461
AlaLeu: 7.841 ± 1.882
AlaMet: 0.98 ± 0.507
AlaAsn: 1.307 ± 0.584
AlaPro: 6.534 ± 0.746
AlaGln: 3.92 ± 0.676
AlaArg: 6.861 ± 1.458
AlaSer: 5.554 ± 1.269
AlaThr: 3.594 ± 0.448
AlaVal: 2.94 ± 0.633
AlaTrp: 1.633 ± 0.863
AlaTyr: 2.287 ± 0.4
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.98 ± 0.223
CysCys: 0.98 ± 0.459
CysAsp: 1.633 ± 0.292
CysGlu: 1.633 ± 0.292
CysPhe: 0.327 ± 0.36
CysGly: 2.94 ± 1.06
CysHis: 0.0 ± 0.0
CysIle: 0.98 ± 0.687
CysLys: 2.287 ± 0.429
CysLeu: 2.94 ± 0.906
CysMet: 0.327 ± 0.259
CysAsn: 0.98 ± 0.223
CysPro: 0.327 ± 0.207
CysGln: 0.653 ± 0.457
CysArg: 0.0 ± 0.0
CysSer: 0.98 ± 0.621
CysThr: 0.0 ± 0.0
CysVal: 0.653 ± 0.72
CysTrp: 0.98 ± 0.353
CysTyr: 0.653 ± 0.24
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.92 ± 0.405
AspCys: 1.307 ± 0.402
AspAsp: 4.574 ± 1.193
AspGlu: 2.287 ± 0.462
AspPhe: 3.594 ± 0.77
AspGly: 4.247 ± 0.978
AspHis: 0.653 ± 0.467
AspIle: 1.307 ± 0.479
AspLys: 1.633 ± 0.972
AspLeu: 5.88 ± 1.261
AspMet: 0.653 ± 0.414
AspAsn: 2.287 ± 0.4
AspPro: 1.96 ± 0.552
AspGln: 1.633 ± 0.35
AspArg: 1.96 ± 0.612
AspSer: 2.614 ± 1.09
AspThr: 1.633 ± 0.697
AspVal: 0.98 ± 0.223
AspTrp: 2.94 ± 0.769
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.9 ± 0.482
GluCys: 0.98 ± 0.59
GluAsp: 3.594 ± 0.753
GluGlu: 5.554 ± 1.157
GluPhe: 2.614 ± 0.703
GluGly: 2.94 ± 0.746
GluHis: 0.98 ± 0.447
GluIle: 2.614 ± 0.914
GluLys: 2.287 ± 0.6
GluLeu: 5.227 ± 0.638
GluMet: 0.653 ± 0.24
GluAsn: 1.633 ± 0.567
GluPro: 4.9 ± 0.851
GluGln: 2.94 ± 0.72
GluArg: 3.267 ± 0.746
GluSer: 5.227 ± 1.152
GluThr: 3.92 ± 0.868
GluVal: 4.247 ± 0.685
GluTrp: 2.287 ± 0.602
GluTyr: 3.594 ± 0.893
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.653 ± 0.72
PheCys: 1.633 ± 0.567
PheAsp: 1.96 ± 0.442
PheGlu: 2.287 ± 0.428
PhePhe: 3.92 ± 1.438
PheGly: 3.92 ± 0.801
PheHis: 1.633 ± 0.384
PheIle: 3.92 ± 0.728
PheLys: 2.287 ± 0.796
PheLeu: 2.94 ± 1.121
PheMet: 0.0 ± 0.0
PheAsn: 1.633 ± 0.514
PhePro: 1.633 ± 0.52
PheGln: 2.287 ± 0.666
PheArg: 4.247 ± 0.716
PheSer: 2.614 ± 0.722
PheThr: 1.307 ± 0.521
PheVal: 3.594 ± 0.595
PheTrp: 1.633 ± 0.65
PheTyr: 0.653 ± 0.24
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.227 ± 1.392
GlyCys: 0.98 ± 0.223
GlyAsp: 3.594 ± 0.572
GlyGlu: 5.554 ± 0.425
GlyPhe: 3.267 ± 0.943
GlyGly: 4.247 ± 1.438
GlyHis: 1.633 ± 0.292
GlyIle: 1.96 ± 0.933
GlyLys: 3.594 ± 0.798
GlyLeu: 5.227 ± 0.809
GlyMet: 1.96 ± 0.982
GlyAsn: 3.92 ± 1.351
GlyPro: 3.267 ± 0.428
GlyGln: 0.327 ± 0.207
GlyArg: 5.554 ± 1.457
GlySer: 8.494 ± 2.026
GlyThr: 4.574 ± 0.437
GlyVal: 4.574 ± 0.469
GlyTrp: 1.633 ± 0.375
GlyTyr: 2.287 ± 0.404
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.633 ± 0.514
HisCys: 0.98 ± 0.423
HisAsp: 0.98 ± 0.617
HisGlu: 1.633 ± 0.506
HisPhe: 0.0 ± 0.0
HisGly: 0.98 ± 0.533
HisHis: 0.0 ± 0.0
HisIle: 1.307 ± 0.42
HisLys: 0.98 ± 0.355
HisLeu: 0.98 ± 0.223
HisMet: 1.307 ± 0.479
HisAsn: 0.327 ± 0.207
HisPro: 1.633 ± 0.292
HisGln: 0.327 ± 0.259
HisArg: 0.653 ± 0.24
HisSer: 3.267 ± 1.291
HisThr: 1.307 ± 0.479
HisVal: 1.633 ± 0.428
HisTrp: 0.653 ± 0.24
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.594 ± 1.047
IleCys: 0.327 ± 0.207
IleAsp: 3.594 ± 0.691
IleGlu: 0.653 ± 0.414
IlePhe: 3.267 ± 1.112
IleGly: 0.98 ± 0.355
IleHis: 0.98 ± 0.353
IleIle: 0.98 ± 0.353
IleLys: 3.594 ± 0.687
IleLeu: 6.861 ± 1.406
IleMet: 0.327 ± 0.207
IleAsn: 2.614 ± 1.068
IlePro: 3.267 ± 0.694
IleGln: 2.614 ± 0.589
IleArg: 3.594 ± 1.236
IleSer: 6.207 ± 1.128
IleThr: 4.574 ± 1.091
IleVal: 0.653 ± 0.258
IleTrp: 0.0 ± 0.0
IleTyr: 0.653 ± 0.258
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.574 ± 0.985
LysCys: 0.327 ± 0.207
LysAsp: 1.633 ± 0.564
LysGlu: 2.614 ± 0.799
LysPhe: 2.287 ± 1.244
LysGly: 4.9 ± 1.424
LysHis: 2.94 ± 0.517
LysIle: 2.614 ± 0.583
LysLys: 0.653 ± 0.24
LysLeu: 4.247 ± 0.78
LysMet: 1.96 ± 0.42
LysAsn: 3.594 ± 1.063
LysPro: 1.96 ± 0.436
LysGln: 0.327 ± 0.259
LysArg: 1.307 ± 0.326
LysSer: 6.861 ± 0.9
LysThr: 2.287 ± 0.743
LysVal: 1.96 ± 0.615
LysTrp: 0.653 ± 0.258
LysTyr: 0.653 ± 0.334
LysXaa: 0.327 ± 0.259
Leu
LeuAla: 5.227 ± 1.034
LeuCys: 2.94 ± 0.881
LeuAsp: 3.92 ± 0.544
LeuGlu: 9.147 ± 1.807
LeuPhe: 2.94 ± 0.926
LeuGly: 7.841 ± 1.033
LeuHis: 1.633 ± 0.824
LeuIle: 4.574 ± 0.631
LeuLys: 3.267 ± 1.537
LeuLeu: 8.167 ± 1.868
LeuMet: 0.327 ± 0.444
LeuAsn: 3.267 ± 0.658
LeuPro: 6.534 ± 1.237
LeuGln: 2.94 ± 0.592
LeuArg: 4.247 ± 1.586
LeuSer: 8.167 ± 1.065
LeuThr: 6.861 ± 0.649
LeuVal: 3.594 ± 1.399
LeuTrp: 1.633 ± 0.514
LeuTyr: 2.94 ± 0.547
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.98 ± 0.447
MetCys: 0.0 ± 0.0
MetAsp: 0.653 ± 0.24
MetGlu: 1.633 ± 0.754
MetPhe: 0.0 ± 0.0
MetGly: 0.98 ± 0.423
MetHis: 0.327 ± 0.259
MetIle: 0.0 ± 0.0
MetLys: 2.287 ± 0.929
MetLeu: 1.633 ± 0.719
MetMet: 0.0 ± 0.0
MetAsn: 1.96 ± 0.504
MetPro: 0.327 ± 0.259
MetGln: 1.307 ± 0.521
MetArg: 0.0 ± 0.0
MetSer: 2.287 ± 1.009
MetThr: 0.98 ± 0.474
MetVal: 1.633 ± 0.479
MetTrp: 0.0 ± 0.0
MetTyr: 0.327 ± 0.259
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.94 ± 0.835
AsnCys: 0.327 ± 0.36
AsnAsp: 0.98 ± 0.474
AsnGlu: 2.287 ± 0.547
AsnPhe: 2.94 ± 1.03
AsnGly: 5.227 ± 1.546
AsnHis: 0.327 ± 0.259
AsnIle: 2.614 ± 0.589
AsnLys: 1.633 ± 0.35
AsnLeu: 3.594 ± 0.262
AsnMet: 0.98 ± 0.463
AsnAsn: 2.287 ± 0.666
AsnPro: 3.267 ± 0.724
AsnGln: 1.633 ± 0.514
AsnArg: 2.94 ± 0.855
AsnSer: 3.594 ± 0.498
AsnThr: 3.594 ± 0.595
AsnVal: 1.307 ± 0.43
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.96 ± 1.227
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.554 ± 0.87
ProCys: 0.653 ± 0.414
ProAsp: 0.98 ± 0.355
ProGlu: 4.574 ± 0.988
ProPhe: 1.307 ± 0.479
ProGly: 5.88 ± 1.103
ProHis: 2.287 ± 0.425
ProIle: 2.287 ± 0.42
ProLys: 2.287 ± 0.595
ProLeu: 4.247 ± 1.652
ProMet: 0.327 ± 0.259
ProAsn: 0.653 ± 0.334
ProPro: 5.227 ± 3.161
ProGln: 2.287 ± 1.04
ProArg: 3.92 ± 1.372
ProSer: 7.514 ± 1.129
ProThr: 4.574 ± 0.488
ProVal: 3.594 ± 0.679
ProTrp: 0.0 ± 0.0
ProTyr: 1.307 ± 0.402
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.92 ± 0.606
GlnCys: 0.653 ± 0.414
GlnAsp: 0.98 ± 0.374
GlnGlu: 0.653 ± 0.357
GlnPhe: 1.633 ± 0.474
GlnGly: 1.633 ± 0.474
GlnHis: 0.98 ± 0.603
GlnIle: 0.98 ± 0.531
GlnLys: 3.267 ± 0.533
GlnLeu: 1.307 ± 0.271
GlnMet: 0.98 ± 0.473
GlnAsn: 2.94 ± 0.477
GlnPro: 0.98 ± 0.374
GlnGln: 0.327 ± 0.207
GlnArg: 2.94 ± 1.437
GlnSer: 2.287 ± 1.089
GlnThr: 1.96 ± 0.665
GlnVal: 1.96 ± 0.644
GlnTrp: 0.327 ± 0.444
GlnTyr: 0.653 ± 0.24
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.88 ± 0.829
ArgCys: 2.287 ± 0.4
ArgAsp: 0.98 ± 0.223
ArgGlu: 3.92 ± 0.774
ArgPhe: 0.653 ± 0.258
ArgGly: 5.554 ± 1.394
ArgHis: 1.307 ± 0.527
ArgIle: 4.574 ± 0.53
ArgLys: 2.614 ± 0.803
ArgLeu: 6.534 ± 1.714
ArgMet: 0.98 ± 0.405
ArgAsn: 5.88 ± 2.201
ArgPro: 2.94 ± 1.297
ArgGln: 2.287 ± 0.428
ArgArg: 9.801 ± 3.735
ArgSer: 1.96 ± 0.585
ArgThr: 3.267 ± 0.813
ArgVal: 3.594 ± 0.487
ArgTrp: 0.98 ± 0.355
ArgTyr: 1.96 ± 0.708
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.88 ± 1.117
SerCys: 1.307 ± 0.714
SerAsp: 3.594 ± 0.715
SerGlu: 2.94 ± 0.403
SerPhe: 4.247 ± 0.555
SerGly: 6.861 ± 1.517
SerHis: 0.98 ± 0.353
SerIle: 5.88 ± 1.004
SerLys: 3.92 ± 0.606
SerLeu: 6.534 ± 0.856
SerMet: 1.96 ± 0.675
SerAsn: 2.94 ± 0.626
SerPro: 5.227 ± 1.257
SerGln: 2.287 ± 1.268
SerArg: 7.187 ± 1.684
SerSer: 12.741 ± 3.156
SerThr: 4.9 ± 1.394
SerVal: 5.88 ± 1.898
SerTrp: 2.287 ± 0.904
SerTyr: 4.9 ± 0.287
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.227 ± 0.844
ThrCys: 0.653 ± 0.24
ThrAsp: 3.267 ± 0.584
ThrGlu: 3.267 ± 0.709
ThrPhe: 3.92 ± 0.805
ThrGly: 3.267 ± 0.863
ThrHis: 1.307 ± 0.694
ThrIle: 2.94 ± 0.42
ThrLys: 2.614 ± 0.459
ThrLeu: 5.227 ± 1.544
ThrMet: 0.653 ± 0.467
ThrAsn: 1.96 ± 0.442
ThrPro: 2.94 ± 0.988
ThrGln: 0.0 ± 0.0
ThrArg: 2.94 ± 0.737
ThrSer: 4.9 ± 1.519
ThrThr: 2.94 ± 0.857
ThrVal: 2.94 ± 1.114
ThrTrp: 2.287 ± 0.429
ThrTyr: 1.96 ± 0.395
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.9 ± 1.036
ValCys: 0.653 ± 0.24
ValAsp: 3.92 ± 0.994
ValGlu: 3.594 ± 0.732
ValPhe: 1.633 ± 0.35
ValGly: 1.96 ± 0.442
ValHis: 0.327 ± 0.207
ValIle: 2.94 ± 0.856
ValLys: 3.92 ± 0.439
ValLeu: 5.554 ± 1.313
ValMet: 2.287 ± 0.4
ValAsn: 1.307 ± 0.554
ValPro: 3.92 ± 1.36
ValGln: 2.614 ± 0.722
ValArg: 3.594 ± 0.461
ValSer: 3.267 ± 0.799
ValThr: 0.653 ± 0.24
ValVal: 6.861 ± 1.85
ValTrp: 0.0 ± 0.0
ValTyr: 0.98 ± 0.59
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.614 ± 0.759
TrpCys: 0.0 ± 0.0
TrpAsp: 0.653 ± 0.414
TrpGlu: 2.614 ± 0.545
TrpPhe: 0.327 ± 0.36
TrpGly: 0.98 ± 0.474
TrpHis: 1.633 ± 0.35
TrpIle: 0.98 ± 0.223
TrpLys: 1.307 ± 0.479
TrpLeu: 1.96 ± 0.775
TrpMet: 0.327 ± 0.207
TrpAsn: 0.653 ± 0.467
TrpPro: 0.98 ± 0.353
TrpGln: 0.0 ± 0.0
TrpArg: 0.98 ± 0.389
TrpSer: 1.96 ± 0.955
TrpThr: 0.98 ± 0.447
TrpVal: 0.653 ± 0.24
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.327 ± 0.259
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.633 ± 0.292
TyrCys: 0.98 ± 0.223
TyrAsp: 1.307 ± 0.271
TyrGlu: 1.633 ± 0.564
TyrPhe: 2.287 ± 0.613
TyrGly: 2.287 ± 0.547
TyrHis: 0.653 ± 0.457
TyrIle: 1.307 ± 0.667
TyrLys: 1.633 ± 0.856
TyrLeu: 3.267 ± 1.268
TyrMet: 0.0 ± 0.0
TyrAsn: 2.287 ± 0.6
TyrPro: 1.633 ± 0.396
TyrGln: 0.653 ± 0.467
TyrArg: 1.96 ± 0.447
TyrSer: 1.96 ± 0.866
TyrThr: 1.307 ± 0.42
TyrVal: 1.307 ± 0.43
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.327 ± 0.207
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.327 ± 0.259
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3062 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
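A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the sample standard deviation as the error are assumptions; the page states only that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency_permille(sequences, pair):
    """Per mille frequency of one dipeptide over pooled overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the sample standard deviation over replicates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency_permille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a small proteome such as this one (6 proteins) can show comparatively large errors.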

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski