Amino acid dipeptide frequency for Human T-cell leukemia virus 3 (strain Pyl43) (HTLV-3)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
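As a rough illustration of such a calculation, the sketch below counts overlapping residue pairs and normalises them to per mille. The function name `dipeptide_frequencies`, the one-letter toy sequences, and the choice to count pairs only within each protein are assumptions made for this example; they are not taken from the Proteome-pI code, which may handle sequence boundaries differently.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Pairs are counted with a sliding window of width 2 inside each sequence,
    so no pair spans two different proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter codes; these are not real HTLV-3 sequences.
freqs = dipeptide_frequencies(["MALLP", "LLPP"])
```

Here the two toy sequences contain seven pairs in total, so every frequency is a multiple of 1000/7 per mille.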

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
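The arithmetic behind the expected value can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the plain comparison against the uniform expectation is an assumption: the page does not state the exact rule used for the red and blue marking.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"

print(expected_per_mille(20))            # 2.5, i.e. 0.25%
print(round(expected_per_mille(21), 3))  # 2.268 when Xaa is included
print(classify(14.986))                  # ProPro value from the table: over
print(classify(0.288))                   # CysAla value from the table: under
```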

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 3.746‰ corresponds to 0.003746).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.746 ± 0.483
AlaCys: 1.153 ± 0.363
AlaAsp: 3.458 ± 1.02
AlaGlu: 1.441 ± 0.363
AlaPhe: 3.17 ± 0.792
AlaGly: 3.746 ± 0.616
AlaHis: 1.441 ± 0.363
AlaIle: 8.069 ± 1.157
AlaLys: 1.441 ± 0.246
AlaLeu: 9.51 ± 1.925
AlaMet: 0.576 ± 0.555
AlaAsn: 2.305 ± 0.645
AlaPro: 8.069 ± 1.629
AlaGln: 3.746 ± 0.626
AlaArg: 3.458 ± 0.741
AlaSer: 4.899 ± 0.7
AlaThr: 3.746 ± 0.667
AlaVal: 2.305 ± 0.285
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.594 ± 0.876
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.288 ± 0.371
CysCys: 0.288 ± 0.371
CysAsp: 0.288 ± 0.175
CysGlu: 0.865 ± 0.347
CysPhe: 1.729 ± 0.622
CysGly: 1.441 ± 0.596
CysHis: 0.576 ± 0.299
CysIle: 0.288 ± 0.175
CysLys: 1.729 ± 0.286
CysLeu: 2.594 ± 0.65
CysMet: 0.288 ± 0.371
CysAsn: 0.576 ± 0.344
CysPro: 4.611 ± 0.872
CysGln: 4.323 ± 0.893
CysArg: 0.576 ± 0.35
CysSer: 2.882 ± 0.727
CysThr: 0.576 ± 0.743
CysVal: 1.441 ± 0.785
CysTrp: 0.288 ± 0.449
CysTyr: 0.288 ± 0.371
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.729 ± 0.418
AspCys: 2.594 ± 0.655
AspAsp: 1.441 ± 0.363
AspGlu: 0.288 ± 0.314
AspPhe: 0.865 ± 0.347
AspGly: 0.865 ± 0.588
AspHis: 0.865 ± 0.399
AspIle: 1.153 ± 0.333
AspLys: 1.729 ± 0.58
AspLeu: 7.781 ± 1.241
AspMet: 0.288 ± 0.285
AspAsn: 1.729 ± 0.286
AspPro: 7.205 ± 1.118
AspGln: 2.017 ± 0.638
AspArg: 0.865 ± 0.823
AspSer: 1.153 ± 0.333
AspThr: 3.17 ± 1.152
AspVal: 0.865 ± 0.399
AspTrp: 0.288 ± 0.314
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.17 ± 0.658
GluCys: 0.865 ± 0.347
GluAsp: 1.153 ± 0.605
GluGlu: 1.441 ± 0.632
GluPhe: 1.153 ± 0.363
GluGly: 1.441 ± 0.363
GluHis: 1.729 ± 0.525
GluIle: 1.153 ± 0.401
GluLys: 0.865 ± 0.415
GluLeu: 2.305 ± 0.856
GluMet: 0.865 ± 0.347
GluAsn: 0.865 ± 0.415
GluPro: 2.594 ± 0.427
GluGln: 1.441 ± 0.882
GluArg: 2.594 ± 0.496
GluSer: 0.865 ± 0.5
GluThr: 4.323 ± 0.855
GluVal: 2.017 ± 0.561
GluTrp: 0.0 ± 0.0
GluTyr: 1.153 ± 0.401
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.288 ± 0.175
PheCys: 1.441 ± 0.363
PheAsp: 1.729 ± 0.695
PheGlu: 0.576 ± 0.299
PhePhe: 0.576 ± 0.299
PheGly: 1.153 ± 0.651
PheHis: 2.017 ± 0.568
PheIle: 1.441 ± 0.487
PheLys: 0.865 ± 0.361
PheLeu: 5.187 ± 0.841
PheMet: 0.865 ± 0.347
PheAsn: 0.288 ± 0.314
PhePro: 3.458 ± 1.302
PheGln: 2.305 ± 1.027
PheArg: 1.729 ± 0.525
PheSer: 4.323 ± 1.734
PheThr: 0.865 ± 0.415
PheVal: 1.153 ± 0.557
PheTrp: 0.288 ± 0.371
PheTyr: 0.288 ± 0.371
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.187 ± 0.883
GlyCys: 0.576 ± 0.482
GlyAsp: 0.576 ± 0.299
GlyGlu: 2.017 ± 0.57
GlyPhe: 0.865 ± 0.5
GlyGly: 4.035 ± 0.744
GlyHis: 1.729 ± 0.58
GlyIle: 2.017 ± 0.486
GlyLys: 2.017 ± 0.406
GlyLeu: 8.357 ± 1.521
GlyMet: 0.288 ± 0.168
GlyAsn: 1.441 ± 0.246
GlyPro: 7.205 ± 0.821
GlyGln: 3.458 ± 0.905
GlyArg: 2.017 ± 0.519
GlySer: 4.323 ± 0.76
GlyThr: 2.882 ± 0.789
GlyVal: 1.153 ± 0.333
GlyTrp: 0.288 ± 0.371
GlyTyr: 2.305 ± 0.574
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.882 ± 0.816
HisCys: 0.576 ± 0.344
HisAsp: 1.441 ± 0.246
HisGlu: 0.865 ± 0.347
HisPhe: 0.865 ± 0.415
HisGly: 1.729 ± 0.58
HisHis: 2.594 ± 0.8
HisIle: 1.441 ± 0.418
HisLys: 0.576 ± 0.482
HisLeu: 3.746 ± 0.966
HisMet: 0.0 ± 0.0
HisAsn: 0.576 ± 0.35
HisPro: 2.305 ± 0.608
HisGln: 3.458 ± 0.913
HisArg: 2.017 ± 0.195
HisSer: 1.441 ± 0.487
HisThr: 1.729 ± 0.752
HisVal: 3.17 ± 0.386
HisTrp: 3.17 ± 0.828
HisTyr: 0.576 ± 0.35
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.305 ± 0.999
IleCys: 0.865 ± 0.399
IleAsp: 2.017 ± 0.464
IleGlu: 0.576 ± 0.35
IlePhe: 1.729 ± 0.897
IleGly: 1.441 ± 0.4
IleHis: 2.594 ± 0.893
IleIle: 1.729 ± 0.752
IleLys: 2.017 ± 0.688
IleLeu: 10.086 ± 1.49
IleMet: 0.0 ± 0.0
IleAsn: 1.729 ± 0.58
IlePro: 5.476 ± 0.737
IleGln: 3.746 ± 0.699
IleArg: 0.865 ± 0.482
IleSer: 4.035 ± 1.189
IleThr: 3.458 ± 0.646
IleVal: 2.017 ± 0.506
IleTrp: 1.441 ± 0.621
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.458 ± 0.328
LysCys: 0.288 ± 0.371
LysAsp: 4.035 ± 1.156
LysGlu: 2.594 ± 0.659
LysPhe: 1.729 ± 0.573
LysGly: 2.305 ± 0.645
LysHis: 0.865 ± 0.376
LysIle: 2.305 ± 0.295
LysLys: 1.441 ± 0.363
LysLeu: 2.594 ± 0.823
LysMet: 0.0 ± 0.0
LysAsn: 3.458 ± 0.866
LysPro: 2.305 ± 0.518
LysGln: 4.035 ± 1.228
LysArg: 1.441 ± 0.694
LysSer: 2.017 ± 1.061
LysThr: 4.611 ± 1.342
LysVal: 1.153 ± 0.598
LysTrp: 0.865 ± 0.525
LysTyr: 1.153 ± 0.512
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.663 ± 1.504
LeuCys: 2.882 ± 0.621
LeuAsp: 4.035 ± 0.702
LeuGlu: 3.458 ± 0.446
LeuPhe: 3.746 ± 1.782
LeuGly: 6.34 ± 1.31
LeuHis: 6.916 ± 0.787
LeuIle: 8.069 ± 0.877
LeuLys: 4.035 ± 0.236
LeuLeu: 12.968 ± 1.731
LeuMet: 1.441 ± 0.245
LeuAsn: 6.052 ± 1.143
LeuPro: 13.833 ± 2.662
LeuGln: 10.951 ± 1.459
LeuArg: 9.222 ± 1.313
LeuSer: 5.764 ± 1.916
LeuThr: 5.476 ± 1.499
LeuVal: 3.17 ± 0.591
LeuTrp: 1.729 ± 0.58
LeuTyr: 3.17 ± 1.426
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.576 ± 0.299
MetCys: 0.0 ± 0.0
MetAsp: 0.576 ± 0.428
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 1.153 ± 0.316
MetHis: 0.0 ± 0.0
MetIle: 1.153 ± 0.401
MetLys: 0.865 ± 0.347
MetLeu: 1.441 ± 0.451
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.576 ± 0.898
MetGln: 0.865 ± 0.347
MetArg: 0.0 ± 0.0
MetSer: 0.576 ± 0.299
MetThr: 0.288 ± 0.314
MetVal: 0.288 ± 0.371
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.729 ± 0.58
AsnCys: 1.153 ± 0.651
AsnAsp: 0.576 ± 0.278
AsnGlu: 0.576 ± 0.299
AsnPhe: 1.153 ± 0.316
AsnGly: 1.729 ± 0.58
AsnHis: 1.729 ± 0.799
AsnIle: 2.017 ± 1.001
AsnLys: 1.729 ± 0.695
AsnLeu: 2.882 ± 0.62
AsnMet: 0.0 ± 0.0
AsnAsn: 2.017 ± 0.195
AsnPro: 6.052 ± 1.206
AsnGln: 2.305 ± 0.777
AsnArg: 0.576 ± 0.407
AsnSer: 2.305 ± 0.48
AsnThr: 2.882 ± 0.758
AsnVal: 2.017 ± 0.568
AsnTrp: 0.288 ± 0.371
AsnTyr: 2.305 ± 0.632
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.916 ± 0.821
ProCys: 5.187 ± 1.207
ProAsp: 2.305 ± 0.899
ProGlu: 5.187 ± 1.227
ProPhe: 3.17 ± 0.571
ProGly: 7.493 ± 0.993
ProHis: 2.882 ± 0.484
ProIle: 5.764 ± 0.752
ProLys: 6.628 ± 2.121
ProLeu: 9.798 ± 1.123
ProMet: 1.441 ± 0.681
ProAsn: 3.17 ± 0.892
ProPro: 14.986 ± 3.034
ProGln: 6.34 ± 1.02
ProArg: 6.052 ± 0.975
ProSer: 9.798 ± 2.989
ProThr: 5.187 ± 1.515
ProVal: 7.493 ± 1.534
ProTrp: 3.458 ± 0.699
ProTyr: 3.17 ± 1.093
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 9.222 ± 1.353
GlnCys: 2.305 ± 0.497
GlnAsp: 3.458 ± 0.586
GlnGlu: 2.882 ± 0.782
GlnPhe: 2.882 ± 0.807
GlnGly: 4.611 ± 1.092
GlnHis: 1.153 ± 0.687
GlnIle: 2.017 ± 0.702
GlnLys: 3.17 ± 0.798
GlnLeu: 5.764 ± 0.826
GlnMet: 0.865 ± 0.347
GlnAsn: 1.729 ± 0.238
GlnPro: 8.069 ± 1.511
GlnGln: 4.899 ± 1.377
GlnArg: 1.441 ± 0.899
GlnSer: 2.882 ± 0.947
GlnThr: 4.323 ± 0.741
GlnVal: 3.17 ± 0.977
GlnTrp: 1.153 ± 0.333
GlnTyr: 2.305 ± 0.667
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.17 ± 0.73
ArgCys: 1.441 ± 0.596
ArgAsp: 3.17 ± 0.99
ArgGlu: 3.458 ± 0.746
ArgPhe: 1.153 ± 0.855
ArgGly: 3.17 ± 0.386
ArgHis: 0.288 ± 0.314
ArgIle: 0.0 ± 0.0
ArgLys: 3.458 ± 0.658
ArgLeu: 6.916 ± 0.7
ArgMet: 0.288 ± 0.175
ArgAsn: 0.865 ± 0.5
ArgPro: 6.34 ± 1.456
ArgGln: 0.865 ± 0.328
ArgArg: 2.882 ± 0.759
ArgSer: 3.17 ± 0.445
ArgThr: 1.729 ± 0.601
ArgVal: 3.17 ± 0.595
ArgTrp: 0.576 ± 0.35
ArgTyr: 0.576 ± 0.35
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.764 ± 1.313
SerCys: 2.017 ± 0.76
SerAsp: 2.594 ± 1.322
SerGlu: 2.594 ± 0.579
SerPhe: 3.746 ± 1.231
SerGly: 2.594 ± 0.641
SerHis: 2.594 ± 0.7
SerIle: 3.17 ± 1.194
SerLys: 3.17 ± 1.152
SerLeu: 11.816 ± 2.922
SerMet: 0.576 ± 0.428
SerAsn: 3.746 ± 1.052
SerPro: 8.934 ± 2.081
SerGln: 3.17 ± 1.135
SerArg: 3.458 ± 0.89
SerSer: 11.816 ± 3.032
SerThr: 4.611 ± 2.511
SerVal: 3.17 ± 0.473
SerTrp: 0.865 ± 0.694
SerTyr: 0.576 ± 0.743
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.882 ± 0.762
ThrCys: 1.153 ± 0.417
ThrAsp: 1.729 ± 0.815
ThrGlu: 0.0 ± 0.0
ThrPhe: 0.865 ± 0.783
ThrGly: 4.611 ± 1.459
ThrHis: 2.305 ± 0.518
ThrIle: 2.882 ± 0.827
ThrLys: 3.458 ± 0.472
ThrLeu: 6.628 ± 1.164
ThrMet: 0.0 ± 0.0
ThrAsn: 3.17 ± 0.643
ThrPro: 8.934 ± 2.283
ThrGln: 2.594 ± 0.425
ThrArg: 2.305 ± 0.905
ThrSer: 4.899 ± 1.382
ThrThr: 2.305 ± 1.125
ThrVal: 2.882 ± 0.762
ThrTrp: 2.594 ± 0.675
ThrTyr: 2.017 ± 0.406
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.458 ± 0.807
ValCys: 0.576 ± 0.482
ValAsp: 0.865 ± 0.694
ValGlu: 2.017 ± 0.688
ValPhe: 0.865 ± 0.415
ValGly: 1.153 ± 0.333
ValHis: 1.729 ± 0.573
ValIle: 2.594 ± 1.066
ValLys: 1.153 ± 0.417
ValLeu: 6.052 ± 1.122
ValMet: 0.0 ± 0.28
ValAsn: 0.865 ± 0.347
ValPro: 2.017 ± 0.941
ValGln: 4.899 ± 1.299
ValArg: 2.305 ± 0.725
ValSer: 6.34 ± 1.026
ValThr: 2.017 ± 0.575
ValVal: 2.017 ± 0.882
ValTrp: 1.729 ± 0.286
ValTyr: 1.153 ± 0.717
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.153 ± 0.316
TrpCys: 0.288 ± 0.314
TrpAsp: 0.865 ± 0.415
TrpGlu: 0.576 ± 0.549
TrpPhe: 0.0 ± 0.0
TrpGly: 0.865 ± 0.415
TrpHis: 0.576 ± 0.482
TrpIle: 0.576 ± 0.344
TrpLys: 1.153 ± 0.316
TrpLeu: 3.458 ± 0.746
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.441 ± 0.876
TrpGln: 1.441 ± 0.451
TrpArg: 1.441 ± 0.653
TrpSer: 1.729 ± 0.286
TrpThr: 2.594 ± 0.424
TrpVal: 0.576 ± 0.35
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.288 ± 0.175
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.288 ± 0.371
TyrCys: 0.288 ± 0.175
TyrAsp: 0.576 ± 0.743
TyrGlu: 0.576 ± 0.344
TyrPhe: 0.576 ± 0.344
TyrGly: 0.865 ± 0.5
TyrHis: 0.576 ± 0.743
TyrIle: 0.288 ± 0.175
TyrLys: 1.153 ± 0.505
TyrLeu: 4.035 ± 0.701
TyrMet: 0.288 ± 0.175
TyrAsn: 1.441 ± 0.522
TyrPro: 2.017 ± 0.682
TyrGln: 1.441 ± 0.363
TyrArg: 1.441 ± 0.502
TyrSer: 5.764 ± 1.461
TyrThr: 1.441 ± 1.193
TyrVal: 0.865 ± 0.415
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.153 ± 0.512
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3471 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
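A protein-level bootstrap of this kind might be sketched as below. This is only an illustration under stated assumptions: the helper names `pair_freqs` and `bootstrap_error` are hypothetical, the error is taken to be the population standard deviation over the resamples, and each resample draws as many proteins (with replacement) as the proteome contains. The page does not say which of these choices Proteome-pI actually makes.

```python
import random
import statistics
from collections import Counter

def pair_freqs(sequences):
    """Dipeptide frequencies in per mille, counted within each sequence."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_freqs(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)
```

Because whole proteins are resampled, a dipeptide that never occurs in any protein stays at zero in every resample, so its estimated error under this scheme is zero as well.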

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski