Amino acid dipeptide frequency for Avian leukosis virus RSA (RSV-SRA) (Rous sarcoma virus (strain Schmidt-Ruppin A))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
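As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of non-standard letters to `X` (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues (one-letter codes); "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins
        # (an assumption of this sketch).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ALPHABET, repeat=2):
        pair = a + b
        freqs[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return freqs


# Toy input, not taken from the proteome described here.
toy = ["MAAL", "ALLB"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["AL"], 1))  # "AL" occurs in 2 of the 6 pairs
```

Returning all 441 keys, including pairs that never occur, keeps the result aligned with a full 21 x 21 table such as the one on this page.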

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
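The comparison against the uniform expectation can be sketched in a few lines. The example assumes that the threshold is exactly 1/400 (or 1/441 when Xaa is included) and that values are given in per mille as in the table; the function names are hypothetical, and the page's actual colouring rule may differ.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(freq_per_mille, n_residue_types=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(n_residue_types)
    if freq_per_mille > expected:
        return "over"   # would be marked in red
    if freq_per_mille < expected:
        return "under"  # would be marked in blue
    return "expected"


print(expected_per_mille(20))  # 2.5 per mille, i.e. 0.25%
print(classify(15.172))        # LeuLeu value from the table
print(classify(0.0))           # TrpTrp value from the table
```

With 21 residue types the expectation drops to 1000/441, roughly 2.27 per mille, so the choice of alphabet slightly shifts which dipeptides count as overrepresented.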

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
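As a worked example of the unit conversion, the AlaAla value from the table (9.894) can be converted to a fraction and a percentage. The second line expresses the 0.25% uniform expectation in per mille, so that it can be compared directly with the table values.

```latex
9.894\,\text{\textperthousand} = 9.894 \times 10^{-3} = 0.009894 = 0.9894\,\%
\\
0.25\,\% = 2.5 \times 10^{-3} = 2.5\,\text{\textperthousand}
```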

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.894 ± 2.18
AlaCys: 1.979 ± 0.422
AlaAsp: 4.288 ± 0.66
AlaGlu: 3.958 ± 1.206
AlaPhe: 2.968 ± 0.925
AlaGly: 4.288 ± 0.715
AlaHis: 0.989 ± 0.956
AlaIle: 3.958 ± 0.6
AlaLys: 3.298 ± 1.223
AlaLeu: 10.224 ± 2.893
AlaMet: 5.277 ± 1.897
AlaAsn: 3.298 ± 0.456
AlaPro: 7.256 ± 1.212
AlaGln: 2.309 ± 0.445
AlaArg: 6.596 ± 1.039
AlaSer: 5.937 ± 0.668
AlaThr: 5.607 ± 0.795
AlaVal: 7.916 ± 1.301
AlaTrp: 2.309 ± 0.389
AlaTyr: 0.66 ± 0.301
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.66 ± 0.257
CysCys: 0.66 ± 0.826
CysAsp: 0.66 ± 0.257
CysGlu: 0.989 ± 0.697
CysPhe: 1.979 ± 0.378
CysGly: 3.958 ± 2.003
CysHis: 0.0 ± 0.0
CysIle: 0.989 ± 0.872
CysLys: 0.66 ± 0.257
CysLeu: 2.639 ± 1.748
CysMet: 0.0 ± 0.0
CysAsn: 1.319 ± 0.515
CysPro: 0.989 ± 0.359
CysGln: 0.989 ± 0.359
CysArg: 0.66 ± 0.257
CysSer: 0.989 ± 0.697
CysThr: 0.989 ± 1.239
CysVal: 0.33 ± 0.413
CysTrp: 0.66 ± 0.301
CysTyr: 1.649 ± 0.364
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.979 ± 0.718
AspCys: 1.649 ± 0.595
AspAsp: 0.989 ± 0.697
AspGlu: 1.649 ± 0.557
AspPhe: 2.309 ± 1.337
AspGly: 3.958 ± 1.436
AspHis: 0.989 ± 0.288
AspIle: 2.968 ± 0.619
AspLys: 1.649 ± 0.688
AspLeu: 3.628 ± 1.03
AspMet: 0.989 ± 0.359
AspAsn: 0.989 ± 0.697
AspPro: 2.639 ± 0.356
AspGln: 0.989 ± 0.359
AspArg: 3.298 ± 0.731
AspSer: 2.309 ± 0.804
AspThr: 2.968 ± 0.679
AspVal: 2.639 ± 0.711
AspTrp: 2.309 ± 0.843
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.277 ± 0.813
GluCys: 0.66 ± 0.257
GluAsp: 2.309 ± 0.555
GluGlu: 2.968 ± 1.095
GluPhe: 0.0 ± 0.0
GluGly: 5.607 ± 1.5
GluHis: 0.989 ± 0.292
GluIle: 3.298 ± 0.574
GluLys: 2.968 ± 0.685
GluLeu: 2.968 ± 0.619
GluMet: 0.33 ± 0.947
GluAsn: 0.33 ± 0.191
GluPro: 4.617 ± 0.573
GluGln: 5.277 ± 1.09
GluArg: 5.277 ± 1.434
GluSer: 2.639 ± 2.011
GluThr: 1.319 ± 0.515
GluVal: 5.277 ± 1.038
GluTrp: 1.319 ± 0.385
GluTyr: 0.66 ± 0.826
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.979 ± 0.378
PheCys: 0.33 ± 0.413
PheAsp: 0.33 ± 0.191
PheGlu: 0.33 ± 0.191
PhePhe: 0.66 ± 0.382
PheGly: 0.989 ± 0.288
PheHis: 0.0 ± 0.0
PheIle: 0.33 ± 0.191
PheLys: 0.33 ± 0.413
PheLeu: 1.319 ± 1.652
PheMet: 0.66 ± 0.301
PheAsn: 0.33 ± 0.413
PhePro: 0.66 ± 0.257
PheGln: 1.319 ± 0.601
PheArg: 1.979 ± 0.324
PheSer: 1.319 ± 0.601
PheThr: 3.628 ± 1.117
PheVal: 1.319 ± 0.515
PheTrp: 0.989 ± 0.292
PheTyr: 0.33 ± 0.191
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.617 ± 1.217
GlyCys: 1.649 ± 1.071
GlyAsp: 1.979 ± 0.928
GlyGlu: 7.256 ± 1.709
GlyPhe: 1.649 ± 0.535
GlyGly: 6.266 ± 1.904
GlyHis: 2.639 ± 0.558
GlyIle: 5.277 ± 0.992
GlyLys: 4.617 ± 1.186
GlyLeu: 10.224 ± 2.026
GlyMet: 1.979 ± 0.324
GlyAsn: 3.958 ± 0.939
GlyPro: 7.586 ± 1.387
GlyGln: 5.277 ± 1.001
GlyArg: 5.607 ± 3.262
GlySer: 9.235 ± 1.034
GlyThr: 2.309 ± 0.952
GlyVal: 5.937 ± 1.616
GlyTrp: 1.319 ± 0.892
GlyTyr: 1.649 ± 0.99
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.989 ± 0.292
HisCys: 0.33 ± 0.191
HisAsp: 1.319 ± 0.515
HisGlu: 0.33 ± 0.413
HisPhe: 0.33 ± 0.191
HisGly: 1.649 ± 0.525
HisHis: 0.33 ± 0.191
HisIle: 0.66 ± 0.382
HisLys: 0.989 ± 0.359
HisLeu: 1.979 ± 2.193
HisMet: 0.0 ± 0.0
HisAsn: 0.989 ± 0.292
HisPro: 2.968 ± 0.681
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.66 ± 0.301
HisThr: 1.649 ± 0.557
HisVal: 0.989 ± 0.288
HisTrp: 0.66 ± 0.382
HisTyr: 0.989 ± 0.359
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.288 ± 0.898
IleCys: 0.66 ± 0.301
IleAsp: 1.979 ± 0.324
IleGlu: 2.309 ± 0.389
IlePhe: 0.989 ± 0.697
IleGly: 3.958 ± 1.324
IleHis: 0.66 ± 0.257
IleIle: 2.309 ± 0.558
IleLys: 3.628 ± 0.865
IleLeu: 4.947 ± 1.243
IleMet: 0.0 ± 0.0
IleAsn: 1.319 ± 0.668
IlePro: 3.298 ± 0.718
IleGln: 2.639 ± 0.356
IleArg: 3.958 ± 0.667
IleSer: 4.617 ± 0.474
IleThr: 5.937 ± 1.212
IleVal: 1.319 ± 0.601
IleTrp: 0.66 ± 0.382
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.288 ± 0.792
LysCys: 0.0 ± 0.0
LysAsp: 2.968 ± 0.568
LysGlu: 2.968 ± 0.359
LysPhe: 0.989 ± 0.292
LysGly: 4.288 ± 0.598
LysHis: 0.989 ± 0.288
LysIle: 2.968 ± 0.679
LysLys: 2.968 ± 1.186
LysLeu: 3.298 ± 0.664
LysMet: 2.639 ± 0.519
LysAsn: 0.33 ± 0.191
LysPro: 2.309 ± 0.851
LysGln: 2.639 ± 0.676
LysArg: 1.319 ± 0.178
LysSer: 3.298 ± 0.805
LysThr: 3.958 ± 0.649
LysVal: 2.639 ± 0.449
LysTrp: 1.649 ± 2.0
LysTyr: 0.66 ± 0.257
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.894 ± 0.834
LeuCys: 1.649 ± 0.364
LeuAsp: 3.298 ± 0.456
LeuGlu: 4.947 ± 1.771
LeuPhe: 1.979 ± 0.422
LeuGly: 10.224 ± 0.977
LeuHis: 2.309 ± 1.335
LeuIle: 3.628 ± 0.611
LeuLys: 5.277 ± 0.435
LeuLeu: 15.172 ± 4.554
LeuMet: 3.628 ± 0.791
LeuAsn: 0.66 ± 0.301
LeuPro: 6.596 ± 1.785
LeuGln: 5.277 ± 0.713
LeuArg: 5.937 ± 0.668
LeuSer: 4.617 ± 0.871
LeuThr: 5.937 ± 0.777
LeuVal: 6.596 ± 0.862
LeuTrp: 2.309 ± 0.392
LeuTyr: 1.319 ± 0.514
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.958 ± 1.437
MetCys: 0.33 ± 0.413
MetAsp: 0.989 ± 0.359
MetGlu: 2.639 ± 0.798
MetPhe: 0.0 ± 0.0
MetGly: 1.649 ± 0.226
MetHis: 0.0 ± 0.0
MetIle: 0.989 ± 0.292
MetLys: 0.989 ± 0.288
MetLeu: 1.649 ± 0.226
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.66 ± 0.257
MetGln: 0.33 ± 0.191
MetArg: 1.649 ± 0.869
MetSer: 1.979 ± 0.771
MetThr: 0.989 ± 0.359
MetVal: 2.639 ± 0.948
MetTrp: 0.66 ± 1.051
MetTyr: 0.66 ± 0.301
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.649 ± 0.525
AsnCys: 1.649 ± 1.071
AsnAsp: 0.33 ± 0.191
AsnGlu: 0.989 ± 0.288
AsnPhe: 0.0 ± 0.0
AsnGly: 0.989 ± 0.359
AsnHis: 0.33 ± 0.191
AsnIle: 1.319 ± 1.105
AsnLys: 0.66 ± 0.301
AsnLeu: 3.628 ± 1.076
AsnMet: 0.33 ± 0.413
AsnAsn: 0.66 ± 0.301
AsnPro: 1.649 ± 0.595
AsnGln: 1.649 ± 0.807
AsnArg: 3.298 ± 0.456
AsnSer: 2.639 ± 1.109
AsnThr: 1.319 ± 0.515
AsnVal: 0.989 ± 0.288
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.33 ± 0.413
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.586 ± 1.825
ProCys: 0.989 ± 0.697
ProAsp: 1.979 ± 0.378
ProGlu: 2.968 ± 0.666
ProPhe: 1.649 ± 0.99
ProGly: 8.575 ± 1.225
ProHis: 1.649 ± 0.807
ProIle: 2.309 ± 0.445
ProLys: 2.968 ± 0.502
ProLeu: 9.235 ± 2.411
ProMet: 0.66 ± 0.684
ProAsn: 0.66 ± 0.301
ProPro: 4.947 ± 1.368
ProGln: 2.639 ± 1.654
ProArg: 4.947 ± 1.427
ProSer: 6.926 ± 1.166
ProThr: 3.628 ± 0.885
ProVal: 6.266 ± 1.791
ProTrp: 2.309 ± 0.723
ProTyr: 1.319 ± 0.514
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.947 ± 0.897
GlnCys: 2.309 ± 0.558
GlnAsp: 0.33 ± 0.191
GlnGlu: 1.649 ± 0.557
GlnPhe: 0.0 ± 0.0
GlnGly: 9.235 ± 1.4
GlnHis: 0.66 ± 0.382
GlnIle: 1.649 ± 0.525
GlnLys: 3.628 ± 0.486
GlnLeu: 5.607 ± 1.31
GlnMet: 0.0 ± 0.0
GlnAsn: 0.33 ± 0.413
GlnPro: 3.958 ± 0.697
GlnGln: 2.639 ± 0.953
GlnArg: 3.628 ± 1.13
GlnSer: 0.989 ± 1.239
GlnThr: 2.639 ± 0.356
GlnVal: 1.649 ± 0.807
GlnTrp: 0.989 ± 0.572
GlnTyr: 0.66 ± 0.301
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.926 ± 0.63
ArgCys: 1.319 ± 0.178
ArgAsp: 4.947 ± 1.86
ArgGlu: 6.926 ± 1.754
ArgPhe: 0.989 ± 0.697
ArgGly: 5.277 ± 1.394
ArgHis: 1.649 ± 1.071
ArgIle: 2.968 ± 0.359
ArgLys: 3.628 ± 1.731
ArgLeu: 5.277 ± 1.001
ArgMet: 0.66 ± 0.382
ArgAsn: 0.66 ± 1.895
ArgPro: 6.266 ± 1.216
ArgGln: 1.979 ± 0.584
ArgArg: 3.958 ± 2.528
ArgSer: 4.617 ± 3.385
ArgThr: 2.309 ± 0.445
ArgVal: 3.628 ± 0.721
ArgTrp: 1.319 ± 0.944
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.596 ± 1.379
SerCys: 1.649 ± 0.226
SerAsp: 3.628 ± 0.933
SerGlu: 4.617 ± 1.079
SerPhe: 1.649 ± 1.071
SerGly: 5.277 ± 0.921
SerHis: 1.649 ± 0.226
SerIle: 2.639 ± 1.207
SerLys: 2.968 ± 1.095
SerLeu: 5.277 ± 0.369
SerMet: 0.989 ± 0.316
SerAsn: 0.33 ± 0.191
SerPro: 6.596 ± 1.197
SerGln: 3.958 ± 1.631
SerArg: 3.628 ± 2.063
SerSer: 5.607 ± 0.33
SerThr: 5.937 ± 0.944
SerVal: 1.979 ± 0.378
SerTrp: 1.979 ± 0.324
SerTyr: 0.66 ± 0.301
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.586 ± 1.413
ThrCys: 1.979 ± 0.422
ThrAsp: 4.947 ± 0.677
ThrGlu: 2.639 ± 0.356
ThrPhe: 0.66 ± 0.382
ThrGly: 5.277 ± 1.589
ThrHis: 0.989 ± 0.906
ThrIle: 1.649 ± 0.226
ThrLys: 2.639 ± 0.771
ThrLeu: 3.298 ± 0.718
ThrMet: 1.649 ± 0.364
ThrAsn: 3.298 ± 0.805
ThrPro: 4.617 ± 1.217
ThrGln: 2.639 ± 0.449
ThrArg: 2.968 ± 1.059
ThrSer: 4.288 ± 1.726
ThrThr: 3.298 ± 0.574
ThrVal: 3.958 ± 0.581
ThrTrp: 0.989 ± 0.292
ThrTyr: 1.319 ± 0.178
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.266 ± 1.528
ValCys: 0.989 ± 1.286
ValAsp: 1.979 ± 0.422
ValGlu: 1.979 ± 0.718
ValPhe: 0.33 ± 0.191
ValGly: 5.937 ± 0.973
ValHis: 0.66 ± 0.826
ValIle: 7.586 ± 1.605
ValLys: 1.979 ± 0.378
ValLeu: 7.256 ± 1.452
ValMet: 1.649 ± 0.416
ValAsn: 3.298 ± 1.494
ValPro: 1.979 ± 0.705
ValGln: 2.968 ± 0.925
ValArg: 3.628 ± 0.721
ValSer: 3.628 ± 1.076
ValThr: 4.947 ± 0.65
ValVal: 5.277 ± 0.435
ValTrp: 1.319 ± 0.944
ValTyr: 0.66 ± 0.257
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.309 ± 0.445
TrpCys: 0.0 ± 0.0
TrpAsp: 1.649 ± 0.226
TrpGlu: 1.649 ± 1.803
TrpPhe: 0.0 ± 0.0
TrpGly: 3.298 ± 0.697
TrpHis: 0.0 ± 0.0
TrpIle: 0.989 ± 0.288
TrpLys: 0.66 ± 0.382
TrpLeu: 2.639 ± 0.738
TrpMet: 0.66 ± 0.257
TrpAsn: 1.319 ± 0.385
TrpPro: 3.628 ± 1.693
TrpGln: 0.66 ± 0.301
TrpArg: 1.649 ± 1.803
TrpSer: 0.66 ± 0.826
TrpThr: 0.66 ± 0.257
TrpVal: 0.989 ± 0.697
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.33 ± 0.413
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.319 ± 0.515
TyrCys: 0.989 ± 0.292
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.33 ± 0.191
TyrPhe: 0.0 ± 0.0
TyrGly: 0.33 ± 0.413
TyrHis: 0.33 ± 0.413
TyrIle: 0.989 ± 0.292
TyrLys: 0.66 ± 0.301
TyrLeu: 0.989 ± 0.697
TyrMet: 0.33 ± 0.191
TyrAsn: 0.33 ± 0.413
TyrPro: 1.319 ± 0.178
TyrGln: 1.319 ± 0.514
TyrArg: 1.319 ± 1.105
TyrSer: 0.66 ± 0.257
TyrThr: 0.66 ± 0.257
TyrVal: 1.979 ± 0.324
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3033 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
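One way to read this note is that whole proteins are resampled with replacement 100 times and the spread of the resulting frequencies is reported after the ± sign. The sketch below follows that reading. It is an assumption-laden illustration, not the database's procedure: the helper names, the use of one-letter codes, the fixed seed, and the choice of the population standard deviation as the error measure are all hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled protein sets (assumed: pstdev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences, not taken from the proteome described here.
toy = ["MAAL", "ALLL", "GGGG"]
print(bootstrap_error(toy, "LL"))
```

Resampling at the protein level rather than the residue level means that a dipeptide concentrated in a single protein gets a large error, which may explain why, with only 4 proteins, some errors in the table exceed the values themselves.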

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski