The Easton school district is in a financial mess. The school board is considering terrible options like a tax increase of more than 10 percent or layoffs totaling more than 10 percent of the teaching staff. This post collects a few sources of information about the budget. (Let me know if you have more links.)
The Morning Call wrote a good summary of the last school board meeting: Ax falls on Easton schools.
The school district public finance page includes the proposed budget for 2010-2011 and the budgets from the last two years. The auditor's report from 2009 is also posted there. (All documents there are pdfs.) I'm not sure how useful the auditor's report is. There is this gem from the financial highlights on page 4:
The District's overall financial position as of June 30, 2009 continued to be very strong.As of June 30, 2010, not so much.
At the PyCon sprints, we looked into a lot of bugs in the standard library caused by interactions between strings and bytes. (A string holds a sequence of characters. A bytes object holds a sequence of bytes, e.g. 0-255.) I help maintain httplib and urllib, which read raw bytes from a socket and often convert them into strings. The details of those conversions are sometimes tricky. The rules for strings and bytes changed drastically in Python 3.0. Most of the standard library was converted from old to new automatically (by 2to3), and many of the times those conversions were incorrect.
A harmless example comes from httplib where an if / elif statement had tests from strings and for unicode strings. They were both converted to test for strings by the conversion tool. The code looked like this:
if isinstance(buf, str): # regular strings
David Beazley gave a tutorial at coroutines at PyCon 2009. The slides and code are available for download. I took them home with me on the plane. I had a fun time reading the slides and studying the code. It's a remarkably clear explanation of how generators can be used as coroutines, starting with Python 2.5. He runs through a good collection of examples, winding up with a simple OS-style task scheduler for cooperative multi-tasking coroutines.
One of his concluding points was really helpful for me. I found it hard to pay a lot of attention to the evolution of PEP 342, which added coroutine support for generators. I found it confusing that yield / generators were being extended to handle different use cases. David clarifies it in a helpful way:
There are three main uses of yield
Do NOT write generator functions that try to do more than one of these at once
- Iteration (a producer of data)
- Receiving messages (a consumer)
- A trap (cooperative multitasking)
David Simon, of Homicide, The Wire, and long ago the Baltimore Sun, wrote about the increasing secrecy of the Baltimore police in Sunday's Washington Post: In Baltimore, No One Left to Press the Police. He was making a large point about the role of newspapers.
"In an American city, a police officer with the authority to take human life can now do so in the shadows, while his higher-ups can claim that this is necessary not to avoid public accountability, but to mitigate against a nonexistent wave of threats. And the last remaining daily newspaper in town no longer has the manpower, the expertise or the institutional memory to challenge any of it."Simon argues that there aren't any bloggers fighting to keep the city government honest. The laws provide access to many police records, but an individual is left with little practical recourse if the police don't obey them. (They hassle photographers taking pictures on bridges, too.) I'm sympathetic to Simon's argument, but the primary problem is public accountability not the lack of a newspaper to provide it.
Python 3000 is ready! The official release may not come until tomorrow, but Barry has tagged the source and is preparing the release. We've been waiting for this release for almost nine years. The earliest reference I can find is a message from Guido to python-dev in Jan. 2000.
In the previous century, we had been thinking about Python 2 as the Python version that would break backwards compatibility to make real language improvements. Python 2 ended up being a big deal, but without too many compatibility issues. All the major changes were deferred to "Python 3000," a mythical creature. I'm eager to see what programmers make of this creature.
I remember John Dingell from my days writing for The Tech, the campus newspaper at M.I.T. He had a reputation as a bully, who liked to hold hearings and conduct investigations on fraud and waste in science to grandstand. The two I remember best ended up being busts as far as actually uncovering waste or fraud--audits of funding by research universities and a fraud case involving a researcher in David Baltimore's lab where the all charges were eventually dismissed.
The Tech, Feb. 7, 1992
The government -- particularly Rep. John D. Dingell (D-Mich.), chairman of the Oversight andInvestigations Subcommittee -- tried to publicly embarrass MIT and other universities for alleged misuses of funds, even before formal evidence was presented before the committee.
What had originally been a matter of Imanishi-Kari's questionable research data quickly swelled into a thorny and divisive debate over the validity of scientific research.
Baltimore derided the controversy as a witch hunt and believed that some people, like U.S. Representative John Dingell (D-Mich.), were using it unreasonably to call into question government money spent on funding research.
I spent a long, busy day running the polling station for Forks Western District #1 at the Forks Community Center. It was my first presidential election as a poll worker and an interesting day. I learned a lot and continue to be dismayed about how fragile the voting process is. It's easy for individual votes to be irrevocably lost despite the best efforts of the election board. The lines and registration process make it hard for people who want to vote to actually vote, regardless of whether there votes are recorded.
Depsite all the problems, I think we had a good day. Voter turnout was about 65%, more than 1,200 people voted either in person or by absentee ballot. We had long lines, ranging from 45 minutes first thing in the morning to an hour and fifteen minutes at midday. We had no serious problems, the vast majority of people were patient and friendly.
My location was two different polling stations in the same building. This causes lots of confusion for voters, since there are two different lines and they need to be in the right line to vote. The line for our precinct was much shorter than the other line. (Some people waited almost two hours in the other line.) The two election boards worked out a deal where someone who waited in the wrong line would be taken to the front of the other line. This minimized the actual harm and kept people happy, but meant I spent a lot of the day walking back and forth between the two lines.
We had two poll watchers / observes who were form the Obama campaign. They never identified themselves as Obama supporters, but I knew one of them from the neighborhood. The other came from New York. They were incredibly helpful and were essential to keeping the lines moving smoothly. They spent the day answering questions and helping people find the right line. The staff at the community center also helped out a lot.
My biggest concern was the lost votes. The tally of voters who signed in didn't match the tally of votes recorded on the electronic voting machines. Our precinct lost 26 votes out of 1186 cast, a 2.2% loss rate. The other polling station lost 19 out of 1746 votes, a 1.1% loss rate. I didn't think to check the counters on the machine until late in the day, so I don't know what went wrong early. After we started paying extra attention, we cut the error rate to less than one percent. It was probably a combination of human factors and usability issues that caused the problems, but I can't rule out some early glitch with the machine itself. I've got more data about lost votes at the end of the post.
A Princeton report concluded that the voting machines we use were too insecure to be used in New Jersey. My experience is consistent with their complaint that: "Design flaws in the user interface of the AVC Advantage disenfranchise voters."
We had seven provisional ballots cast. In two or three of those cases, I think the people simply weren't registered correctly and the ballot with not be counted. A few others will probably work out fine. I wish there were a way for me to check.
Forks Township might need to make big changes.
My friends in California have so many ballot initiatives to chose from. On any conceivable issue, you can probably find a ballot measure to oppose. There are two on the ballot for Northampton County Pa. Short summary of this post: Vote yes on both questions.
I went for my semi-annual poll worker training session on Sunday. The normal focus of the training session is on operating the voting machines: How to prepare them in the morning, operate them, and record the votes on the flash memory to be delivered for the county count. This year there was an added emphasis on the complications of a presidential election. The most obvious difference with the typical election is that more people come. There will be lines most of the day, primarily because of the time it takes to sign people in.
We were encouraged to be patient. It will be a long day. With long lines, we were asked to keep an eye out for people campaigning inside the polling place or doing anything else inappropriate. The first line of defense here is the election board, but we were encouraged to call the Register with any questions.
Common Cause has a good summary of Pennsylvania election procedures. We were told that the county would provide lots of provisional ballots for polling places and encouraged us to use them. I was happy to hear that Pennsylvania will count a provisional ballot cast at any polling place in the county. If you show up at the wrong polling place 10 minutes before the end of voting, you can cast a provisional ballot and it will still count. My only reservation is that it's more work to cast a provisional ballot. There's more paperwork for the voter and the poll worker. If we have a lot of votes cast this way, it will slow us down.
Provisional ballots sound like a mixed bag. The county collects the provisional ballots, decides whether each one is valid, and counts the valid ones. It will probably take two or three days to count them. The obvious advantage is that anyone not allowed to vote using the regular voting machines can still cast a ballot. If there's a problem and you don't cast a ballot, you can't fix that later. I've read a lot of warnings about provisional ballots. At some level, the warnings make sense. If you can vote on the official machine, that's better. Your vote will definitely be counted--at least to the extent you can trust the machines--but I don't have any idea how they decide which provisional ballots to count and which to toss. I do have some trust in the process:
Howard Erney is the acting chief registrar for this election. I don't know him well, but he runs voter training and often visits polling stations on election day. He seems fair, honest, and conscientious.
I asked about poll watchers and about challenges to voters. The answers here were pretty thin. We had two poll watchers during the primary. One woman was tracking Republication voters--making sure people who said they would vote really came to vote. The other poll watcher came and went. They both stayed while we counted the votes at the end of the day. I did find a summary of the Pennsylvania code on poll watchers that explains things in a fair amount of detail. They can challenge someone's right to vote, but that doesn't automatically disqualify someone from voting. It just makes more paperwork for all involved.
The Pa. League of Women Voters has some more useful information for poll workers. There's a checklist for poll watchers that will work just as well as a checklist for poll workers. I found a really detailed FAQ for Pennsylvania voters on their site, but I don't know if it's current.
I had a pumpkin and some Thai basil from my neighbor Tina. I found a recipe for Thai pumpkin soup and that formed the basis for a Thai pumpkin pizza that I made tonight. It turned out well. We'll definitely make it again, so I'm recording the recipe here. I made enough for a small 8" round pizza. All the proportions are approximate.
Instructions: Take a picture of yourself right now. Don't change your clothes, don't fix your hair - just take a picture. Post that picture with NO editing. Post these instructions with the picture.
Via: Betz, Kang, Egnor. Also: Haahr.
The exercise has been going on for a few weeks in the blogosphere. See Albums Going Steady or My Favorite Album. I was born in 1972, so it wasn't until sometime in the mid 1980s that I had favorite albums that were actually released that year.
I made an svn commit on the py3k branch today--my first commit in a long while.
Make a new urllib package .
It consists of code from urllib, urllib2, urlparse, and robotparser.
The old modules have all been removed. The new package has five
submodules: urllib.parse, urllib.request, urllib.response,
urllib.error, and urllib.robotparser. The urllib.request.urlopen()
function uses the url opener from urllib2.
Note that the unittests have not been renamed for the
beta, but they will be renamed in the future.
Joint work with Senthil Kumaran.
I won a local election a few years ago by a single vote--my own. It was fun to receive the official certification, which has a form to fill out with name and vote total. The form includes the word "votes" rather than "vote(s)," which would have been more appropriate in my case.
I missed PyCon this year. It was the first time I skipped a Python conference since I started going in 1997. I've been reading reports from bloggers eagerly. One of the larger discussions is about the sponsor lightning talks, which weren't well received. Ted Leung commented that we should thank the Perl folks for the lightning talks, but I think they have a long history at Python conferences.
I ran the first session of short talks at the Python conference in 1997: "Content-wise, anything goes, as long as it is deemed interesting for a room full of Python users and developers, and as long as the content is technical, not marketing material."
I have a distinct recollection of seeing a short talk by Christopher Small at a Usenix conference that inspired our short talks session. He had just a few minutes for the talk and something like 17 slides. He had someone turn the slides for him at regular intervals and he tried to keep up. It was really entertaining. (I would have guessed it was at the 1996 OSDI WIP session, but he's not on the list of speakers.)
The first speaker at our short talks session with Scott Deerwester, and he agreed to use the same presentation style as Small. I turned his slides for him, whether he was ready or not. He was a good sport and I think it set a light tone for the rest of the session.
We didn't have short talks the following year and didn't have a Python conference at all in 1999. David Beazley ran the short talks session in 2000. (Andrew Kuchling noted the short talks in his conference summary.) The Python folks started calling them lightning talks in 2001 at the 9th Python conference. David Ascher ran a session called lightning talks at OSCON in 2003, too.
I think the name lightning talk is due to Mark-Jason Dominus. We exchanged some email about the Python short talks long ago, when he was planning the first lightning talks session at a Perl conference. I've never seen a Perl lightning talk session, so I don't know how they compare to the Python short talks / lightning talks. I think they deserve a lot of credit for popularizing the idea.
We've always tried to have fun with the lightning talks. It's a good place for outrageous opinions or quick demos. We've gotten less and less formal about it, too. The first few session required you to sign up in advance and had a full schedule. Later, we let people sign up on the spot and made up the schedule as we went along.
The lightning talks have become one of the most popular sessions at the recent PyCons. They've been scheduled with nothing opposite them (like keynotes, unlike regular talks), and there have been several sessions of them.
He may argue the point, but I think he is wrong.
Update: mreid agrees. kang agrees.
I found this love letter to libraries via ResearchBuzz. I've only been to a few of these libraries myself:
Paul Graham recently posted An Alternative Theory of Unions that compares union workers with servers--in particular with the notion that Internt Bubble companies overpaid for servers the way manufacturing companies overpaid works in the 1950s. He picks car manufacturers as a specific example:
If you looked in the head of a 1950s auto executive, the attitude must have been: sure, give 'em whatever they ask for, so long as the new model isn't delayed.I don't agree with the argument in general, and I think this particular suggestion is just plain wrong. Robert McNamara who was responsible for financial controls and planning for the Ford Motor Company in the 1950s, ultimately becoming president in 1960. McNamara was well aware of the cost of every part of producing a car and worked hard to keep costs down at Ford.
Under Henry [Ford] senior and Harry Bennett the policies of the company were singularly primitive. The public was a problem, the unions were a problem, the bankers were a problem.Update: Some of the comments on Paul's article at Marginal Revolution make similar points.
There's no reason to post about armadillos today. Many years ago, I read an entertaining article in the Washington Post about armadillos. I remembered that the armadillos could be expected to survive in Maryland and Pennsylvania, but I didn't remember how slow the northward migration was. I foolishly hoped to see an armadillo in my backyard one day.
Tara and I found the original article in Google's news archive last night. It says
In another century or so, they could reach the White House as they continue their northward migration.It will be a long time until we see them in Easton.
My original flight from Dallas back to Newark was cancelled because of mechanical problems with the plane. I got bumped from a 10:30 flight to a 2:45 flight, which left me a few hours for hacking and socializing at the PyCon sprints this morning. I had breakfast with Jeff Elkner and caught up on his software and teaching projects, and I got to finish some bug fixes in typeobject.c that I had worked on yesterday. I find it very satisfying to fix an Armin Rigo crash bug. It's usually an accomplishment just to understand it.
I'm going to love Python 3000! Thomas Wouters and I implemented PEP 3104 tonight. It fixes a wart in the original nested scopes implementation that I did in 2001. In that version of nested scopes, we did not allow names defined in one function to be rebound in an enclosing function. It was impossible for the compiler to distinguish between an assignment that creates a local and an assignment that rebinds. Python 3000 will fix this using the nonlocal statement. I hate the name nonlocal, but no one has thought of a better name. Ka-Ping Yee wrote PEP 3104 and provided an exhaustive list of alternate names,
The code itself was quite simple. The only changes in compile.c were trivial. The symbol table needed more changes, because it had to recognize a new kind of declaration and propagate that information to the compiler. The symbol table uses a bit-field where a handful of the bits are used to represent the scope. I spent a lot of time scratching my head until I remember that I needed to increase the width of mask used to extract the scope-related bits. I think I'm not happy with the bit-field representation.
Pete Shinners and Neal Norwitz reviewed code and helped think of tests cases.
That wraps up a fun day of sprinting for me. I also closed several bugs and spent a few hours pouring over typeobject.c to fix some crasher bugs that Armin Rigo reported. I have a fix for one of them, but I want to do a little refactoring before I check it in. Next week, perhaps. I'm flying home first thing in the morning.
Mike Verdone found a great Python bug today. It's an interaction between two features.
I've been struggling with some odd corner cases of free variables and eval/exec in Python.
There is a long-standing bug report that you can't access free variables when you created a nested code block with eval or exec--for example, putting a lambda in an expression passed to eval. It works if the variable happens to be free in the text of the function containing the eval, but only then. The bug report claimed that Scheme worked differently, but I don't think Scheme provides a way to capture an arbitrary environment to pass to eval; it only provides access to top-level environments.
The name locals() is misleading. It returns free variables as well as local variables. The name suggests that this behavior is wrong, but it is necessary to make exec and eval work with free variables. The function returns the names visible in the current scope.
The discussion here is actually about much more than locals(). The same basic issues arise when you use exec or import * in a block or when you use the debugger or some other trace function installed via sys.settrace(). In all these cases, we extract variables in a dictionary to support introspection.
There is still a problem with locals(), which results from the use of locals() in debugging. The implementation makes sure that changes made to the locals dict are reflected in the running program in many cases. In old versions of Python, local variables were stored in a dictionary and locals() returned the actual dictionary. This feature is useful in the debugger. When the implementation changed to use a simple C array instead of a dict, the interpreter arranged to copy variables back and forth between the dictionary and the array when the debugger was used. (This behavior is also needed to make features like exec and import * work.)
In many cases, it is fine to copy free variables into the dictionary returned by locals. If changes are made to those variables, they can be written back to the appropriate part of the closure rather than creating local variables that shadow the free variables.
Class namespaces pose a serious problem in the current CPython implementation. The class stores local variables in a dictionary. When the body of the class finishes execution, this dictionary is passed to the class constructor. Keys in the dictionary become attributes of the class. If you call locals(), free variables could be copied into the dictionary for access in the debugger or introspection. But if they are copied, they will become attributes of the class, which was not intended. It's a messy problem, because it is possible, though inscrutable, for
the same name to be used for a free variable in a method and a class attribute; they have the same name, but they refer to different bindings.
For now, we are fixing this in Python 2.x by omitting free variables from the dictionary returned by locals() when locals() is called in a class block. This makes introspection more difficult but prevents locals() from polluting the class namespace.
What are some other solutions for the class problem? It might be possible to return a copy of the class dictionary with the free variables added. Changes to this dictionary are written back to the real dictionary or free variables when you run in the debugger. It wouldn't be possible to reflect all changes to the dictionary; it would only work in contexts like debugging or using exec. I'm not sure if that would be too confusing.
We could also make two different functions, where locals() and vars() seem like reasonable names. The locals() could return the actual dictionary object for classes, without adding the free variables. The vars() could return a dictionary will all variables, but it would be a copy. Then client code could get whatever they wanted.